On segmentation of Czech sentences. The paper introduces the concept of segments: linguistically motivated, easily detectable language units.
These segments can subsequently be combined into clauses, yielding the structure of a complex sentence in terms of the mutual relationships among its clauses. The method was developed for Czech, taken as a representative of languages with a relatively high degree of word-order freedom.
The paper introduces the key terms and describes a segmentation chart. It also presents a simple set of rules and applies them to segment a small set of Czech sentences.
The segmentation results are evaluated against a small hand-annotated corpus of Czech complex sentences.
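
To make the idea of "easily detectable" segments concrete, the following is a minimal sketch of boundary-based segmentation. It is not the rule set described in the paper: the boundary inventory (punctuation marks plus a handful of Czech conjunctions), the names `BOUNDARIES`, `tokenize`, and `segment`, and the example sentence are all assumptions introduced purely for illustration. The sketch treats every maximal run of boundary tokens as one boundary and every run of other tokens as one segment.

```python
import re

# Hypothetical boundary inventory, chosen for illustration only.
# The paper's actual rules are not reproduced here.
PUNCTUATION = {",", ";", ":", ".", "!", "?", "(", ")"}
CONJUNCTIONS = {"a", "ale", "nebo", "že", "když", "protože"}
BOUNDARIES = PUNCTUATION | CONJUNCTIONS


def tokenize(sentence):
    """Split a sentence into word tokens and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def segment(sentence):
    """Return a list of (kind, tokens) units in sentence order.

    kind is "boundary" for a run of boundary tokens and "segment"
    for a run of any other tokens; adjacent tokens of the same kind
    are merged into one unit.
    """
    units = []
    for token in tokenize(sentence):
        kind = "boundary" if token.lower() in BOUNDARIES else "segment"
        if units and units[-1][0] == kind:
            units[-1][1].append(token)
        else:
            units.append((kind, [token]))
    return units


if __name__ == "__main__":
    for kind, tokens in segment("Petr řekl, že přijde, ale nepřišel."):
        print(kind, tokens)
```

Under these assumptions, the example sentence splits into three segments ("Petr řekl", "přijde", "nepřišel") separated by the boundaries ", že", ", ale", and the final full stop. Combining such segments into clauses, and evaluating the result against annotated data, would require further rules that this sketch does not attempt.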