On segmentation of Czech sentences

Publication at Faculty of Mathematics and Physics | 2006

Abstract

The paper introduces the concept of segments: linguistically motivated and easily detectable language units.

These segments may subsequently be combined into clauses, thereby providing the structure of a complex sentence with regard to the mutual relationships of its individual clauses. The method has been developed for Czech, as a representative of languages with a relatively high degree of word-order freedom.

The paper introduces important terms and describes a segmentation chart. It also presents a simple set of rules applied to the segmentation of a small set of Czech sentences.

The segmentation results are evaluated against a small hand-annotated corpus of Czech complex sentences.