Charles Explorer logo
🇬🇧

Semi-supervised Induction of Morpheme Boundaries in Czech Using a Word-Formation Network

Publication at Faculty of Mathematics and Physics |
2020

Abstract

This paper deals with automatic morphological segmentation of Czech lemmas contained in the word-formation network DeriNet. Capturing derivational relations between base and derived lemmas, and segmenting lemmas into sequences of morphemes are two closely related formal models of how words come into existence.

Thus we propose a novel segmentation method that benefits from the existence of the network; our solution constitutes new state-of-the-art for the Czech language.