Charles Explorer logo
🇨🇿

On the questions in developing computational infrastructure for Komi-Permyak

Publikace

Tento text není v aktuálním jazyce dostupný. Zobrazuje se verze "en".Abstrakt

There are two main written Komi varieties,

Permyak and Zyrian. These are mutuallyintelligible but derive from differentparts of the same Komi dialect continuum,representing the varieties prominent in thevicinity and in the cities of Syktyvkar and

Kudymkar, respectively. Hence, they sharea vast number of features, as well as themajority of their lexicon, yet the overlap intheir dialects is very complex. This paperevaluates the degree of difference in thesewritten varieties based on changes requiredfor computational resources in the descriptionof these languages when adapted fromthe Komi-Zyrian original. Primarily thesechanges include the FST architecture, butwe are also looking at its application to the

Universal Dependencies annotation schemein the morphologies of the two languages.