Úvod – korpusová lingvistika, k čemu korpusy využívat
Český národní korpus
Slovenský národní korpus
Národní korpus polského jazyka
Národní korpus ruského jazyka
Korpusy ostatních slovanských jazyků (studentské prezentace)
Projekt
Obecně o paralelních korpusových zdrojích slovanských jazyků (IC, OPUS, PARASOL)
InterCorp
Srovnání národních tagsetů cs-pl, cs-ru, cs-sk,
Universal dependencies
Korpusový komparativní výzkum
The seminar is intended for students of Slavic languages and students of philological disciplines with an interest in Slavic languages and their investigation by means of corpora. No previous experience with corpora is expected.
During the course, students will acquire basic information about corpus linguistics, national corpora of Slavic languages and their application in practice. The seminar will primarily focus on the characteristics of the Czech, Slovak, Polish and Russian national corpus, as well as corpus resources of other Slavic languages according to students’ interest.
We will also practise working with each of the corpora we will cover. Furthermore, we will explore Slavic resources within parallel corpora in the context of comparing Slavic languages; firstly, we will familiarise ourselves with different approaches to the morphological description of particular languages (different tagsets), and secondly, we will focus on the use of corpus material in comparative linguistics.