
Czech National Corpus in 2020: Recent Developments and Future Outlook

Publication at Faculty of Arts | 2020

Abstract

The paper gives an overview of the state of implementation of the Czech National Corpus (CNC) in all the main areas of its operation: corpus compilation, annotation, application development and user services. As the focus is on recent developments, some areas are described in more detail than others.

Close attention is paid to data collection and, in particular, to the description of web application development. This is not only because CNC has recently seen significant progress in this area, but also because we believe that end-user web applications shape the way linguists and other scholars think about language data and about the range of possibilities they offer.

This consideration is even more important given the variability of the CNC corpora.