Charles Explorer logo
🇬🇧

Koditex: A corpus of diversified texts

Publication at Faculty of Arts |
2019

Abstract

The article describes a new representative and reference 9-milion-corpus corpus of contemporary Czech Koditex. Koditex was designed to be as diverse as possible for the purpose of conducting a multidimensional analysis (MDA) of Czech.

At the topmost level, it is divided into three modes of communication: written language, spoken language, and web-based communication. In addition to the purpose of MDA, it could be used in conducting other language analyses.