The article describes Koditex, a new representative and reference 9-million-word corpus of contemporary Czech. Koditex was designed to be as diverse as possible for the purpose of conducting a multidimensional analysis (MDA) of Czech.
At the topmost level, it is divided into three modes of communication: written language, spoken language, and web-based communication. Beyond MDA, the corpus could also be used for other kinds of linguistic analysis.