
Building a Data Repository of Spontaneous Spoken Czech

Publication at Filozofická fakulta (Faculty of Arts) | 2014

Abstract

The paper presents a data repository of spontaneous spoken Czech, its design principles, and the practical solutions adopted during data collection. The repository is designed to represent contemporary spontaneous spoken language as used in informal, real-life situations across the entire Czech Republic.

Therefore, it features manual annotation and broad regional coverage with a wide variety of speakers. The repository contains both the audio recordings and their transcriptions, which are manually aligned with the audio by time stamps.