Identification of Idioms in Spoken Corpora

Publication at Faculty of Arts | 2013

Abstract

This paper focuses on the automatic identification of idioms in transcripts of spoken discourse included in the ČNK spoken corpora (PMK, ORAL2006 and ORAL2008). In PMK, the idioms were searched for and identified manually, so the efficiency of automatic and manual identification can be compared and the advantages and disadvantages of both approaches described.