Charles Explorer logo
🇬🇧

Extending Coverage of a Lexicon of Discourse Connectives Using Annotation Projection

Publication at Faculty of Mathematics and Physics |
2021

Abstract

We present a method for extending coverage of the Lexicon of Czech Discourse Connectives - CzeDLex - using annotation projection. We take advantage of two language resources: (i) the Penn Discourse Treebank 3.0 as a source of manually annotated discourse relations in English, and (ii) the Prague Czech-English Dependency Treebank 2.0 as a translation of the English texts to Czech and a link between tokens on the two language sides.

Although CzeDLex was originally extracted from a large Czech corpus, the presented method resulted in an addition of a number of new connectives and new types of usages (discourse types) for already present entries in the lexicon.