Extended nominal coreference and bridging anaphora (An approach to annotation of Czech data in the Prague Dependency Treebank)

Publication at Faculty of Mathematics and Physics |

2011

Abstract

This book aims to present one of the possible models of processing extended textual coreference and bridging anaphora in a large textual corpora, which we then use for annotation of certain relations in texts of the Prague Dependency Treebank (PDT). We compare our annotation scheme to the existing ones with respect to the language to which the scheme is applied.

We identify the annotation principles and demonstrate their application to the large-scale annotation of Czech texts. We further present our classification of coreferential relations and bridging relations types and discuss some problematic aspects in this area.

An automatic pre-annotation and some helpful features of the annotation tool, such as maintaining coreferential chain, underlining candidates for antecedents, etc. are presented and discussed. Statistical evaluation is performed on the already annotated part of the Prague Dependency Treebank.

We also present the first results of the inter-annotator agreement measurement and explain the

Keywords

extended nominal coreference bridging anaphora approach annotation czech data prague dependency treebank