Representing Layered and Structured Data in the CoNLL-ST Format

Publication at Faculty of Mathematics and Physics | 2010

Abstract

In this paper, we investigate the CoNLL Shared Task format, its properties, and the possibility of using it for complex annotations. We argue that, perhaps despite its original intent, it has become one of the most important current formats for syntactically annotated data.

We show the limits of the CoNLL-ST data format in its current form and propose several simple enhancements that push those limits further and make the format more robust and future-proof. We analyse several linguistic annotations of varying complexity as examples and show how they can be stored efficiently in the CoNLL-ST format.