
Prague English Dependency Treebank 1.0

Publication

Abstract

This CD presents part of the Prague English Dependency Treebank (PEDT). PEDT is the manual tectogrammatical (syntactico-semantic) annotation of texts from the Wall Street Journal - Penn Treebank III.

The present CD (PEDT 1.0) comprises approx. 10,000 annotated and checked trees, which is about 20% of the original WSJ-PTB. The following components are included:

* manually annotated data, integrated valency lexicon Engvallex
* the valency lexicon Engvallex in printable form (latest revision: January 2009)
* the ready-to-install package of the tree editor/viewer TREd
* documentation
* specification of the annotation format (Prague Markup Language)