Charles Explorer logo
🇬🇧

Dependency Grammars and Treebanks

Class at Faculty of Mathematics and Physics |
NPFX075

Syllabus

Trees as a data structure (dependency trees, phrase-based trees, word order, projectivity)

Dependency and non-dependency relations in natural languages

Family of Prague Dependency Treebanks - introduction and principles; Functional Generative

Description as their theoretical basis

Universal Dependencies - introduction and principles

Stratificational approach to natural language description: morphology and its annotation in dependency treebanks

(surface) syntax and its annotation in dependency treebanks

(deep) syntax and its annotation in dependency treebanks

Annotation of selected deep syntactic phenomena

Annotation schemata, data formats

Tools (TrEd, PML-TQ, Udapi)

Annotation

The goal of the course is to introduce a dependency-based description of natural languages, principles of dependency-based grammar formalisms and their application in morphologically and syntactically annotated corpora. The course will focus on the Prague

Dependency Treebank project and on the Universal Dependencies project. The emphasis is also placed on annotation schemata and the data format, on familiarization with useful tools and practical work with treebanks. The course is designed for students with the computer science background as well as for linguists with some CS experience.