Trees as a data structure (dependency trees, phrase-based trees, word order, projectivity)
Dependency and non-dependency relations in natural languages
Family of Prague Dependency Treebanks - introduction and principles; Functional Generative Description as their theoretical basis
Universal Dependencies - introduction and principles
Stratificational approach to natural language description:
  morphology and its annotation in dependency treebanks
  (surface) syntax and its annotation in dependency treebanks
  (deep) syntax and its annotation in dependency treebanks
Annotation of selected deep syntactic phenomena
Annotation schemata, data formats
Tools (TrEd, PML-TQ, Udapi)
The goal of the course is to introduce a dependency-based description of natural languages, principles of dependency-based grammar formalisms and their application in morphologically and syntactically annotated corpora. The course will focus on the Prague Dependency Treebank project and on the Universal Dependencies project.
Emphasis is also placed on annotation schemata and data formats, on practical work with treebanks, and on useful tools. The course is designed for students with a computer science background as well as for linguists with some computer science experience.
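As a small illustration of the "trees as a data structure" and projectivity topics, the following sketch stores a dependency tree as a list of head indices (the same idea as the HEAD column in the CoNLL-U format used by Universal Dependencies) and tests whether the tree is projective. The function name `is_projective` and both example sentences are hypothetical and chosen only for illustration; the head indices do not follow the annotation guidelines of any particular treebank, and the code assumes the input is a well-formed tree without cycles.

```python
def is_projective(heads):
    """Return True if the dependency tree is projective.

    heads[i] is the 1-based position of the head of token i+1;
    0 marks the root. Assumes a well-formed tree (no cycles).
    """

    def dominated_by(node, ancestor):
        # Follow the chain of heads upwards until the root (0) is reached.
        while node != 0:
            if node == ancestor:
                return True
            node = heads[node - 1]
        return False

    for dep in range(1, len(heads) + 1):
        head = heads[dep - 1]
        if head == 0:
            continue  # the root token has no incoming arc to check
        left, right = sorted((head, dep))
        # Every token strictly between head and dependent
        # must be a descendant of the head.
        for between in range(left + 1, right):
            if not dominated_by(between, head):
                return False
    return True


# "I saw a dog": every arc covers only descendants of its head.
projective_example = [2, 0, 4, 2]

# "A hearing is scheduled on the issue today": the arc hearing -> on
# spans "is" and "scheduled", which are not descendants of "hearing".
non_projective_example = [2, 4, 4, 0, 2, 7, 5, 4]

print(is_projective(projective_example))
print(is_projective(non_projective_example))
```

The check follows the textbook definition directly (all tokens between a head and its dependent must be dominated by that head) rather than looking for crossing arcs, which keeps the sketch short at the cost of quadratic running time in the worst case.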