Stop-probability estimates computed on a large corpus improve Unsupervised Dependency Parsing

Publication at Faculty of Mathematics and Physics

2013

Abstract

Although the quality of unsupervised dependency parsers keeps improving, they often fail to recognize very basic dependencies. In this paper, we exploit prior knowledge of STOP-probabilities (whether a given word has any children in a given direction), which is obtained from a large raw corpus using the reducibility principle.

By incorporating this knowledge into the Dependency Model with Valence, we considerably outperform the state-of-the-art results in terms of average attachment score over 20 treebanks from the CoNLL 2006 and 2007 shared tasks.
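
To make the notion of a STOP-probability concrete, the following minimal sketch counts, for each part-of-speech tag and direction, how often a token has no children on that side. It is an illustration of the definition only: it reads the counts off two hand-made toy trees, whereas the paper estimates these values from a raw corpus via the reducibility principle, which is not reproduced here. The function name `stop_probabilities`, the `(tag, head_index)` input format, and the toy sentences are all assumptions made for this example, and the sketch ignores the adjacency conditioning used in the Dependency Model with Valence.

```python
from collections import Counter


def stop_probabilities(trees):
    """Estimate P(no children | tag, direction) from head-annotated sentences.

    `trees` is a list of sentences; each sentence is a list of
    (tag, head_index) pairs, where head_index is the 0-based position of
    the token's head, or -1 for the root. This input format is a
    hypothetical one chosen for the example.
    """
    total = Counter()      # occurrences of each (tag, direction)
    childless = Counter()  # occurrences with no child in that direction

    for sentence in trees:
        # Record which positions have at least one child on each side.
        has_child = set()
        for pos, (_, head) in enumerate(sentence):
            if head >= 0:
                direction = "left" if pos < head else "right"
                has_child.add((head, direction))

        for pos, (tag, _) in enumerate(sentence):
            for direction in ("left", "right"):
                total[(tag, direction)] += 1
                if (pos, direction) not in has_child:
                    childless[(tag, direction)] += 1

    return {key: childless[key] / total[key] for key in total}


# Toy data (invented): "the dog barks" and "dogs bark loudly".
toy_trees = [
    [("DT", 1), ("NN", 2), ("VB", -1)],
    [("NN", 1), ("VB", -1), ("RB", 1)],
]
probs = stop_probabilities(toy_trees)
```

On this toy input, determiners and adverbs never take children, so their STOP-probability is 1.0 in both directions, while verbs always have a left child, giving a left STOP-probability of 0.0.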

Keywords

stop probability estimates computed large corpus improve unsupervised dependency parsing