Charles Explorer logo
🇬🇧

CoNLL 2017 Shared Task - Automatically Annotated Raw Texts and Word Embeddings

Publication

Abstract

Automatic segmentation, tokenization and morphological and syntactic annotations of raw texts in 45 languages, generated by UDPipe (http://ufal.mff.cuni.cz/udpipe), together with word embeddings of dimension 100 computed from lowercased texts by word2vec (https://code.google.com/archive/p/word2vec/).