Charles Explorer logo
🇬🇧

Tackling Sparse Data Issue in Machine Translation Evaluation

Publication at Faculty of Mathematics and Physics |
2010

Abstract

We illustrate and explain problems of n-grams-based machine translation (MT) metrics (e.g. BLEU) when applied to morphologically rich languages such as Czech.

A novel metric SemPOS based on the deep-syntactic representation of the sentence tackles the issue and retains the performance for translation to English as well.