Charles Explorer logo
🇬🇧

UFAL Parallel Corpus of North Levantine 1.0

Publication

Abstract

This is the first release of the UFAL Parallel Corpus of North Levantine, compiled by the Institute of Formal and Applied Linguistics (ÚFAL) at Charles University within the Welcome project (https://welcome-h2020.eu/). The corpus consists of 120,600 multiparallel sentences in English, French, German, Greek, Spanish, and Standard Arabic selected from the OpenSubtitles2018 corpus and manually translated into the North Levantine Arabic language.

The corpus was created for the purpose of training machine translation for North Levantine and the other languages.