CsEnVi Pairwise Parallel Corpora is a collection of two parallel corpora: an English-Vietnamese one and a Czech-Vietnamese one. The corpora contain translations of movie and TED subtitles that were already available.
The corpora were cleaned using our semi-automatic filtering and are provided aligned at the sentence level.