Treq - Translation Equivalents Database

Publication

Abstract

Treq is a collection of bilingual Czech-foreign language dictionaries, built automatically from the InterCorp parallel corpus. The first step was to align the original and translated texts word-to-word using statistical methods provided by the GIZA++ program.

The aligned word pairs were then sorted and counted. The automatically extracted results were not manually reviewed; however, the relative frequency of a given pair can serve as an approximate indicator of its reliability.

The more often an equivalent of a given word occurs relative to its other equivalents, the more useful it is likely to be.
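The counting and ranking idea described above can be illustrated with a minimal sketch. It is not Treq's actual implementation: the function name `build_dictionary`, the input format (a flat list of already aligned word pairs), and the toy data are all assumptions made for illustration, and the alignment step itself (done with GIZA++ in the original work) is not reproduced here.

```python
from collections import Counter, defaultdict


def build_dictionary(aligned_pairs):
    """Count aligned (source, target) word pairs and rank the equivalents
    of each source word by relative frequency.

    Hypothetical helper: `aligned_pairs` is assumed to be an iterable of
    (source_word, target_word) tuples produced by some word aligner.
    """
    pair_counts = Counter(aligned_pairs)

    # Total number of alignments observed for each source word.
    source_totals = Counter()
    for (source, _target), count in pair_counts.items():
        source_totals[source] += count

    # For each source word, list (equivalent, count, relative frequency).
    dictionary = defaultdict(list)
    for (source, target), count in pair_counts.items():
        relative = count / source_totals[source]
        dictionary[source].append((target, count, relative))

    # Most frequent equivalents first; ties broken alphabetically.
    for source in dictionary:
        dictionary[source].sort(key=lambda entry: (-entry[1], entry[0]))
    return dict(dictionary)


# Made-up toy data, not taken from InterCorp.
pairs = [
    ("pes", "dog"), ("pes", "dog"), ("pes", "dog"),
    ("pes", "hound"),
    ("kočka", "cat"),
]
entries = build_dictionary(pairs)
for target, count, relative in entries["pes"]:
    print(f"pes -> {target}: {count} occurrences, {relative:.0%}")
```

In this toy example, "dog" accounts for three of the four alignments of "pes", so it ranks above "hound". As the abstract notes, such a share is only an approximate indicator, since the underlying alignments are not reviewed.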