Charles Explorer logo
🇬🇧

Prague Database of Spoken English

Publication

Abstract

This CD brings you a multi-purpose corpus of spoken dialog English. 145 469 tokens, 12 203 sentences and 864 minutes of spontaneous dialog speech have been recorded, manually transcribed and manually edited in three interlinked layers. The domain is reminiscing about personal photograph collections, which were recorded within the Companions project.

The goal of this project was to create virtual companions that would be able to have a natural conversation with humans. The setup is partly Wizard of Oz, but mostly conversation of two humans.