It focuses on the specifics of the spontaneous spoken Czech data that will serve as the source for ORAL2013. It presents its design principles as well as the practical problems encountered and the solutions adopted during data collection.