The present contribution is a theoretical and methodological study of the possibilities of discourse processing by corpus methods. Despite the description complexity of phenomena "beyond the sentence boundary", we argue that, keeping in mind all the specific issues this task brings along, even more ways of a systematic analysis are possible.
Taking into account various attempts of the last decade in creating discourse-annotated corpora, a reliable way to proceed in any such analysis shows to be to distinguish among different layers of discourse analysis (in particular between "semantic" and "pragmatic" aspects) and to stick with the language form in opposition to classifying phenomena with no surface realization.