The chief objective of the study was to examine the phrasing behaviour of transformer-based neural networks from a linguistic point of view. The transformer-based architecture was trained on prosodic phrasing in isolated sentences read out on request, but was then asked to predict prosodic phrases in continuous journalistic texts taken from radio news bulletins.
The transfer was quite successful in that most of the prosodic phrase boundaries in the actual newsreading (established by expert auditory analysis) were correctly suggested by the machine. This result is not unexpected, as both genres belong to a clearly enunciated, informative speaking style.
The outcome partially rehabilitates so-called laboratory speech, which is sometimes branded as ecologically invalid. The follow-up analyses revealed that the differences between human phrasing in news bulletins and the partition suggested by the machine can be classified into meaningful linguistic categories based on syntactic structure or semantic content, and as such they can inform further research design.