Preparing audio recordings of everyday speech for prosody research: The case of the ord corpus

Результат исследований: Научные публикации в периодических изданияхстатья


Studying prosody is important for understanding many linguistic, pragmatic, and discourse phenomena, as well as for solution of many applied tasks (in particular, in speech technologies). Prosody of everyday speech is extremely diverse, demonstrating high interpersonal and intrapersonal variations. Furthermore, natural everyday speech produces a multitude of effects which are hardly possible to obtain in speech laboratories. Because of this fact, it is very important to create resources containing representative collections of everyday speech data. The ORD corpus is a large resource aimed at studying everyday Russian speech. The paper describes the main stages of speech processing in the ORD corpus starting from segmentation of original files into macroepisodes and up to compiling prosody information into the database. This prosody database will be further used for building empirical prosody models.

Язык оригиналаанглийский
Страницы (с-по)623-631
Число страниц9
ЖурналLecture Notes in Computer Science
СостояниеОпубликовано - 2017
Событие19th International Conference on Speech and Computer, SPECOM 2017 - Hatfield, Великобритания
Продолжительность: 11 сен 201715 сен 2017


Предметные области Scopus

  • Языки и лингвистика
  • Компьютерные науки (все)
  • Теоретические компьютерные науки