Preparing audio recordings of everyday speech for prosody research: The case of the ORD corpus

Abstract

Studying prosody is important for understanding many linguistic, pragmatic, and discourse phenomena, as well as for solving many applied tasks (in particular, in speech technologies). The prosody of everyday speech is extremely diverse and shows high interpersonal and intrapersonal variation. Furthermore, natural everyday speech produces a multitude of effects that can hardly be obtained in speech laboratories. It is therefore important to create resources containing representative collections of everyday speech data. The ORD corpus is a large resource aimed at studying everyday Russian speech. The paper describes the main stages of speech processing in the ORD corpus, from segmentation of the original files into macroepisodes to compiling prosody information into the database. This prosody database will be further used for building empirical prosody models.

Original language: English
Pages (from-to): 623-631
Number of pages: 9
Journal: Lecture Notes in Computer Science
Volume: 10458
DOIs: 10.1007/978-3-319-66429-3_62
Publication status: Published - 2017
Event: 19th International Conference on Speech and Computer, SPECOM 2017 - Hatfield
Duration: 11 Sep 2017 - 15 Sep 2017

Fingerprint

Audio recordings
Prosody
Speech Processing
Resources
Linguistics
Speech
Corpus
Segmentation

Scopus subject areas

  • Language and Linguistics
  • Computer Science (all)
  • Theoretical Computer Science

Cite this

@article{aa11adf2487747eeae04e66aee90e340,
title = "Preparing audio recordings of everyday speech for prosody research: The case of the {ORD} corpus",
abstract = "Studying prosody is important for understanding many linguistic, pragmatic, and discourse phenomena, as well as for solution of many applied tasks (in particular, in speech technologies). Prosody of everyday speech is extremely diverse, demonstrating high interpersonal and intrapersonal variations. Furthermore, natural everyday speech produces a multitude of effects which are hardly possible to obtain in speech laboratories. Because of this fact, it is very important to create resources containing representative collections of everyday speech data. The ORD corpus is a large resource aimed at studying everyday Russian speech. The paper describes the main stages of speech processing in the ORD corpus starting from segmentation of original files into macroepisodes and up to compiling prosody information into the database. This prosody database will be further used for building empirical prosody models.",
keywords = "Communication settings, Duration, Everyday speech, Phonetics, Pitch, Pragmatics, Prosody, Russian, Sociolinguistics, Speech corpus",
author = "Tatiana Sherstinova",
year = "2017",
doi = "10.1007/978-3-319-66429-3_62",
language = "English",
volume = "10458",
pages = "623--631",
journal = "Lecture Notes in Computer Science",
issn = "0302-9743",
publisher = "Springer",

}

TY - JOUR

T1 - Preparing audio recordings of everyday speech for prosody research

T2 - The case of the ORD corpus

AU - Sherstinova, Tatiana

PY - 2017

Y1 - 2017

N2 - Studying prosody is important for understanding many linguistic, pragmatic, and discourse phenomena, as well as for solution of many applied tasks (in particular, in speech technologies). Prosody of everyday speech is extremely diverse, demonstrating high interpersonal and intrapersonal variations. Furthermore, natural everyday speech produces a multitude of effects which are hardly possible to obtain in speech laboratories. Because of this fact, it is very important to create resources containing representative collections of everyday speech data. The ORD corpus is a large resource aimed at studying everyday Russian speech. The paper describes the main stages of speech processing in the ORD corpus starting from segmentation of original files into macroepisodes and up to compiling prosody information into the database. This prosody database will be further used for building empirical prosody models.

AB - Studying prosody is important for understanding many linguistic, pragmatic, and discourse phenomena, as well as for solution of many applied tasks (in particular, in speech technologies). Prosody of everyday speech is extremely diverse, demonstrating high interpersonal and intrapersonal variations. Furthermore, natural everyday speech produces a multitude of effects which are hardly possible to obtain in speech laboratories. Because of this fact, it is very important to create resources containing representative collections of everyday speech data. The ORD corpus is a large resource aimed at studying everyday Russian speech. The paper describes the main stages of speech processing in the ORD corpus starting from segmentation of original files into macroepisodes and up to compiling prosody information into the database. This prosody database will be further used for building empirical prosody models.

KW - Communication settings

KW - Duration

KW - Everyday speech

KW - Phonetics

KW - Pitch

KW - Pragmatics

KW - Prosody

KW - Russian

KW - Sociolinguistics

KW - Speech corpus

UR - http://www.scopus.com/inward/record.url?scp=85029511641&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-66429-3_62

DO - 10.1007/978-3-319-66429-3_62

M3 - Article

AN - SCOPUS:85029511641

VL - 10458

SP - 623

EP - 631

JO - Lecture Notes in Computer Science

JF - Lecture Notes in Computer Science

SN - 0302-9743

ER -