The «One Day of Speech» Corpus: Phonetic and Syntactic Studies of Everyday Spoken Russian

Research output

12 Citations (Scopus)

Abstract

The studies described in the paper are made on the base of the ORD – “One day of speech” – corpus of Russian everyday speech which contains long-term audio recordings of daily communication. The ORD corpus provides rich authentic material for research in phonetics and syntax of spoken Russian, and may be used for adjustment and improvement of speech synthesis and recognition systems. Current phonetic investigations of the ORD corpus relate to temporal studies, study of speech reduction, phonetic realization of words and affixes, investigation of phonetic errors and mondegreens, studies of rhythm structures and hesitation phenomena. Syntactic studies primarily deal with linear word order of syntactic groups, syntactic complexity of spoken utterances, and specific syntactic phenomena of spontaneous speech. In this paper, we summarize main achievements in phonetic and syntactic studies made on the base of the ORD corpus and outline some directions for further investigations.
Original languageEnglish
Pages (from-to)429-437
JournalLecture Notes in Computer Science
Volume9319
DOIs
Publication statusPublished - 2015

Fingerprint

Speech analysis
Syntactics
Audio recordings
Speech synthesis
Speech Synthesis
Speech recognition
Speech Recognition
Speech
Corpus
Syntax
Adjustment
Communication

Scopus subject areas

  • Language and Linguistics
  • Information Systems

Cite this

@article{a28661a13c8344ba8c5dc082c4010a9a,
title = "The «One Day of Speech» Corpus: Phonetic and Syntactic Studies of Everyday Spoken Russian",
abstract = "The studies described in the paper are made on the base of the ORD – “One day of speech” – corpus of Russian everyday speech which contains long-term audio recordings of daily communication. The ORD corpus provides rich authentic material for research in phonetics and syntax of spoken Russian, and may be used for adjustment and improvement of speech synthesis and recognition systems. Current phonetic investigations of the ORD corpus relate to temporal studies, study of speech reduction, phonetic realization of words and affixes, investigation of phonetic errors and mondegreens, studies of rhythm structures and hesitation phenomena. Syntactic studies primarily deal with linear word order of syntactic groups, syntactic complexity of spoken utterances, and specific syntactic phenomena of spontaneous speech. In this paper, we summarize main achievements in phonetic and syntactic studies made on the base of the ORD corpus and outline some directions for further investigations.",
keywords = "Everyday spoken Russian · Speech corpus · Phonetics ·Synax, phonetics, syntax, speech corpus",
author = "N. Bogdanova-Beglarian and T. Sherstinova and G. Martynenko",
year = "2015",
doi = "10.1007/978-3-319-23132-7_53",
language = "English",
volume = "9319",
pages = "429--437",
journal = "Lecture Notes in Computer Science",
issn = "0302-9743",
publisher = "Springer",

}

TY - JOUR

T1 - The «One Day of Speech» Corpus: Phonetic and Syntactic Studies of Everyday Spoken Russian

AU - Bogdanova-Beglarian, N.

AU - Sherstinova, T.

AU - Martynenko, G.

PY - 2015

Y1 - 2015

N2 - The studies described in the paper are made on the base of the ORD – “One day of speech” – corpus of Russian everyday speech which contains long-term audio recordings of daily communication. The ORD corpus provides rich authentic material for research in phonetics and syntax of spoken Russian, and may be used for adjustment and improvement of speech synthesis and recognition systems. Current phonetic investigations of the ORD corpus relate to temporal studies, study of speech reduction, phonetic realization of words and affixes, investigation of phonetic errors and mondegreens, studies of rhythm structures and hesitation phenomena. Syntactic studies primarily deal with linear word order of syntactic groups, syntactic complexity of spoken utterances, and specific syntactic phenomena of spontaneous speech. In this paper, we summarize main achievements in phonetic and syntactic studies made on the base of the ORD corpus and outline some directions for further investigations.

AB - The studies described in the paper are made on the base of the ORD – “One day of speech” – corpus of Russian everyday speech which contains long-term audio recordings of daily communication. The ORD corpus provides rich authentic material for research in phonetics and syntax of spoken Russian, and may be used for adjustment and improvement of speech synthesis and recognition systems. Current phonetic investigations of the ORD corpus relate to temporal studies, study of speech reduction, phonetic realization of words and affixes, investigation of phonetic errors and mondegreens, studies of rhythm structures and hesitation phenomena. Syntactic studies primarily deal with linear word order of syntactic groups, syntactic complexity of spoken utterances, and specific syntactic phenomena of spontaneous speech. In this paper, we summarize main achievements in phonetic and syntactic studies made on the base of the ORD corpus and outline some directions for further investigations.

KW - Everyday spoken Russian · Speech corpus · Phonetics ·Synax

KW - phonetics

KW - syntax

KW - speech corpus

U2 - 10.1007/978-3-319-23132-7_53

DO - 10.1007/978-3-319-23132-7_53

M3 - Article

VL - 9319

SP - 429

EP - 437

JO - Lecture Notes in Computer Science

JF - Lecture Notes in Computer Science

SN - 0302-9743

ER -