Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
CORPRES : Corpus of Russian professionally read speech. / Skrelin, Pavel; Volskaya, Nina; Kocharov, Daniil; Evgrafova, Karina; Glotova, Olga; Evdokimova, Vera.
Text, Speech and Dialogue - 13th International Conference, TSD 2010, Proceedings. Berlin Heidelberg : Springer Nature, 2010. стр. 392-399 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Том 6231 LNAI).Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
}
TY - GEN
T1 - CORPRES
T2 - 13th International Conference on Text, Speech and Dialogue, TSD 2010
AU - Skrelin, Pavel
AU - Volskaya, Nina
AU - Kocharov, Daniil
AU - Evgrafova, Karina
AU - Glotova, Olga
AU - Evdokimova, Vera
N1 - Conference code: 13
PY - 2010
Y1 - 2010
N2 - The paper introduces CORPRES - COrpus of Russian Professionally REad Speech developed at the Department of Phonetics, Saint Petersburg State University, as a result of a three-year project. The corpus includes samples of different speaking styles produced by 4 male and 4 female speakers. Six levels of annotation cover all phonetic and prosodic information about the recorded speech data, including labels for pitch marks, phonetic events, phonetic, orthographic and prosodic transcription. Precise phonetic transcription of the data provides an especially valuable resource for both research and development purposes. Overall corpus size is 60 hours of speech. The paper contains information about CORPRES design and annotation principles, and overall data description. Also, we discuss possible use of the corpus in phonetic research and speech technology as well as some findings on the Russian sound system obtained from the corpus data.
AB - The paper introduces CORPRES - COrpus of Russian Professionally REad Speech developed at the Department of Phonetics, Saint Petersburg State University, as a result of a three-year project. The corpus includes samples of different speaking styles produced by 4 male and 4 female speakers. Six levels of annotation cover all phonetic and prosodic information about the recorded speech data, including labels for pitch marks, phonetic events, phonetic, orthographic and prosodic transcription. Precise phonetic transcription of the data provides an especially valuable resource for both research and development purposes. Overall corpus size is 60 hours of speech. The paper contains information about CORPRES design and annotation principles, and overall data description. Also, we discuss possible use of the corpus in phonetic research and speech technology as well as some findings on the Russian sound system obtained from the corpus data.
KW - annotation
KW - manual transcription
KW - phonetic transcription
KW - Phonetics
KW - prosodic feature labelling
KW - speech corpus
KW - text-to-speech
UR - http://www.scopus.com/inward/record.url?scp=78049250875&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-15760-8_50
DO - 10.1007/978-3-642-15760-8_50
M3 - Conference contribution
SN - 3642157599
SN - 9783642157592
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 392
EP - 399
BT - Text, Speech and Dialogue - 13th International Conference, TSD 2010, Proceedings
PB - Springer Nature
CY - Berlin Heidelberg
Y2 - 6 September 2010 through 10 September 2010
ER -
ID: 4428711