CORPRES › Research at St Petersburg University

DOI

https://doi.org/10.1007/978-3-642-15760-8_50
Final published version

The paper introduces CORPRES (COrpus of Russian Professionally REad Speech), developed at the Department of Phonetics, Saint Petersburg State University, as a result of a three-year project. The corpus includes samples of different speaking styles produced by 4 male and 4 female speakers. Six levels of annotation cover all phonetic and prosodic information about the recorded speech data, including labels for pitch marks and phonetic events, as well as phonetic, orthographic and prosodic transcription. Precise phonetic transcription of the data provides an especially valuable resource for both research and development purposes. The overall corpus size is 60 hours of speech. The paper describes the design and annotation principles of CORPRES and gives an overall description of the data. We also discuss possible uses of the corpus in phonetic research and speech technology, as well as some findings on the Russian sound system obtained from the corpus data.

Original language	English
Title of host publication	Text, Speech and Dialogue - 13th International Conference, TSD 2010, Proceedings
Place of publication	Berlin Heidelberg
Publisher	Springer Nature
Pages	392-399
Number of pages	8
ISBN (print)	3642157599, 9783642157592
DOI	https://doi.org/10.1007/978-3-642-15760-8_50
Publication status	Published - 2010
Event	13th International Conference on Text, Speech and Dialogue, TSD 2010: 13th International Conference - Brno, Czech Republic. Duration: 6 Sep 2010 → 10 Sep 2010. Conference number: 13. https://www.tsdconference.org/

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	6231 LNAI
ISSN (print)	0302-9743
ISSN (electronic)	1611-3349

Conference

Conference	13th International Conference on Text, Speech and Dialogue, TSD 2010
Abbreviated title	TSD 2010
Country/Territory	Czech Republic
City	Brno
Period	6/09/10 → 10/09/10
Website	https://www.tsdconference.org/

Scopus subject areas

Theoretical Computer Science
Computer Science (all)

ID: 4428711

CORPRES: Corpus of Russian professionally read speech
