The sound database formation for the allophone-based model for English concatenative speech synthesis

Karina Evgrafova

The goal of this paper is to describe the development of the sound database for the allophone-based model for English concatenative speech synthesis. The procedure for constructing the sound unit inventory is described and its main results are presented. At present, the optimized sound unit inventory of the allophonic database for English concatenative speech synthesis contains 1200 elements (1000 vowel allophones and 200 consonant allophones). The smoothness of the junctions between allophones indicates that the segmentation is of high quality. The reduction in the number of database components as a result of optimization does not affect the quality of the resulting synthesized speech. At the segmental level, this quality can be evaluated as fairly high in terms of both naturalness and intelligibility.

Original language	English
Title of host publication	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages	219-225
Number of pages	7
Publication status	Published - 1 Dec 2005
Event	8th International Conference on Text, Speech and Dialogue, TSD 2005 - Karlovy Vary, Czech Republic. Duration: 12 Sep 2005 → 15 Sep 2005

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	3658 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	8th International Conference on Text, Speech and Dialogue, TSD 2005
Country/Territory	Czech Republic
City	Karlovy Vary
Period	12/09/05 → 15/09/05

Scopus subject areas

Biochemistry, Genetics and Molecular Biology (all)
Computer Science (all)
Theoretical Computer Science

ID: 41279942