Standard

Eliciting Meaningful Units from Speech. / Кочаров, Даниил Александрович; Качковская, Татьяна Васильевна; Скрелин, Павел Анатольевич.

Proceeding of Interspeech 2017. 2017. стр. 2128-2132.

Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференцийстатья в сборнике материалов конференциинаучнаяРецензирование

Harvard

Кочаров, ДА, Качковская, ТВ & Скрелин, ПА 2017, Eliciting Meaningful Units from Speech. в Proceeding of Interspeech 2017. стр. 2128-2132, Interspeech 2017, Stockholm, Швеция, 20/08/17. https://doi.org/10.21437/Interspeech.2017-855

APA

Vancouver

Author

BibTeX

@inproceedings{3c9d016666704ccf811efb2898a03cfd,
title = "Eliciting Meaningful Units from Speech",
abstract = "Elicitation of information structure from speech is a crucial stepin automatic speech understanding. In terms of both productionand perception, we consider intonational phrase to be the basicmeaningful unit of information structure in speech. The cur-rent paper presents a method of detecting these units in speechby processing both the recorded speech and its textual repre-sentation. Using syntactic information, we split text into smallgroups of words closely connected with each other. Assum-ing that intonational phrases are built from these small groups,we use acoustic information to reveal their actual boundaries.The procedure was initially developed for processing Russianspeech, and we have achieved the best published results forthis language with F1equal to 0.91. We assume that it maybe adapted for other languages that have some amount of readspeech resources, including under-resourced languages. Forcomparison we have evaluated it on English material (BostonUniversity Radio Speech Corpus). Our results, F1of 0.76, arecomparable with the top systems designed for English.",
author = "Кочаров, {Даниил Александрович} and Качковская, {Татьяна Васильевна} and Скрелин, {Павел Анатольевич}",
year = "2017",
doi = "10.21437/Interspeech.2017-855",
language = "English",
pages = "2128--2132",
booktitle = "Proceeding of Interspeech 2017",
note = "Interspeech 2017 ; Conference date: 20-08-2017 Through 24-08-2017",

}

RIS

TY - GEN

T1 - Eliciting Meaningful Units from Speech

AU - Кочаров, Даниил Александрович

AU - Качковская, Татьяна Васильевна

AU - Скрелин, Павел Анатольевич

PY - 2017

Y1 - 2017

N2 - Elicitation of information structure from speech is a crucial stepin automatic speech understanding. In terms of both productionand perception, we consider intonational phrase to be the basicmeaningful unit of information structure in speech. The cur-rent paper presents a method of detecting these units in speechby processing both the recorded speech and its textual repre-sentation. Using syntactic information, we split text into smallgroups of words closely connected with each other. Assum-ing that intonational phrases are built from these small groups,we use acoustic information to reveal their actual boundaries.The procedure was initially developed for processing Russianspeech, and we have achieved the best published results forthis language with F1equal to 0.91. We assume that it maybe adapted for other languages that have some amount of readspeech resources, including under-resourced languages. Forcomparison we have evaluated it on English material (BostonUniversity Radio Speech Corpus). Our results, F1of 0.76, arecomparable with the top systems designed for English.

AB - Elicitation of information structure from speech is a crucial stepin automatic speech understanding. In terms of both productionand perception, we consider intonational phrase to be the basicmeaningful unit of information structure in speech. The cur-rent paper presents a method of detecting these units in speechby processing both the recorded speech and its textual repre-sentation. Using syntactic information, we split text into smallgroups of words closely connected with each other. Assum-ing that intonational phrases are built from these small groups,we use acoustic information to reveal their actual boundaries.The procedure was initially developed for processing Russianspeech, and we have achieved the best published results forthis language with F1equal to 0.91. We assume that it maybe adapted for other languages that have some amount of readspeech resources, including under-resourced languages. Forcomparison we have evaluated it on English material (BostonUniversity Radio Speech Corpus). Our results, F1of 0.76, arecomparable with the top systems designed for English.

U2 - 10.21437/Interspeech.2017-855

DO - 10.21437/Interspeech.2017-855

M3 - Conference contribution

SP - 2128

EP - 2132

BT - Proceeding of Interspeech 2017

T2 - Interspeech 2017

Y2 - 20 August 2017 through 24 August 2017

ER -

ID: 122811947