Abstract
This paper deals with the acoustic properties of backchannels: turns within a dialogue that do not convey information but signal that the speaker is listening to his/her interlocutor (uh-huh, hm, etc.). The research is based on SibLing, a Russian corpus of dialogue speech, part of which (339 min of speech) was manually segmented into backchannels and non-backchannels. A number of acoustic parameters were then calculated: duration, intensity, fundamental frequency, and pause duration. Our data show that in Russian speech backchannels are shorter and have lower loudness and pitch than non-backchannels. Two classifiers were then tested: CART and SVM. The highest efficiency was achieved using SVM (F1 = 0.651) with the following feature set: duration, maximum fundamental frequency, and melodic slope. The most valuable feature was duration.
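For readers unfamiliar with the reported metric, the F1 score is the harmonic mean of precision and recall. The sketch below shows how such a score could be computed for a two-class backchannel task. It assumes F1 is measured for the backchannel class, which the abstract does not state; the function name, the label strings, and the toy data are hypothetical and are not taken from the paper or the SibLing corpus.

```python
def f1_score(gold, predicted, positive="backchannel"):
    """Return F1 for the positive class from two parallel label lists."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    if tp == 0:
        # No correct positive predictions: precision and recall are both 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    # Harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Toy labels invented for illustration only (not data from the corpus).
gold = ["backchannel", "other", "backchannel", "other", "backchannel"]
predicted = ["backchannel", "backchannel", "other", "other", "backchannel"]
score = f1_score(gold, predicted)
```

In this toy case the classifier finds two of the three backchannels and raises one false alarm, so precision and recall are both 2/3 and so is F1.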
Original language | English |
---|---|
Title of host publication | Speech and Computer |
Subtitle of host publication | 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7–9, 2020, Proceedings |
Editors | Alexey Karpov, Rodmonga Potapova |
Place of Publication | Cham |
Publisher | Springer Nature |
Pages | 204-213 |
ISBN (Electronic) | 978-3-030-60276-5 |
ISBN (Print) | 978-3-030-60275-8 |
State | Published - 2020 |
Event | 22nd International Conference on Speech and Computer - St. Petersburg, Russia => Online, St. Petersburg, Russian Federation. Duration: 7 Oct 2020 → 9 Oct 2020. http://specom.nw.ru/2020/program/SPECOM-ICR2020-Conference-Program-06102020.pdf |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 12335 |
ISSN (Print) | 0302-9743 |
Conference
Conference | 22nd International Conference on Speech and Computer |
---|---|
Abbreviated title | SPECOM 2020 |
Country/Territory | Russian Federation |
City | St. Petersburg |
Period | 7/10/20 → 9/10/20 |