The article examines the perception and extraction of keyphrases in written and spoken text. Experiments were performed on a dataset comprising transcripts and audio recordings of lectures by Russian-speaking participants of the “Postnauka” project. The results show that automated keyphrase extraction methods have limited accuracy: statistical algorithms perform worst, while generative AI models such as ChatGPT come closer to human perception. Although keyphrases extracted from written and spoken texts overlap to some extent, spoken text shows greater variability. Experiments using synthesized speech indicate that listeners rely heavily on content, rather than acoustic cues, when understanding spoken text. Acoustic analysis reveals that keyphrases are distinguished by longer duration, wider pitch range, and higher energy, in line with previous findings for other languages.
Original language: English
Title of host publication: Speech and Computer
Subtitle of host publication: 26th International Conference, SPECOM 2024, Belgrade, Serbia, November 25–28, 2024, Proceedings, Part I
Pages: 265-280
Number of pages: 16
DOI
Publication status: Published - 2025
Event: XXVIth International Conference “Speech and Computer”: Specom 2024 - University of Novi Sad, Belgrade, Serbia
Duration: 25 Nov 2024 – 28 Nov 2024
Conference number: 26
https://specom.nw.ru/2024/
https://specom2024.ftn.uns.ac.rs

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 15299 LNAI

Conference

Conference: XXVIth International Conference “Speech and Computer”
Abbreviated title: SPECOM-2024
Country/Territory: Serbia
City: Belgrade
Period: 25/11/24 – 28/11/24
Website

ID: 126874264