Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › Peer-reviewed
In search of sentence boundaries in spontaneous speech. / Bogdanova-Beglarian, Natalia.
Speech and Computer - 19th International Conference, SPECOM 2017, Proceedings. ed. / Alexey Karpov; Iosif Mporas; Rodmonga Potapova. Springer Nature, 2017. pp. 456-463 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10458 LNAI).
TY - GEN
T1 - In search of sentence boundaries in spontaneous speech
AU - Bogdanova-Beglarian, Natalia
N1 - Bogdanova-Beglarian N. In Search of Phrase Boundaries in Spontaneous Speech // SPECOM 2017. Lecture Notes in Artificial Intelligence, LNAI, vol. 10458. – Springer, Switzerland, 2017. – Pp. 456-463.
PY - 2017/1/1
Y1 - 2017/1/1
N2 - Oral text is certainly discrete. It is built of “small bricks”: units not only of the lexical level but also of the higher, syntactic level. The markers of this discreteness are ordinary syntagmatic pauses and hesitation pauses, the latter being physical (unfilled pauses, including breaks within clauses), sound pauses (e-e, m-m), or verbal ones (vot, kak eto, nu, znachit, etc.). However, these markers single out neither the syntagma nor the sentence as a unit for describing the syntactic structure of an oral text. Any type of pause may occur at any point in the audio sequence. The search for sentences in spontaneous speech is therefore quite complicated. To obtain such units, a method of coercive punctuation can be proposed; it was used to mark up spontaneous monologues from the collection of oral texts named «Balanced Annotated Textotec». The participants (philology experts) were asked to mark the ends of sentences by putting a period in transcripts in which neither pauses nor punctuation had been marked. They could rely only on the syntactic structure of the text and on the connections between words and predicate centers. Involving more than twenty experts in the experiment makes the results statistically more reliable. In this work we describe the results of the experiment and discuss how they can be used for the automatic detection of sentence boundaries in spontaneous speech.
AB - Oral text is certainly discrete. It is built of “small bricks”: units not only of the lexical level but also of the higher, syntactic level. The markers of this discreteness are ordinary syntagmatic pauses and hesitation pauses, the latter being physical (unfilled pauses, including breaks within clauses), sound pauses (e-e, m-m), or verbal ones (vot, kak eto, nu, znachit, etc.). However, these markers single out neither the syntagma nor the sentence as a unit for describing the syntactic structure of an oral text. Any type of pause may occur at any point in the audio sequence. The search for sentences in spontaneous speech is therefore quite complicated. To obtain such units, a method of coercive punctuation can be proposed; it was used to mark up spontaneous monologues from the collection of oral texts named «Balanced Annotated Textotec». The participants (philology experts) were asked to mark the ends of sentences by putting a period in transcripts in which neither pauses nor punctuation had been marked. They could rely only on the syntactic structure of the text and on the connections between words and predicate centers. Involving more than twenty experts in the experiment makes the results statistically more reliable. In this work we describe the results of the experiment and discuss how they can be used for the automatic detection of sentence boundaries in spontaneous speech.
KW - Discreteness of the oral text
KW - Phrase boundary
KW - Sentence
KW - Speech corpus
KW - Spontaneous monologue
KW - Syntagma
KW - Speech disfluencies
KW - Phrase breaks
KW - Spoken Russian
KW - Sociophonetics
UR - http://www.scopus.com/inward/record.url?scp=85029542337&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-66429-3_45
DO - 10.1007/978-3-319-66429-3_45
M3 - Conference contribution
AN - SCOPUS:85029542337
SN - 9783319664286
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 456
EP - 463
BT - Speech and Computer - 19th International Conference, SPECOM 2017, Proceedings
A2 - Karpov, Alexey
A2 - Mporas, Iosif
A2 - Potapova, Rodmonga
PB - Springer Nature
T2 - 19th International Conference on Speech and Computer
Y2 - 11 September 2017 through 15 September 2017
ER -
ID: 50412351