Результаты исследований: Научные публикации в периодических изданиях › статья в журнале по материалам конференции › Рецензирование
Pragmatic markers and parts of speech : On the problems of annotation of the speech corpus. / Bogdanova-Beglarian, Natalia; Zaides, Kristina.
в: CEUR Workshop Proceedings, Том 2813, 2020, стр. 129-139.Результаты исследований: Научные публикации в периодических изданиях › статья в журнале по материалам конференции › Рецензирование
}
TY - JOUR
T1 - Pragmatic markers and parts of speech
T2 - Internet and Modern Society
AU - Bogdanova-Beglarian, Natalia
AU - Zaides, Kristina
N1 - Conference code: 23
PY - 2020
Y1 - 2020
N2 - The article considers the range of possibilities of pragmatic markers (PM) annotation: from the speaker’s code to the speaker’s commentaries for all difficult cases. The research is based on the material of two corpora of everyday Russian speech – “One Day of Speech” (ORD; dialogues / polylogues) and “Balanced Annotated Text Collection” (SAT; monologues). Two main annotation levels have become the objects of research in this paper: the part of speech of the original lexical unit, from which the basic version of the PM had derived (POS), and the model of formation of the PM which consist of more than one word (Model). The research shows the low feasibility of trying to fit PM into the system of traditional parts of speech, and, conversely, the importance and role of defining a model of formation of PM for their systematic description. In any case, the automatic annotation of corpus material turns out to be considerably difficult.
AB - The article considers the range of possibilities of pragmatic markers (PM) annotation: from the speaker’s code to the speaker’s commentaries for all difficult cases. The research is based on the material of two corpora of everyday Russian speech – “One Day of Speech” (ORD; dialogues / polylogues) and “Balanced Annotated Text Collection” (SAT; monologues). Two main annotation levels have become the objects of research in this paper: the part of speech of the original lexical unit, from which the basic version of the PM had derived (POS), and the model of formation of the PM which consist of more than one word (Model). The research shows the low feasibility of trying to fit PM into the system of traditional parts of speech, and, conversely, the importance and role of defining a model of formation of PM for their systematic description. In any case, the automatic annotation of corpus material turns out to be considerably difficult.
KW - Model of formation
KW - Part of speech
KW - Pragmatic marker
KW - Pragmaticalization
KW - Speech corpus
KW - Spoken speech
UR - http://www.scopus.com/inward/record.url?scp=85101624202&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85101624202
VL - 2813
SP - 129
EP - 139
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
SN - 1613-0073
Y2 - 17 June 2020 through 20 June 2020
ER -
ID: 87684014