Standard

Pragmatic markers and parts of speech : On the problems of annotation of the speech corpus. / Bogdanova-Beglarian, Natalia; Zaides, Kristina.

In: CEUR Workshop Proceedings, Vol. 2813, 2020, p. 129-139.

Research output: Contribution to journalConference articlepeer-review

Harvard

APA

Vancouver

Author

BibTeX

@article{2d42c4b2b4164da99de2e0646414df1a,
title = "Pragmatic markers and parts of speech: On the problems of annotation of the speech corpus",
abstract = "The article considers the range of possibilities of pragmatic markers (PM) annotation: from the speaker{\textquoteright}s code to the speaker{\textquoteright}s commentaries for all difficult cases. The research is based on the material of two corpora of everyday Russian speech – “One Day of Speech” (ORD; dialogues / polylogues) and “Balanced Annotated Text Collection” (SAT; monologues). Two main annotation levels have become the objects of research in this paper: the part of speech of the original lexical unit, from which the basic version of the PM had derived (POS), and the model of formation of the PM which consist of more than one word (Model). The research shows the low feasibility of trying to fit PM into the system of traditional parts of speech, and, conversely, the importance and role of defining a model of formation of PM for their systematic description. In any case, the automatic annotation of corpus material turns out to be considerably difficult.",
keywords = "Model of formation, Part of speech, Pragmatic marker, Pragmaticalization, Speech corpus, Spoken speech",
author = "Natalia Bogdanova-Beglarian and Kristina Zaides",
note = "Publisher Copyright: Copyright {\textcopyright} 2020 for this paper by its authors.; Internet and Modern Society ; Conference date: 17-06-2020 Through 20-06-2020",
year = "2020",
language = "English",
volume = "2813",
pages = "129--139",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "RWTH Aahen University",
url = "http://ims.ifmo.ru/ru/pages/2/programma.htm",

}

RIS

TY - JOUR

T1 - Pragmatic markers and parts of speech

T2 - Internet and Modern Society

AU - Bogdanova-Beglarian, Natalia

AU - Zaides, Kristina

N1 - Conference code: 23

PY - 2020

Y1 - 2020

N2 - The article considers the range of possibilities of pragmatic markers (PM) annotation: from the speaker’s code to the speaker’s commentaries for all difficult cases. The research is based on the material of two corpora of everyday Russian speech – “One Day of Speech” (ORD; dialogues / polylogues) and “Balanced Annotated Text Collection” (SAT; monologues). Two main annotation levels have become the objects of research in this paper: the part of speech of the original lexical unit, from which the basic version of the PM had derived (POS), and the model of formation of the PM which consist of more than one word (Model). The research shows the low feasibility of trying to fit PM into the system of traditional parts of speech, and, conversely, the importance and role of defining a model of formation of PM for their systematic description. In any case, the automatic annotation of corpus material turns out to be considerably difficult.

AB - The article considers the range of possibilities of pragmatic markers (PM) annotation: from the speaker’s code to the speaker’s commentaries for all difficult cases. The research is based on the material of two corpora of everyday Russian speech – “One Day of Speech” (ORD; dialogues / polylogues) and “Balanced Annotated Text Collection” (SAT; monologues). Two main annotation levels have become the objects of research in this paper: the part of speech of the original lexical unit, from which the basic version of the PM had derived (POS), and the model of formation of the PM which consist of more than one word (Model). The research shows the low feasibility of trying to fit PM into the system of traditional parts of speech, and, conversely, the importance and role of defining a model of formation of PM for their systematic description. In any case, the automatic annotation of corpus material turns out to be considerably difficult.

KW - Model of formation

KW - Part of speech

KW - Pragmatic marker

KW - Pragmaticalization

KW - Speech corpus

KW - Spoken speech

UR - http://www.scopus.com/inward/record.url?scp=85101624202&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85101624202

VL - 2813

SP - 129

EP - 139

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

Y2 - 17 June 2020 through 20 June 2020

ER -

ID: 87684014