Research output: Contribution to journal › Conference article › peer-review
Pragmatic markers in the corpus “One Day of Speech” : Approaches to the annotation. / Zaides, Kristina; Popova, Tatiana; Bogdanova-Beglarian, Natalia.
In: CEUR Workshop Proceedings, Vol. 2303, 01.01.2018, p. 1-16.Research output: Contribution to journal › Conference article › peer-review
}
TY - JOUR
T1 - Pragmatic markers in the corpus “One Day of Speech”
T2 - 2018 International Workshop on Computational Models in Language and Speech, CMLS 2018
AU - Zaides, Kristina
AU - Popova, Tatiana
AU - Bogdanova-Beglarian, Natalia
PY - 2018/1/1
Y1 - 2018/1/1
N2 - The article describes the scheme of the annotation of pragmatic markers in the corpus of Russian everyday speech “One Day of Speech”. Pragmatic markers are defined as special units in the speech that have only pragmatic function without any (or with ‘bleached’) lexical meaning. The annotation of pragmatic markers is usually performed manually due to the existing ambiguity of markers in different contexts. The typology of pragmatic markers includes different groups marked with special annotation tags. The annotation process was split into two stages since several issues of tagging of PMs arose. The main problems, which occurred during the annotation process, and the possible ways of their solution are also discussed in the research. The paper propose the improved methods of problem solving during the annotation of pragmatic markers applied to the corpus of oral speech, which can be useful for the linguistic annotation of any other levels of oral speech.
AB - The article describes the scheme of the annotation of pragmatic markers in the corpus of Russian everyday speech “One Day of Speech”. Pragmatic markers are defined as special units in the speech that have only pragmatic function without any (or with ‘bleached’) lexical meaning. The annotation of pragmatic markers is usually performed manually due to the existing ambiguity of markers in different contexts. The typology of pragmatic markers includes different groups marked with special annotation tags. The annotation process was split into two stages since several issues of tagging of PMs arose. The main problems, which occurred during the annotation process, and the possible ways of their solution are also discussed in the research. The paper propose the improved methods of problem solving during the annotation of pragmatic markers applied to the corpus of oral speech, which can be useful for the linguistic annotation of any other levels of oral speech.
KW - Corpus annotation
KW - Corpus linguistics
KW - Corpus of everyday speech
KW - Pragmatic marker
KW - Spoken speech
UR - http://www.scopus.com/inward/record.url?scp=85060661811&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85060661811
VL - 2303
SP - 1
EP - 16
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
SN - 1613-0073
Y2 - 1 November 2018
ER -
ID: 43825554