Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study

Standard

Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study. / Bogdanova-Beglarian, Natalia V. ; Blinova, Olga V. ; Sherstinova, Tatiana Yu. ; Baeva, Ekaterina M. ; Gorbunova, Daria ; Popova, Tatiana I. .

In: CONFERENCE OF OPEN INNOVATIONS ASSOCIATION, FRUCT, 15.09.2020, p. 288-293.

Research output: Contribution to journal › Conference article

BibTeX

@article{6309754af30543a5a11af415f13d11ef,

title = "Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study",

abstract = "The paper presents recent results of a multilevel analysis of representative corpus data, conducted in order to identify key speech parameters (lexical, morphological and syntactic) that can diagnose some social/biological characteristics of a speaker or, more broadly, a modern Russian urban sociolect. The study is based on the everyday Russian speech corpus “One Speaker{\textquoteright}s Day”. Specific data were obtained on the analysis of the annotated subcorpus of 289,205 tokens, which includes recorded “speech days” of 57 men and 48 women, which were the research participants, as well as speech fragments of 87 men and 139 women, which were their interlocutors. Thus, the total number of speakers in the subsample amounts to 144 men and 187 women. The article also begs the question of Data Mining approach usability to the subcorpus and possibilities of further research using machine learning. The results obtained are important for the optimization of speech technologies systems, for theoretical understanding of linguistic processes, as well as for monitoring various social processes taking place in modern Russian metropolis.",

keywords = "речевой корпус, прагматический маркер, социолингвистика",

author = "Bogdanova-Beglarian, {Natalia V.} and Blinova, {Olga V.} and Sherstinova, {Tatiana Yu.} and Baeva, {Ekaterina M.} and Daria Gorbunova and Popova, {Tatiana I.}",

note = "Bogdanova-Beglarian, N., Baeva E., Blinova O., Sherstinova T., Gorbunova D., Popova T. Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study // Proceedings of the 27th IEEE Conference of the Open Innovations Association FRUCT. FRUCT{\textquoteright}27. The University of Trento (Italy), 7-9 September 2020. Trento, Italy, FRUCT Oy, Finland, Vol. 2 (ACM volume) / S. Balandin, L. Turchet, T. Tyutina. (eds.). Pp. 288 293. ",

year = "2020",

month = sep,

day = "15",

language = "English",

pages = "288--293",

journal = "Conference of Open Innovation Association, FRUCT",

issn = "2305-7254",

publisher = "FRUCT Oy",

}

RIS

TY - JOUR

T1 - Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study

AU - Bogdanova-Beglarian, Natalia V.

AU - Blinova, Olga V.

AU - Sherstinova, Tatiana Yu.

AU - Baeva, Ekaterina M.

AU - Gorbunova, Daria

AU - Popova, Tatiana I.

N1 - Bogdanova-Beglarian, N., Baeva E., Blinova O., Sherstinova T., Gorbunova D., Popova T. Sociolinguistic Variability of Russian Everyday Speech: A Corpus-Based Study // Proceedings of the 27th IEEE Conference of the Open Innovations Association FRUCT. FRUCT’27. The University of Trento (Italy), 7-9 September 2020. Trento, Italy, FRUCT Oy, Finland, Vol. 2 (ACM volume) / S. Balandin, L. Turchet, T. Tyutina. (eds.). Pp. 288 293.

PY - 2020/9/15

Y1 - 2020/9/15

N2 - The paper presents recent results of a multilevel analysis of representative corpus data, conducted in order to identify key speech parameters (lexical, morphological and syntactic) that can diagnose some social/biological characteristics of a speaker or, more broadly, a modern Russian urban sociolect. The study is based on the everyday Russian speech corpus “One Speaker’s Day”. Specific data were obtained on the analysis of the annotated subcorpus of 289,205 tokens, which includes recorded “speech days” of 57 men and 48 women, which were the research participants, as well as speech fragments of 87 men and 139 women, which were their interlocutors. Thus, the total number of speakers in the subsample amounts to 144 men and 187 women. The article also begs the question of Data Mining approach usability to the subcorpus and possibilities of further research using machine learning. The results obtained are important for the optimization of speech technologies systems, for theoretical understanding of linguistic processes, as well as for monitoring various social processes taking place in modern Russian metropolis.

AB - The paper presents recent results of a multilevel analysis of representative corpus data, conducted in order to identify key speech parameters (lexical, morphological and syntactic) that can diagnose some social/biological characteristics of a speaker or, more broadly, a modern Russian urban sociolect. The study is based on the everyday Russian speech corpus “One Speaker’s Day”. Specific data were obtained on the analysis of the annotated subcorpus of 289,205 tokens, which includes recorded “speech days” of 57 men and 48 women, which were the research participants, as well as speech fragments of 87 men and 139 women, which were their interlocutors. Thus, the total number of speakers in the subsample amounts to 144 men and 187 women. The article also begs the question of Data Mining approach usability to the subcorpus and possibilities of further research using machine learning. The results obtained are important for the optimization of speech technologies systems, for theoretical understanding of linguistic processes, as well as for monitoring various social processes taking place in modern Russian metropolis.

KW - речевой корпус, прагматический маркер, социолингвистика

UR - https://elibrary.ru/item.asp?id=43968561

M3 - Conference article

SP - 288

EP - 293

JO - Conference of Open Innovation Association, FRUCT

JF - Conference of Open Innovation Association, FRUCT

SN - 2305-7254

ER -

ID: 76986751