Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech

Links

https://link.springer.com/chapter/10.1007%2F978-3-319-43958-7_80
Final published version

DOI

https://doi.org/10.1007/978-3-319-43958-7_80
Final published version

The ORD corpus is one of the largest resources of contemporary spoken Russian. By 2014, its collection numbered about 400 h of recordings made by a group of 40 respondents (20 men and 20 women, of different ages and professions), who volunteered to spend a whole day with a switched-on voice recorder, recording all their verbal communication. The corpus presents the unique linguistic material recorded in natural communicative situations, allowing spoken Russian and the everyday discourse to be studied in many aspects. However, the original sample of respondents was not sufficient enough to study a sociolinguistic variation of speech. Thus, it was decided to launch a large project aiming at the ORD sociolinguistic extension, which was supported by the Russian Science Foundation. The paper describes the general principles for the sociolinguistic extension of the corpus.

Original language	English
Title of host publication	Speech and Computer
Subtitle of host publication	18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings
Publisher	Springer Nature
Pages	659-666
ISBN (Electronic)	978-3-319-43958-7
ISBN (Print)	978-3-319-43957-0
DOIs	https://doi.org/10.1007/978-3-319-43958-7_80
State	Published - 2016
Event	18 International Conference on Speech and Computer - Budapest, Hungary, Budapest, Hungary Duration: 23 Aug 2016 → 27 Aug 2016 Conference number: 18 http://www.specom2016.hte.hu/

Publication series

Name	Lecture Notes in Computer Science
Publisher	Springer Nature
Volume	9811
ISSN (Print)	0302-9743

Conference

Conference	18 International Conference on Speech and Computer
Abbreviated title	Specom 2016
Country/Territory	Hungary
City	Budapest
Period	23/08/16 → 27/08/16
Internet address	http://www.specom2016.hte.hu/

Scopus subject areas

Language and Linguistics
Information Systems

Research areas

Speech corpus, Everyday spoken Russian, Oral communication, Sociolinguistics, Social groupings, Sociolects, Speech variation

ID: 71354161