Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech

Research output

15 Citations (Scopus)

Abstract

The ORD corpus is one of the largest resources of contemporary spoken Russian. By 2014, its collection numbered about 400 h of recordings made by a group of 40 respondents (20 men and 20 women, of different ages and professions), who volunteered to spend a whole day with a switched-on voice recorder, recording all their verbal communication. The corpus presents the unique linguistic material recorded in natural communicative situations, allowing spoken Russian and the everyday discourse to be studied in many aspects. However, the original sample of respondents was not sufficient enough to study a sociolinguistic variation of speech. Thus, it was decided to launch a large project aiming at the ORD sociolinguistic extension, which was supported by the Russian Science Foundation. The paper describes the general principles for the sociolinguistic extension of the corpus.
Original languageEnglish
Pages (from-to)659-666
JournalLecture Notes in Computer Science
Volume9811
DOIs
Publication statusPublished - 2016
EventInternational Conference on Speech and Computer - Budapest
Duration: 23 Aug 201627 Aug 2016
http://www.specom2016.hte.hu/

Scopus subject areas

  • Language and Linguistics
  • Information Systems

Cite this

@article{03920cf9147b4d8bb06574632caca348,
title = "Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech",
abstract = "The ORD corpus is one of the largest resources of contemporary spoken Russian. By 2014, its collection numbered about 400 h of recordings made by a group of 40 respondents (20 men and 20 women, of different ages and professions), who volunteered to spend a whole day with a switched-on voice recorder, recording all their verbal communication. The corpus presents the unique linguistic material recorded in natural communicative situations, allowing spoken Russian and the everyday discourse to be studied in many aspects. However, the original sample of respondents was not sufficient enough to study a sociolinguistic variation of speech. Thus, it was decided to launch a large project aiming at the ORD sociolinguistic extension, which was supported by the Russian Science Foundation. The paper describes the general principles for the sociolinguistic extension of the corpus.",
keywords = "Speech corpus, Everyday spoken Russian, Oral communication, Sociolinguistics, Social groupings, Sociolects Speech variation",
author = "N. Bogdanova-Beglarian and T. Sherstinova and O. Blinova and O. Ermolova and E. Baeva and G. Martynenko and A. Ryko",
year = "2016",
doi = "10.1007/978-3-319-43958-7_80",
language = "English",
volume = "9811",
pages = "659--666",
journal = "Lecture Notes in Computer Science",
issn = "0302-9743",
publisher = "Springer",

}

TY - JOUR

T1 - Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech

AU - Bogdanova-Beglarian, N.

AU - Sherstinova, T.

AU - Blinova, O.

AU - Ermolova, O.

AU - Baeva, E.

AU - Martynenko, G.

AU - Ryko, A.

PY - 2016

Y1 - 2016

N2 - The ORD corpus is one of the largest resources of contemporary spoken Russian. By 2014, its collection numbered about 400 h of recordings made by a group of 40 respondents (20 men and 20 women, of different ages and professions), who volunteered to spend a whole day with a switched-on voice recorder, recording all their verbal communication. The corpus presents the unique linguistic material recorded in natural communicative situations, allowing spoken Russian and the everyday discourse to be studied in many aspects. However, the original sample of respondents was not sufficient enough to study a sociolinguistic variation of speech. Thus, it was decided to launch a large project aiming at the ORD sociolinguistic extension, which was supported by the Russian Science Foundation. The paper describes the general principles for the sociolinguistic extension of the corpus.

AB - The ORD corpus is one of the largest resources of contemporary spoken Russian. By 2014, its collection numbered about 400 h of recordings made by a group of 40 respondents (20 men and 20 women, of different ages and professions), who volunteered to spend a whole day with a switched-on voice recorder, recording all their verbal communication. The corpus presents the unique linguistic material recorded in natural communicative situations, allowing spoken Russian and the everyday discourse to be studied in many aspects. However, the original sample of respondents was not sufficient enough to study a sociolinguistic variation of speech. Thus, it was decided to launch a large project aiming at the ORD sociolinguistic extension, which was supported by the Russian Science Foundation. The paper describes the general principles for the sociolinguistic extension of the corpus.

KW - Speech corpus

KW - Everyday spoken Russian

KW - Oral communication

KW - Sociolinguistics

KW - Social groupings

KW - Sociolects Speech variation

U2 - 10.1007/978-3-319-43958-7_80

DO - 10.1007/978-3-319-43958-7_80

M3 - Article

VL - 9811

SP - 659

EP - 666

JO - Lecture Notes in Computer Science

JF - Lecture Notes in Computer Science

SN - 0302-9743

ER -