Russian Sentence Corpus

Benchmark measures of eye movements in reading in Russian

Anna K. Laurinavichyute, Irina A. Sekerina, Svetlana Alexeeva, Kristine Bagdasaryan, Reinhold Kliegl

Research output: Contribution to journalArticleResearchpeer-review

1 Citation (Scopus)

Abstract

This article introduces a new corpus of eye movements in silent reading—the Russian Sentence Corpus (RSC). Russian uses the Cyrillic script, which has not yet been investigated in cross-linguistic eye movement research. As in every language studied so far, we confirmed the expected effects of low-level parameters, such as word length, frequency, and predictability, on the eye movements of skilled Russian readers. These findings allow us to add Slavic languages using Cyrillic script (exemplified by Russian) to the growing number of languages with different orthographies, ranging from the Roman-based European languages to logographic Asian ones, whose basic eye movement benchmarks conform to the universal comparative science of reading (Share, 2008). We additionally report basic descriptive corpus statistics and three exploratory investigations of the effects of Russian morphology on the basic eye movement measures, which illustrate the kinds of questions that researchers can answer using the RSC. The annotated corpus is freely available from its project page at the Open Science Framework: https://osf.io/x5q2r/.

Original languageEnglish
Pages (from-to)1161-‌1178
Number of pages18
JournalBehavior Research Methods
Volume51
Issue number3
DOIs
StatePublished - 15 Jun 2019

Keywords

  • Ambiguity
  • Corpus
  • Eye movements
  • Part of speech
  • Reading
  • Russian

Scopus subject areas

  • Experimental and Cognitive Psychology
  • Developmental and Educational Psychology
  • Arts and Humanities (miscellaneous)
  • Psychology (miscellaneous)
  • Psychology(all)

Cite this

Laurinavichyute, Anna K. ; Sekerina, Irina A. ; Alexeeva, Svetlana ; Bagdasaryan, Kristine ; Kliegl, Reinhold. / Russian Sentence Corpus : Benchmark measures of eye movements in reading in Russian. In: Behavior Research Methods. 2019 ; Vol. 51, No. 3. pp. 1161-‌1178.
@article{9fb4ef035dcf4012a24b2473b3b7cc47,
title = "Russian Sentence Corpus: Benchmark measures of eye movements in reading in Russian",
abstract = "This article introduces a new corpus of eye movements in silent reading—the Russian Sentence Corpus (RSC). Russian uses the Cyrillic script, which has not yet been investigated in cross-linguistic eye movement research. As in every language studied so far, we confirmed the expected effects of low-level parameters, such as word length, frequency, and predictability, on the eye movements of skilled Russian readers. These findings allow us to add Slavic languages using Cyrillic script (exemplified by Russian) to the growing number of languages with different orthographies, ranging from the Roman-based European languages to logographic Asian ones, whose basic eye movement benchmarks conform to the universal comparative science of reading (Share, 2008). We additionally report basic descriptive corpus statistics and three exploratory investigations of the effects of Russian morphology on the basic eye movement measures, which illustrate the kinds of questions that researchers can answer using the RSC. The annotated corpus is freely available from its project page at the Open Science Framework: https://osf.io/x5q2r/.",
keywords = "Ambiguity, Corpus, Eye movements, Part of speech, Reading, Russian",
author = "Laurinavichyute, {Anna K.} and Sekerina, {Irina A.} and Svetlana Alexeeva and Kristine Bagdasaryan and Reinhold Kliegl",
year = "2019",
month = "6",
day = "15",
doi = "10.3758/s13428-018-1051-6",
language = "English",
volume = "51",
pages = "1161--‌1178",
journal = "Behavior Research Methods",
issn = "1554-351X",
publisher = "Springer",
number = "3",

}

Laurinavichyute, AK, Sekerina, IA, Alexeeva, S, Bagdasaryan, K & Kliegl, R 2019, 'Russian Sentence Corpus: Benchmark measures of eye movements in reading in Russian', Behavior Research Methods, vol. 51, no. 3, pp. 1161-‌1178. https://doi.org/10.3758/s13428-018-1051-6

Russian Sentence Corpus : Benchmark measures of eye movements in reading in Russian. / Laurinavichyute, Anna K.; Sekerina, Irina A.; Alexeeva, Svetlana; Bagdasaryan, Kristine; Kliegl, Reinhold.

In: Behavior Research Methods, Vol. 51, No. 3, 15.06.2019, p. 1161-‌1178.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - Russian Sentence Corpus

T2 - Benchmark measures of eye movements in reading in Russian

AU - Laurinavichyute, Anna K.

AU - Sekerina, Irina A.

AU - Alexeeva, Svetlana

AU - Bagdasaryan, Kristine

AU - Kliegl, Reinhold

PY - 2019/6/15

Y1 - 2019/6/15

N2 - This article introduces a new corpus of eye movements in silent reading—the Russian Sentence Corpus (RSC). Russian uses the Cyrillic script, which has not yet been investigated in cross-linguistic eye movement research. As in every language studied so far, we confirmed the expected effects of low-level parameters, such as word length, frequency, and predictability, on the eye movements of skilled Russian readers. These findings allow us to add Slavic languages using Cyrillic script (exemplified by Russian) to the growing number of languages with different orthographies, ranging from the Roman-based European languages to logographic Asian ones, whose basic eye movement benchmarks conform to the universal comparative science of reading (Share, 2008). We additionally report basic descriptive corpus statistics and three exploratory investigations of the effects of Russian morphology on the basic eye movement measures, which illustrate the kinds of questions that researchers can answer using the RSC. The annotated corpus is freely available from its project page at the Open Science Framework: https://osf.io/x5q2r/.

AB - This article introduces a new corpus of eye movements in silent reading—the Russian Sentence Corpus (RSC). Russian uses the Cyrillic script, which has not yet been investigated in cross-linguistic eye movement research. As in every language studied so far, we confirmed the expected effects of low-level parameters, such as word length, frequency, and predictability, on the eye movements of skilled Russian readers. These findings allow us to add Slavic languages using Cyrillic script (exemplified by Russian) to the growing number of languages with different orthographies, ranging from the Roman-based European languages to logographic Asian ones, whose basic eye movement benchmarks conform to the universal comparative science of reading (Share, 2008). We additionally report basic descriptive corpus statistics and three exploratory investigations of the effects of Russian morphology on the basic eye movement measures, which illustrate the kinds of questions that researchers can answer using the RSC. The annotated corpus is freely available from its project page at the Open Science Framework: https://osf.io/x5q2r/.

KW - Ambiguity

KW - Corpus

KW - Eye movements

KW - Part of speech

KW - Reading

KW - Russian

UR - http://www.scopus.com/inward/record.url?scp=85048520753&partnerID=8YFLogxK

UR - http://www.mendeley.com/research/russian-sentence-corpus-benchmark-measures-eye-movements-reading-russian

U2 - 10.3758/s13428-018-1051-6

DO - 10.3758/s13428-018-1051-6

M3 - Article

VL - 51

SP - 1161-‌1178

JO - Behavior Research Methods

JF - Behavior Research Methods

SN - 1554-351X

IS - 3

ER -