The paper deals with a corpus of the Russian language of the 19th century. The corpus makes it possible to accomplish some essential tasks of modern Russian linguistics, such as obtaining various linguistic and statistical information, investigating dynamic processes in the vocabulary, and analyzing grammatical changes in the lexicon. To make the corpus representative, special criteria must be determined. The corpus is morphologically annotated, i.e. each word form is assigned a list of morphological features. Additionally, the metadata set includes identifiers of structural divisions together with external features. The metadata description is based on international recommendations.

Original language: English
Pages (from-to): 146-151
Number of pages: 6
Journal: Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
Volume: 2807
State: Published - 1 Dec 2003
Event: 6th International Conference, TSD 2003 - Ceske Budejovice
Duration: 8 Sep 2003 - 12 Sep 2003

    Scopus subject areas

  • Hardware and Architecture
  • Computer Science(all)
  • Theoretical Computer Science

ID: 30268717