Documents

Existing Russian corpora do not yet support a systematic analysis of the language of official documents: they contain few such texts, and the genre classification and markup of non-fiction (including official and legal) texts remain unresolved problems. In everyday life, Russian speakers increasingly need to read and sign various official documents. Usually these are so-called internal documents, for example, contracts or consents. However, the language of such documents has not yet been examined using corpus methodology. The paper describes the initial stage of creating the Corpus of Russian Internal Documents and Acts (CorRIDA). The corpus contains 1.5 million words, includes documents from three socially significant domains (health, education, culture), and will allow the description of internal documents of various genres.
Translated title of the contribution: Corpus of Russian Internal Documents and Acts CorRIDA: Goals, Composition and Structure
Original language: Russian
Title of host publication: Компьютерная лингвистика и вычислительные онтологии. Выпуск 2 [Computational Linguistics and Computational Ontologies. Issue 2]
Subtitle of host publication: Труды XXI Международной объединенной научной конференции «Интернет и современное общество», IMS-2017, Санкт-Петербург, 31 мая – 2 июня 2018 г. Сборник научных статей [Proceedings of the XXI International Joint Scientific Conference "Internet and Modern Society", IMS-2017, St. Petersburg, May 31 to June 2, 2018. Collection of scientific articles]
Editors: А.В. Добров, В.П. Захаров, О.А. Митрофанова, М.В. Хохлова
Place of publication: СПб. [St. Petersburg]
Publisher: НИУ ИТМО
Pages: 112-120
ISBN (Print): 978-5-7577-0584-2
State: Published - 2018

    Scopus subject areas

  • Arts and Humanities (all)

    Research areas

  • official texts, Corpus of Russian Internal Documents, genres, socially important domains

ID: 36371138