Standard

Russian secondary prepositions: Methodology of analysis. / Zakharov, Victor; Golovina, Anastasia; Alexeeva, Elena; Gudkov, Vadim.

In: CEUR Workshop Proceedings, Vol. 2780, 2020, p. 187-201.

Research output: Contribution to journal › Conference article › peer-review

BibTeX

@article{c82e8a5c54094c889805b40a1f5c0c88,
title = "Russian secondary prepositions: Methodology of analysis",
abstract = "The present study proposes a methodology of a corpus-based analysis of Russian secondary prepositions, primarily focusing on multiwords. Secondary prepositions are units motivated by content words (nouns, adverbs, verbs), which may be combined with primary prepositions to form multiword prepositions (MWPs). Multiword prepositions perform the grammatical function of a preposition in a certain position of a syntactic structure in some contexts and can be a free combination in others. A strict division between secondary multiword prepositions and equivalent free word combinations is not specified. This presents an issue in the task of building a language model as compound prepositional units are commonly mislabeled as free combinations or are labelled inconsistently, thus leading to parsing errors with far-reaching consequences. Our larger study aims at solving this problem by identifying, describing and eventually formalizing the full inventory of Russian MWPs, which demands a special corpus-based research. This paper is devoted to statistical analysis of the use of secondary multiword prepositions in corpora using prepositions expressing causal relations as the base material. The features of multiword prepositions in the function of a preposition are described. Statistical data on the ratio of the use of individual multiword expressions as prepositional units and as free combinations are provided.",
keywords = "Corpus statistics, Multiword prepositions, Russian language, Secondary prepositions",
author = "Victor Zakharov and Anastasia Golovina and Elena Alexeeva and Vadim Gudkov",
note = "Publisher Copyright: Copyright {\textcopyright} 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).; 2020 Computational Models in Language and Speech Workshop, CMLS 2020 ; Conference date: 12-11-2020 Through 13-11-2020",
year = "2020",
language = "English",
volume = "2780",
pages = "187--201",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "RWTH Aachen University",

}

RIS

TY - JOUR

T1 - Russian secondary prepositions: Methodology of analysis

T2 - 2020 Computational Models in Language and Speech Workshop, CMLS 2020

AU - Zakharov, Victor

AU - Golovina, Anastasia

AU - Alexeeva, Elena

AU - Gudkov, Vadim

N1 - Publisher Copyright: Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).

PY - 2020

Y1 - 2020

AB - The present study proposes a methodology of a corpus-based analysis of Russian secondary prepositions, primarily focusing on multiwords. Secondary prepositions are units motivated by content words (nouns, adverbs, verbs), which may be combined with primary prepositions to form multiword prepositions (MWPs). Multiword prepositions perform the grammatical function of a preposition in a certain position of a syntactic structure in some contexts and can be a free combination in others. A strict division between secondary multiword prepositions and equivalent free word combinations is not specified. This presents an issue in the task of building a language model as compound prepositional units are commonly mislabeled as free combinations or are labelled inconsistently, thus leading to parsing errors with far-reaching consequences. Our larger study aims at solving this problem by identifying, describing and eventually formalizing the full inventory of Russian MWPs, which demands a special corpus-based research. This paper is devoted to statistical analysis of the use of secondary multiword prepositions in corpora using prepositions expressing causal relations as the base material. The features of multiword prepositions in the function of a preposition are described. Statistical data on the ratio of the use of individual multiword expressions as prepositional units and as free combinations are provided.

KW - Corpus statistics

KW - Multiword prepositions

KW - Russian language

KW - Secondary prepositions

UR - http://www.scopus.com/inward/record.url?scp=85098227662&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85098227662

VL - 2780

SP - 187

EP - 201

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

Y2 - 12 November 2020 through 13 November 2020

ER -

ID: 84462130