DOI

Syntactic complexity is an important feature of any text, both written and oral. The information about syntactic complexity is crucial for successful solution of many practical NLP tasks starting from intellectual understanding of texts and ending with automatic machine translation. Because of this, syntactic complexity and its measures are in the center of attention of NLP developers. Thus far, quite a series of different measures of syntactic complexity have been developed; in this paper, it is proposed to consider 10 syntactic measures that have been proposed for syntactic stylometric analysis. The pilot experiment described in this paper was made on automatic syntactic text annotation made by UDPipe syntactic parser, which was manually corrected. In our approach, particular attention is paid to the analysis of stability of certain measures of syntactic complexity and the analysis of their variation. Thus, we try to evaluate, which syntactic properties of Russian texts may be considered as inherent for the language as a whole, and which of them undergo some changes. To achieve this task, we analyze the corpus of Russian literary texts for three decades. Due to their high stylistic variability, texts of fiction may be considered as excellent data for assessing different levels of complexity. The obtained results show the effectiveness of different measures for estimating text syntactic complexity and revealing their correlation.

Язык оригиналаанглийский
Название основной публикацииProceedings of the 27th Conference of Open Innovations Association FRUCT, FRUCT 2020
РедакторыSergey Balandin, Luca Turchet, Tatiana Tyutina
ИздательInstitute of Electrical and Electronics Engineers Inc.
Страницы221-229
Число страниц9
ISBN (электронное издание)9789526924434
DOI
СостояниеОпубликовано - сен 2020
Опубликовано для внешнего пользованияДа
Событие27th Conference of Open Innovations Association FRUCT, FRUCT 2020 - Virtual, Trento, Италия
Продолжительность: 7 сен 20209 сен 2020

Серия публикаций

НазваниеConference of Open Innovation Association, FRUCT
Том2020-September
ISSN (печатное издание)2305-7254

конференция

конференция27th Conference of Open Innovations Association FRUCT, FRUCT 2020
Страна/TерриторияИталия
ГородVirtual, Trento
Период7/09/209/09/20

    Предметные области Scopus

  • Компьютерные науки (все)
  • Электротехника и электроника

ID: 97769714