Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
Analytical Distribution Model for Syntactic Variables Average Values in Russian literary Texts. / Martynenko, Gregory ; Sherstinova, Tatiana .
Digital Transformation and Global Society: 4th International Conference, DTGS 2019, St. Petersburg, Russia, June 19–21, 2019, Revised Selected Papers. ред. / D.A. Alexandrov ; et al. Cham : Springer Nature, 2019. стр. 719–731 (Communications in Computer and Information Science; Том 1038).Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
}
TY - GEN
T1 - Analytical Distribution Model for Syntactic Variables Average Values in Russian literary Texts
AU - Martynenko, Gregory
AU - Sherstinova, Tatiana
N1 - Conference code: 4
PY - 2019
Y1 - 2019
N2 - Digital technologies provide new possibilities for studying cultural heritage. Thus, literature research involving large text corpora allows to set and solve theoretical problems which previously had no prospects for their decision. For example, it has become possible to model the literary system for some definite literary period (i.e., for the Silver Age of Russian literature) and to classify prose writers according to their stylistic features. And more than that, it allows to solve more general theoretical problems. The given research was conducted on Russian literary texts of the early 20th century. The sample included 100 short stories by 100 different writers. The measurements were carried out for 5 syntactic variables. For each of these distributions, the most popular statistics were calculated. Basing on these data, we consider empirical verification of Lyapunov’s central limit theorem (CLT). The article validates the effectiveness of CLT theorem and the conditions for its implementation. Besides the normal (Gaussian) function we used another analytical model—the Hausstein function. It turned out that both theoretical distributions for each of five variables do not contradict the experimental data. However, the alternative analytical model (Hausstein function) has shown even better agreement with the experimental data. The obtained results may be used in computational linguistic studies and for research of Russian literary heritage.
AB - Digital technologies provide new possibilities for studying cultural heritage. Thus, literature research involving large text corpora allows to set and solve theoretical problems which previously had no prospects for their decision. For example, it has become possible to model the literary system for some definite literary period (i.e., for the Silver Age of Russian literature) and to classify prose writers according to their stylistic features. And more than that, it allows to solve more general theoretical problems. The given research was conducted on Russian literary texts of the early 20th century. The sample included 100 short stories by 100 different writers. The measurements were carried out for 5 syntactic variables. For each of these distributions, the most popular statistics were calculated. Basing on these data, we consider empirical verification of Lyapunov’s central limit theorem (CLT). The article validates the effectiveness of CLT theorem and the conditions for its implementation. Besides the normal (Gaussian) function we used another analytical model—the Hausstein function. It turned out that both theoretical distributions for each of five variables do not contradict the experimental data. However, the alternative analytical model (Hausstein function) has shown even better agreement with the experimental data. The obtained results may be used in computational linguistic studies and for research of Russian literary heritage.
KW - Russian literary texts
KW - Russian short stories
KW - Digital culture
KW - Syntax
KW - Stylistic variables
KW - Statistical distributions
KW - statistics
KW - Normal distribution
KW - Central limit theorem
UR - https://link.springer.com/chapter/10.1007/978-3-030-37858-5_61
U2 - 10.1007/978-3-030-37858-5
DO - 10.1007/978-3-030-37858-5
M3 - Conference contribution
SN - 9783030378578
T3 - Communications in Computer and Information Science
SP - 719
EP - 731
BT - Digital Transformation and Global Society
A2 - Alexandrov , D.A.
A2 - et al.,
PB - Springer Nature
CY - Cham
T2 - 4th International Conference on Digital Transformation and Global Society, DTGS 2019
Y2 - 19 June 2019 through 21 June 2019
ER -
ID: 51200239