Research output: Scientific publications in periodicals › conference article in a journal › peer-reviewed
Morphological tagging of Russian texts of the XIXth century. / Zakharov, Victor; Volkov, Sergei.
In: Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science), Vol. 3206, 01.12.2004, pp. 235-242.
TY - JOUR
T1 - Morphological tagging of Russian texts of the XIXth century
AU - Zakharov, Victor
AU - Volkov, Sergei
PY - 2004/12/1
Y1 - 2004/12/1
N2 - The tagging of Russian texts of the XIXth century has been evaluated. The reasons why some words remained unknown to the tagger, i.e. were left without lemmas and grammatical features, have been determined. The investigation showed that the main reasons for unknown words were as follows: 1) incompleteness of the tagger dictionary, particularly for XIXth-century vocabulary; 2) failure to tag word-formation derivatives; 3) problems with some inflection models of Old Russian; 4) insufficient graphemic analysis; 5) inability of taggers to process multiword expressions. The results provide a baseline for improving premorphological processing of Russian texts and for developing more sophisticated approaches to morphological analysis.
UR - http://www.scopus.com/inward/record.url?scp=22944489901&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:22944489901
VL - 3206
SP - 235
EP - 242
JO - Lecture Notes in Computer Science
JF - Lecture Notes in Computer Science
SN - 0302-9743
T2 - 7th International Conference TSD 2004: Text, Speech and Dialogue
Y2 - 8 September 2004 through 11 September 2004
ER -
ID: 30268644