Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Writer identification based on letter frequency distribution. / Diurdeva, Polina; Mikhailova, Elena; Shalymov, Dmitry.
19th Conference of Open Innovations Association, FRUCT 2016. ed. / Tatiana Tyutina; Sergey Balandin. Institute of Electrical and Electronics Engineers Inc., 2016. p. 24-30 7892179.
TY - GEN
T1 - Writer identification based on letter frequency distribution
AU - Diurdeva, Polina
AU - Mikhailova, Elena
AU - Shalymov, Dmitry
PY - 2016
Y1 - 2016
N2 - The writer identification problem has recently gained importance because of the huge number of documents available in digital form. This work investigates an approach based on the frequencies of letter combinations for classifying documents by authorship. It examines and compares four distance measures between a text of unknown authorship and an author's profile: the L1 measure, the Kullback-Leibler divergence, the base metric of the Common N-gram (CNG) method [8], and a variation of the CNG dissimilarity measure proposed in [12]. The comparison identifies cases in which one metric outperforms the others for a specific combination of parameters. Experiments are conducted on several Russian and English corpora.
AB - The writer identification problem has recently gained importance because of the huge number of documents available in digital form. This work investigates an approach based on the frequencies of letter combinations for classifying documents by authorship. It examines and compares four distance measures between a text of unknown authorship and an author's profile: the L1 measure, the Kullback-Leibler divergence, the base metric of the Common N-gram (CNG) method [8], and a variation of the CNG dissimilarity measure proposed in [12]. The comparison identifies cases in which one metric outperforms the others for a specific combination of parameters. Experiments are conducted on several Russian and English corpora.
UR - http://www.scopus.com/inward/record.url?scp=85018627306&partnerID=8YFLogxK
U2 - 10.23919/FRUCT.2016.7892179
DO - 10.23919/FRUCT.2016.7892179
M3 - Conference contribution
SP - 24
EP - 30
BT - 19th Conference of Open Innovations Association, FRUCT 2016
A2 - Tyutina, Tatiana
A2 - Balandin, Sergey
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 19th Conference of Open Innovations Association, FRUCT 2016
Y2 - 7 November 2016 through 11 November 2016
ER -
ID: 7614470