Research output: Contribution to conference › Paper › peer-review
Citation Style Classification: a Comparison of Machine Learning Approaches. / Kopan, Artyom; Smirnova, Anna N.; Shchuckin, Ilya; Makeev, Vladislav; Chernishev, George A.
2023. 1058-1064 Paper presented at 2023 IEEE International Conference on Data Mining Workshops, Шанхай, China.Research output: Contribution to conference › Paper › peer-review
}
TY - CONF
T1 - Citation Style Classification: a Comparison of Machine Learning Approaches
AU - Kopan, Artyom
AU - Smirnova, Anna N.
AU - Shchuckin, Ilya
AU - Makeev, Vladislav
AU - Chernishev, George A.
N1 - Conference code: 23
PY - 2023/12/4
Y1 - 2023/12/4
N2 - Citation style classification is a task that aims to detect a citation style according to which a given bibliographic entry is formatted. There are more than a hundred of recognized citation styles available, including popular ones such as ACM, IEEE, MLA, APA, and even more exotic ones. Automatic detection of citation style can be used in document linters, such as those intended for conference and term papers. Using automatic style classification enables people who assess articles in large quantities reduce cognitive load and increase efficiency. Apart from this, automatic detection of citation style can be used for reference parsing, topic classification, and reference extraction.In this paper we propose two novel approaches to citation style classification using both deep and classic machine learning methods. We evaluate them using a specially designed dataset, consisting of 6 million bibliographic records and spanning 91 citation styles. Our experiments showed that the proposed approaches significantly outperform the existing state-of-the-art solution, while also supporting almost five times more citation styles. © 2023 IEEE.
AB - Citation style classification is a task that aims to detect a citation style according to which a given bibliographic entry is formatted. There are more than a hundred of recognized citation styles available, including popular ones such as ACM, IEEE, MLA, APA, and even more exotic ones. Automatic detection of citation style can be used in document linters, such as those intended for conference and term papers. Using automatic style classification enables people who assess articles in large quantities reduce cognitive load and increase efficiency. Apart from this, automatic detection of citation style can be used for reference parsing, topic classification, and reference extraction.In this paper we propose two novel approaches to citation style classification using both deep and classic machine learning methods. We evaluate them using a specially designed dataset, consisting of 6 million bibliographic records and spanning 91 citation styles. Our experiments showed that the proposed approaches significantly outperform the existing state-of-the-art solution, while also supporting almost five times more citation styles. © 2023 IEEE.
KW - citation
KW - citation classification
KW - citation style classification
KW - reference classification
KW - Deep learning
KW - Automatic Detection
KW - Bibliographic records
KW - Citation
KW - Citation classification
KW - Citation style classification
KW - Cognitive loads
KW - Machine learning approaches
KW - Machine learning methods
KW - Reference classification
KW - Topic Classification
KW - Bibliographies
U2 - 10.1109/ICDMW60847.2023.00139
DO - 10.1109/ICDMW60847.2023.00139
M3 - Paper
SP - 1058
EP - 1064
T2 - 2023 IEEE International Conference on Data Mining Workshops
Y2 - 4 December 2023 through 4 December 2023
ER -
ID: 116882486