Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
Text Preprocessing for Keyword and Key Phrase Extraction. / Troshina, A.
Internet and Modern Society. Human-Computer Communication . Springer Nature, 2026. стр. 105-112 (Communications in Computer and Information Science; Том 2534 CCIS).Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
}
TY - GEN
T1 - Text Preprocessing for Keyword and Key Phrase Extraction
AU - Troshina, A.
N1 - Conference code: XXVII
PY - 2026
Y1 - 2026
N2 - This article explores various text preprocessing techniques for the extraction of keywords and key phrases. It delves into methods such as text lemmatization, stop-word removal, and number removal, comparing their efficacy with unprocessed text in keyword extraction. Evaluation is based on the ability of keyword sets to retrieve relevant news articles from search engine queries. The study employs multiple keyword extraction tools for comprehensive analysis. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
AB - This article explores various text preprocessing techniques for the extraction of keywords and key phrases. It delves into methods such as text lemmatization, stop-word removal, and number removal, comparing their efficacy with unprocessed text in keyword extraction. Evaluation is based on the ability of keyword sets to retrieve relevant news articles from search engine queries. The study employs multiple keyword extraction tools for comprehensive analysis. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
KW - Keyword Extraction
KW - Lemmatization
KW - Search Engine
KW - Stop-Word Removal
KW - Text Preprocessing
KW - Extraction
KW - Information management
KW - Query processing
KW - Text processing
KW - Key-phrase
KW - Key-phrases extractions
KW - Keywords extraction
KW - News articles
KW - Pre-processing techniques
KW - Stop word
KW - Stop-word removal
KW - Text preprocessing
KW - Word removals
KW - Search engines
UR - https://www.mendeley.com/catalogue/e3efd923-3237-3b7f-905c-7df86fee7629/
U2 - 10.1007/978-3-031-96177-9_9
DO - 10.1007/978-3-031-96177-9_9
M3 - статья в сборнике материалов конференции
SN - 9783031961762
T3 - Communications in Computer and Information Science
SP - 105
EP - 112
BT - Internet and Modern Society. Human-Computer Communication
PB - Springer Nature
Y2 - 24 June 2024 through 26 June 2024
ER -
ID: 151442754