Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
Extract summarization algorithms help identify significant information from the news by extracting meaningful sentences from the original text. The information background existing at the time of the news release often significantly affects its content. Such background can distort the text summarization algorithm working results. The study was conducted with the example of the theme “coronavirus” (COVID-19), which at the time of the study was one of the main topics in news feeds. Experiments were carried out on sports news articles, concerned football. This news area was selected because it is not related to medical topics. The TextRank algorithm for sport news extraction was applied in two ways. First, the key information from the source text of news was extracted. Then, a list of the COVID related words was created and the key information from news without considering words from this list was extracted. Our approach showed that mentioning a popular theme such as COVID that is not related to sports can have a negative impact on the text summarization algorithm. We suggest that to obtain accurate results of the algorithm operation, it is necessary to first compile a dictionary of terms related to the coronavirus theme and then exclude them when identifying the main content of news texts.
Original language | English |
---|---|
Title of host publication | Computational Data and Social Networks - 10th International Conference, CSoNet 2021, Proceedings |
Editors | David Mohaisen, Ruoming Jin |
Publisher | Springer Nature |
Pages | 351-360 |
Number of pages | 10 |
ISBN (Print) | 9783030914332 |
DOIs | |
State | Published - 2021 |
Event | 10th International Conference on Computational Data and Social Networks, CSoNet 2021 - Virtual Online Duration: 15 Nov 2021 → 17 Nov 2021 |
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13116 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference | 10th International Conference on Computational Data and Social Networks, CSoNet 2021 |
---|---|
City | Virtual Online |
Period | 15/11/21 → 17/11/21 |
ID: 91076367