Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
Keyphrases provide a concise representation of the main content of a document and can be effectively used within information retrieval systems. In the paper, we deal with the keyphrase extraction problem when a given number of keyphrases for a text should be extracted. The research is focused on the keyphrase candidates ranking stage. In the domain, the question remains open of whether the keyphrase extraction quality can be improved by putting limits on the number of phrases of different lengths extracted during candidate ranking. We assume that the quality of resulting keyphrases can be enhanced if we introduce Limitations on the number of phrases of specific Lengths in the resulting set (LL-ranking strategy). The experiments are performed on the well-known INSPEC dataset of scientific abstracts. The obtained results show that the proposed limitations help to significantly increase the quality of extracted keyphrases in terms of P recision and F 1.
Original language | English |
---|---|
Title of host publication | Advances in Intelligent Systems and Computing IV- Selected Papers from the International Conference on Computer Science and Information Technologies, CSIT 2019 |
Editors | Natalya Shakhovska, Mykola O. Medykovskyy |
Publisher | Springer Nature |
Pages | 567-578 |
Number of pages | 12 |
ISBN (Print) | 9783030336943 |
DOIs | |
State | Published - 2020 |
Event | 14th International Scientific and Technical Conference on Computer Science and Information Technologies, CSIT 2019 - Lviv, Ukraine Duration: 17 Sep 2019 → 20 Sep 2019 |
Name | Advances in Intelligent Systems and Computing |
---|---|
Volume | 1080 AISC |
ISSN (Print) | 2194-5357 |
ISSN (Electronic) | 2194-5365 |
Conference | 14th International Scientific and Technical Conference on Computer Science and Information Technologies, CSIT 2019 |
---|---|
Country/Territory | Ukraine |
City | Lviv |
Period | 17/09/19 → 20/09/19 |
ID: 88341382