Standard

Readability and Scientific Texts Quality for the Automatic Summarization. / Makarova, Olga; Yagunova, Elena.

2015. 28 Abstract from Multilingualism in Specialized Communication: Challenges and Opportunities in the Digital Age. 20th European Symposium on Languages for Special Purposes, Vienna, Austria.

Research output: Contribution to conferenceAbstract

Harvard

Makarova, O & Yagunova, E 2015, 'Readability and Scientific Texts Quality for the Automatic Summarization', Multilingualism in Specialized Communication: Challenges and Opportunities in the Digital Age. 20th European Symposium on Languages for Special Purposes, Vienna, Austria, 7/07/15 - 9/07/15 pp. 28. <https://fedora.phaidra.univie.ac.at/fedora/get/o:406394/bdef:Content/download>

APA

Makarova, O., & Yagunova, E. (2015). Readability and Scientific Texts Quality for the Automatic Summarization. 28. Abstract from Multilingualism in Specialized Communication: Challenges and Opportunities in the Digital Age. 20th European Symposium on Languages for Special Purposes, Vienna, Austria. https://fedora.phaidra.univie.ac.at/fedora/get/o:406394/bdef:Content/download

Vancouver

Makarova O, Yagunova E. Readability and Scientific Texts Quality for the Automatic Summarization. 2015. Abstract from Multilingualism in Specialized Communication: Challenges and Opportunities in the Digital Age. 20th European Symposium on Languages for Special Purposes, Vienna, Austria.

Author

Makarova, Olga ; Yagunova, Elena. / Readability and Scientific Texts Quality for the Automatic Summarization. Abstract from Multilingualism in Specialized Communication: Challenges and Opportunities in the Digital Age. 20th European Symposium on Languages for Special Purposes, Vienna, Austria.

BibTeX

@conference{5f8bca60f82b484f82ca2942614bb483,
title = "Readability and Scientific Texts Quality for the Automatic Summarization",
abstract = "Automatic summarization is a well known problem in natural language processing with many applications in different areas. Automatic creation of summaries for scientific texts may be useful in bibliographic databases, professional and science communications and education. Summarization systems for scientific texts only sometimes use specific style features and academic writing traditions to increase quality. In this work we describe a flexible automatic summarization system for scientific papers written in Russian that chooses a strategy based on the “”substyle”” of an input text. We implemented three main approaches to extraction-based summarization: statistical (n-gram tf-idf), structural (text positions) and semantic (lexical chains). Using simple text characteristics, such as paragraph length, number of sections and number of paragraphs, and integrative features (entropy and readability) system can decide which combination of methods and weights will produce a better summary.",
keywords = "automatic summarization text entropy readability scientific texts lexical chains",
author = "Olga Makarova and Elena Yagunova",
year = "2015",
language = "не определен",
pages = "28",
note = "null ; Conference date: 07-07-2015 Through 09-07-2015",
url = "https://lsp2015.univie.ac.at/",

}

RIS

TY - CONF

T1 - Readability and Scientific Texts Quality for the Automatic Summarization

AU - Makarova, Olga

AU - Yagunova, Elena

PY - 2015

Y1 - 2015

N2 - Automatic summarization is a well known problem in natural language processing with many applications in different areas. Automatic creation of summaries for scientific texts may be useful in bibliographic databases, professional and science communications and education. Summarization systems for scientific texts only sometimes use specific style features and academic writing traditions to increase quality. In this work we describe a flexible automatic summarization system for scientific papers written in Russian that chooses a strategy based on the “”substyle”” of an input text. We implemented three main approaches to extraction-based summarization: statistical (n-gram tf-idf), structural (text positions) and semantic (lexical chains). Using simple text characteristics, such as paragraph length, number of sections and number of paragraphs, and integrative features (entropy and readability) system can decide which combination of methods and weights will produce a better summary.

AB - Automatic summarization is a well known problem in natural language processing with many applications in different areas. Automatic creation of summaries for scientific texts may be useful in bibliographic databases, professional and science communications and education. Summarization systems for scientific texts only sometimes use specific style features and academic writing traditions to increase quality. In this work we describe a flexible automatic summarization system for scientific papers written in Russian that chooses a strategy based on the “”substyle”” of an input text. We implemented three main approaches to extraction-based summarization: statistical (n-gram tf-idf), structural (text positions) and semantic (lexical chains). Using simple text characteristics, such as paragraph length, number of sections and number of paragraphs, and integrative features (entropy and readability) system can decide which combination of methods and weights will produce a better summary.

KW - automatic summarization text entropy readability scientific texts lexical chains

M3 - тезисы

SP - 28

Y2 - 7 July 2015 through 9 July 2015

ER -

ID: 6935423