In this paper we analyze news text collections (clusters) via extracting their paraphrase headlines into a paraphrase graph and working with this graph. Our aim is to test whether news headline is an appropriate form of news text compression. Different types of news collections: dynamic, static and combined (both dynamic and static) clusters are analyzed and it is shown that their respective paraphrase graphs reflect the characteristics of the texts. We also automatically extract the most informationally important linked fragments of news texts, and these fragments characterize news texts as either informative, conveying some information, or publicistic ones, trying to affect the readers emotionally. It is shown that news headlines of the informative type do represent their respective compressed news reports.
Original languageEnglish
Title of host publicationSocial Informatics
Subtitle of host publication10th International Conference, SocInfo 2018, St. Petersburg, Russia, September 25-28, 2018, Proceedings, Part II
PublisherSpringer Nature
Pages139-147
Number of pages9
Volume11186 LNCS
ISBN (Electronic)978-3-030-01159-8
ISBN (Print)978-3-030-01158-1
DOIs
StatePublished - 2018

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume11186
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

    Research areas

  • News cluster, Paraphrase graph, Paraphrase extraction, Linked text segments, Text analysis

    Scopus subject areas

  • Computer Science (miscellaneous)
  • Language and Linguistics

ID: 37683657