In this paper an approach to primary business-media analysis for further information extraction is proposed. We consider business events representation by looking into part of speech (POS) distribution across tagged n-grams. Two Russian business-media corpora, Russian Business Consulting (RBC) and Commersant, are analyzed, and it is shown that they differ not only in style or themes coverage but also in the range of contexts for the words which mark business-events. Purchase, merger and ownership events are given a closer look at, and it is shown that they are mostly represented by noun phrases in both corpora rather than verbal phrases.
Original languageEnglish
Title of host publicationResearch in Computing Science: Advances in Soft Computing Techniques
Pages103-114
StatePublished - 2013
Externally publishedYes

    Research areas

  • Information extraction, business media

ID: 4656479