In this paper an approach to primary business-media analysis for further information extraction is proposed. We consider business events representation by looking into part of speech (POS) distribution across tagged n-grams. Two Russian business-media corpora, Russian Business Consulting (RBC) and Commersant, are analyzed, and it is shown that they differ not only in style or themes coverage but also in the range of contexts for the words which mark business-events. Purchase, merger and ownership events are given a closer look at, and it is shown that they are mostly represented by noun phrases in both corpora rather than verbal phrases.
Язык оригиналаанглийский
Название основной публикацииResearch in Computing Science: Advances in Soft Computing Techniques
Страницы103-114
СостояниеОпубликовано - 2013
Опубликовано для внешнего пользованияДа

ID: 4656479