Corpus-Based Information Extraction and Opinion Mining for the Restaurant Recommendation System

Corpus-Based Information Extraction and Opinion Mining for the Restaurant Recommendation System

Результаты исследований: Научные публикации в периодических изданиях › статья

Кафедра информационных систем в искусстве и гуманитарных науках

Ссылки

http://link.springer.com/chapter/10.1007/978-3-319-11397-5_21

DOI

https://doi.org/10.1007/978-3-319-11397-5_21
Другие версии

E. Pronoza
E. Yagunova
S. Volskaya

In this paper corpus-based information extraction and opinion mining method is proposed. Our domain is restaurant reviews, and our information extraction and opinion mining module is a part of a Russian knowledge-based recommendation system. Our method is based on thorough corpus analysis and automatic selection of machine learning models and feature sets. We also pay special attention to the verification of statistical significance. According to the results of the research, Naive Bayes models perform well at classifying sentiment with respect to a restaurant aspect, while Logistic Regression is good at deciding on the relevance of a user’s review. The approach proposed can be used in similar domains, for example, hotel reviews, with data represented by colloquial non-structured texts (in contrast with the domain of technical products, books, etc.) and for other languages with rich morphology and free word order.

Язык оригинала	английский
Страницы (с-по)	272-284
Журнал	Lecture Notes in Computer Science
Том	8791
DOI	https://doi.org/10.1007/978-3-319-11397-5_21
Состояние	Опубликовано - 2014

ID: 5730060

Pure – это продукт компании Elsevier
На данном информационном ресурсе могут быть опубликованы архивные материалы
с упоминанием физических и юридических лиц, включенных Министерством юстиции
Российской Федерации в реестр иностранных агентов

Вход в Pure