Corpus-Based Information Extraction and Opinion Mining for the Restaurant Recommendation System

Research output: Contribution to journal › Article

Department of Information Systems in Arts and Humanities

DOI

https://doi.org/10.1007/978-3-319-11397-5_21
Other version

E. Pronoza
E. Yagunova
S. Volskaya

In this paper corpus-based information extraction and opinion mining method is proposed. Our domain is restaurant reviews, and our information extraction and opinion mining module is a part of a Russian knowledge-based recommendation system. Our method is based on thorough corpus analysis and automatic selection of machine learning models and feature sets. We also pay special attention to the verification of statistical significance. According to the results of the research, Naive Bayes models perform well at classifying sentiment with respect to a restaurant aspect, while Logistic Regression is good at deciding on the relevance of a user’s review. The approach proposed can be used in similar domains, for example, hotel reviews, with data represented by colloquial non-structured texts (in contrast with the domain of technical products, books, etc.) and for other languages with rich morphology and free word order.

Original language	English
Pages (from-to)	272-284
Journal	Lecture Notes in Computer Science
Volume	8791
DOIs	https://doi.org/10.1007/978-3-319-11397-5_21
State	Published - 2014

Research areas

Information extraction, Opinion mining, Restaurant recommendation system, Machine learning

ID: 5730060

Corpus-Based Information Extraction and Opinion Mining for the Restaurant Recommendation System

Links

DOI

Research areas