In this paper information extraction method for the restaurant recommendation system is proposed. We aim at the development of an information extraction (IE) system which is intended to be a module of the recommendation system. The IE system is to gather information about different aspects of restaurants from online reviews, structure it and feed the recommendation module with the obtained data. The analyzed frames include service and food quality, cuisine, price level, noise level, etc. In this paper service quality, cuisine type and food quality are considered. As part of corpus preprocessing phase, a method for Russian reviews corpus analysis (as part of information extraction) is proposed. Its importance is shown at the experimental phase, when the application of machine learning techniques to aspects extraction is analyzed. It is shown that the ideas obtained at the corpus preprocessing stage can help to improve machine learning models performance.
Язык оригиналаанглийский
Страницы (с-по)201-220
ЖурналLecture Notes in Computer Science
Том8856
Номер выпуска8856
DOI
СостояниеОпубликовано - 2014

ID: 5732817