In this paper information extraction method for the restaurant recommendation system is proposed. We aim at the development of an information extraction (IE) system which is intended to be a module of the recommendation system. The IE system is to gather information about different aspects of restaurants from online reviews, structure it and feed the recommendation module with the obtained data. The analyzed frames include service and food quality, cuisine, price level, noise level, etc. In this paper service quality, cuisine type and food quality are considered. As part of corpus preprocessing phase, a method for Russian reviews corpus analysis (as part of information extraction) is proposed. Its importance is shown at the experimental phase, when the application of machine learning techniques to aspects extraction is analyzed. It is shown that the ideas obtained at the corpus preprocessing stage can help to improve machine learning models performance.
Original languageEnglish
Pages (from-to)201-220
JournalLecture Notes in Computer Science
Volume8856
Issue number8856
DOIs
StatePublished - 2014

    Research areas

  • corpus analysis, restaurant reviews, information extraction, recommendation system, machine learning

ID: 5732817