Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
This article addresses the problem of hotel deduplication. Obvious approaches, such as comparing names or locations, fail because hotel descriptions differ across databases. The most accurate way to solve the problem is to employ professionally trained content managers, but this is expensive, so an automatic solution is needed. We propose a method to improve a hypothesis that a pair of hotels is identical, and compare its performance with alternative solutions. The proposed method satisfies the business requirements set for precision and recall in the hotel deduplication task. The method is based on a machine learning approach that uses several unique features, including features built with the help of computer vision algorithms.
Original language | English |
---|---|
Title of host publication | Knowledge Engineering and Semantic Web - 7th International Conference, KESW 2016, Proceedings |
Editors | Axel-Cyrille Ngonga Ngomo, Petr Křemen |
Publisher | Springer Nature |
Pages | 230-240 |
Number of pages | 11 |
ISBN (Print) | 9783319458793 |
DOI | |
Publication status | Published - 2016 |
Event | 7th International Conference on Knowledge Engineering and Semantic Web, KESW 2016 - Prague, Czech Republic. Duration: 21 Sep 2016 → 23 Sep 2016 |
Name | Communications in Computer and Information Science |
---|---|
Volume | 649 |
ISSN (Print) | 1865-0929 |
Conference | 7th International Conference on Knowledge Engineering and Semantic Web, KESW 2016 |
---|---|
Country/Territory | Czech Republic |
City | Prague |
Period | 21/09/16 → 23/09/16 |
ID: 86415654