Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
In this paper we continue our efforts to evaluate matrix clustering algorithms. In our previous study we presented a test environment and results of preliminary experiments with the “separate” strategy for vertical partitioning. This strategy assigns a separate vertical partition for every cluster found by the algorithm, including inter-submatrix attribute group. In this paper we introduce two other strategies: the “replicate” strategy, which replicates inter-submatrix attributes to every cluster and the “retain” strategy, which assigns inter-submatrix attributes to their original clusters. We experimentally evaluate all strategies in a disk-based environment using the standard TPC-H workload and the PostgreSQL DBMS. We start with the study of record reconstruction methods in the PostgreSQL DBMS. Then, we apply partitioning strategies to three matrix clustering algorithms and evaluate both query performance and storage overhead of the resulting partitions. Finally, we compare the resulting partitioning schemes with the ideal partitioning scenario.
Язык оригинала | английский |
---|---|
Название основной публикации | Data Analytics and Management in Data Intensive Domains - XVIII International Conference, DAMDID/RCDL 2016, Revised Selected Papers |
Редакторы | Yannis Manolopoulos, Leonid Kalinichenko, Sergei O. Kuznetsov |
Издатель | Springer Nature |
Страницы | 163-177 |
Число страниц | 15 |
ISBN (печатное издание) | 9783319571348 |
DOI | |
Состояние | Опубликовано - 2017 |
Событие | 18th International Conference on Data Analytics and Management in Data-Intensive Domains, DAMDID 2016 - Ershovo, Российская Федерация Продолжительность: 11 окт 2016 → 14 окт 2016 |
Название | Communications in Computer and Information Science |
---|---|
Том | 706 |
ISSN (печатное издание) | 1865-0929 |
конференция | 18th International Conference on Data Analytics and Management in Data-Intensive Domains, DAMDID 2016 |
---|---|
Страна/Tерритория | Российская Федерация |
Город | Ershovo |
Период | 11/10/16 → 14/10/16 |
ID: 72709067