Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
In this paper we continue our efforts to evaluate matrix clustering algorithms. In our previous study we presented a test environment and results of preliminary experiments with the “separate” strategy for vertical partitioning. This strategy assigns a separate vertical partition for every cluster found by the algorithm, including inter-submatrix attribute group. In this paper we introduce two other strategies: the “replicate” strategy, which replicates inter-submatrix attributes to every cluster and the “retain” strategy, which assigns inter-submatrix attributes to their original clusters. We experimentally evaluate all strategies in a disk-based environment using the standard TPC-H workload and the PostgreSQL DBMS. We start with the study of record reconstruction methods in the PostgreSQL DBMS. Then, we apply partitioning strategies to three matrix clustering algorithms and evaluate both query performance and storage overhead of the resulting partitions. Finally, we compare the resulting partitioning schemes with the ideal partitioning scenario.
Original language | English |
---|---|
Title of host publication | Data Analytics and Management in Data Intensive Domains - XVIII International Conference, DAMDID/RCDL 2016, Revised Selected Papers |
Editors | Yannis Manolopoulos, Leonid Kalinichenko, Sergei O. Kuznetsov |
Publisher | Springer Nature |
Pages | 163-177 |
Number of pages | 15 |
ISBN (Print) | 9783319571348 |
DOIs | |
State | Published - 2017 |
Event | 18th International Conference on Data Analytics and Management in Data-Intensive Domains, DAMDID 2016 - Ershovo, Russian Federation Duration: 11 Oct 2016 → 14 Oct 2016 |
Name | Communications in Computer and Information Science |
---|---|
Volume | 706 |
ISSN (Print) | 1865-0929 |
Conference | 18th International Conference on Data Analytics and Management in Data-Intensive Domains, DAMDID 2016 |
---|---|
Country/Territory | Russian Federation |
City | Ershovo |
Period | 11/10/16 → 14/10/16 |
ID: 72709067