Exploitation of logical schema information can allow producing better physical designs for a database. In order to exploit this information, one has to extract it from the data stored in the database. Extraction should be performed using some kind of an algorithm that provides an acceptable level of result quality. This quality has to be ensured, for example, in terms of precision.

In this paper we consider a particular type of such information: functional dependencies. One of the well-known algorithms for extraction of functional dependencies is the TANE algorithm. We propose to study its precision-related properties which are relevant for its use in our automatic physical design tool. TANE, being an approximate algorithm, returns only a fraction of existing dependencies. It is also prone to false positives. In contrast with the previous research, which measured run times and memory consumption, we aim to evaluate the quality of this algorithm.

Finally, we briefly describe the context of this study—constructing an alternative physical design tuning system that would use the output of the TANE algorithm. The system is an ordinary vertical partitioning tool, but which operates without workload knowledge, relying on data characteristics. Our plan is to employ TANE inside the functional dependency detection component. Thus, the purpose of evaluation is to study to what extent the properties of the algorithm affect our goals.
Original languageEnglish
Title of host publicationModel and Data Engineering
Subtitle of host publication7th International Conference, MEDI 2017, Barcelona, Spain, October 4–6, 2017, Proceedings
PublisherSpringer Nature
Pages208-222
ISBN (Electronic)978-3-319-66854-3
ISBN (Print)978-3-319-66853-6
StatePublished - 2017
EventModel and Data Engineering: 7th International Conference - Barcelona, Spain
Duration: 4 Oct 20176 Oct 2017

Publication series

NameLecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
PublisherSpringer Nature
Volume10563
ISSN (Print)0302-9743

Conference

ConferenceModel and Data Engineering
Abbreviated titleMEDI 2017
Country/TerritorySpain
CityBarcelona
Period4/10/176/10/17

    Research areas

  • TANE, Physical design tuning, Vertical partitioning, Logical schema information, Functional dependency, Functional dependency detection, Experimentation

    Scopus subject areas

  • Computer Science(all)

ID: 71304573