A window function is a generalization of the aggregation operation. Unlike aggregation, the cardinality of its output is always the same as the cardinality of input. That is, the semantics of this operator imply computing values for extra attributes for each row, depending on its context, either expressed by a sliding window or a previously evaluated row. Window functions are a very powerful tool, which is also popular among data analysts and supported by the majority of industrial DBMSes. It allows to gracefully express quite complex use-cases, such as running sums and averages, local maximum and minimum, and different types of ranking. Since they can be expressed without self-joins and correlated subqueries, their evaluation can be performed much more efficiently. In this paper we discuss an implementation of window functions inside a disk-based column-store with late materialization. Late materialization is a technique that aims to keep tuple reconstruction back from individual columns as long as possible. Initially popular in the late 00’s, it is rarely considered nowadays. However, in case of window functions it allows to substantially lower memory footprint. Another contribution of this paper is the application of a segment tree to computing RANGE-based window functions.

Original languageEnglish
Title of host publicationModel and Data Engineering- 9th International Conference, MEDI 2019, Proceedings
EditorsKlaus-Dieter Schewe, Neeraj Kumar Singh
PublisherSpringer Nature
Pages303-313
Number of pages11
ISBN (Print)9783030320645
DOIs
StatePublished - 1 Jan 2019
Event9th International Conference on Model and Data Engineering, MEDI 2019 - Toulouse, France
Duration: 28 Oct 201931 Oct 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11815 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Model and Data Engineering, MEDI 2019
Country/TerritoryFrance
CityToulouse
Period28/10/1931/10/19

    Research areas

  • Aggregation, Analytical function, Column-store, Late materialization, OLAP, PosDB, Query processing, Window function

    Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

ID: 49653804