In this paper we present a novel disk-based distributed column-store, describe its architecture and discuss a number of technical solutions. Our system is essentially a query engine which was written completely from scratch. It is aimed for shared-nothing environments and supports different forms of parallel query processing. Query processing in PosDB is organized according to the classic Volcano pull-based model which is adapted for the column-store case. Currently, we support late materialization only, and therefore employ a join index data structure to represent positional information. In our system query plan can consist of both positional and value operators. PosDB has about a dozen of core operators among which several variants of selections and joins, aggregation. We also have several operators that ensure intra-query parallelism and operators for network interoperability. In its current state the system is fully capable of processing the Star Schema Benchmark in a local and distributed environment.

Original languageEnglish
Title of host publicationPerspectives of System Informatics - 11th International Andrei P. Ershov Informatics Conference, PSI 2017, Revised Selected Papers
EditorsAlexander K. Petrenko, Andrei Voronkov
PublisherSpringer Nature
Pages88-94
Number of pages7
ISBN (Print)9783319743127
DOIs
StatePublished - 1 Jan 2018
Event11th International Andrei Ershov Memorial Conference on Perspectives of System Informatics, PSI 2017 - Moscow, Russian Federation
Duration: 27 Jun 201729 Jun 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10742 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Andrei Ershov Memorial Conference on Perspectives of System Informatics, PSI 2017
Country/TerritoryRussian Federation
CityMoscow
Period27/06/1729/06/17

    Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

ID: 35272725