Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
PosDB: A distributed column-store engine. / Chernishev, George; Galaktionov, Viacheslav; Grigorev, Valentin; Klyuchikov, Evgeniy; Smirnov, Kirill.
Perspectives of System Informatics - 11th International Andrei P. Ershov Informatics Conference, PSI 2017, Revised Selected Papers. ed. / Alexander K. Petrenko; Andrei Voronkov. Springer Nature, 2018. p. 88-94 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10742 LNCS).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
}
TY - GEN
T1 - PosDB: A distributed column-store engine
AU - Chernishev, George
AU - Galaktionov, Viacheslav
AU - Grigorev, Valentin
AU - Klyuchikov, Evgeniy
AU - Smirnov, Kirill
PY - 2018/1/1
Y1 - 2018/1/1
N2 - In this paper we present a novel disk-based distributed column-store, describe its architecture and discuss a number of technical solutions. Our system is essentially a query engine which was written completely from scratch. It is aimed for shared-nothing environments and supports different forms of parallel query processing. Query processing in PosDB is organized according to the classic Volcano pull-based model which is adapted for the column-store case. Currently, we support late materialization only, and therefore employ a join index data structure to represent positional information. In our system query plan can consist of both positional and value operators. PosDB has about a dozen of core operators among which several variants of selections and joins, aggregation. We also have several operators that ensure intra-query parallelism and operators for network interoperability. In its current state the system is fully capable of processing the Star Schema Benchmark in a local and distributed environment.
AB - In this paper we present a novel disk-based distributed column-store, describe its architecture and discuss a number of technical solutions. Our system is essentially a query engine which was written completely from scratch. It is aimed for shared-nothing environments and supports different forms of parallel query processing. Query processing in PosDB is organized according to the classic Volcano pull-based model which is adapted for the column-store case. Currently, we support late materialization only, and therefore employ a join index data structure to represent positional information. In our system query plan can consist of both positional and value operators. PosDB has about a dozen of core operators among which several variants of selections and joins, aggregation. We also have several operators that ensure intra-query parallelism and operators for network interoperability. In its current state the system is fully capable of processing the Star Schema Benchmark in a local and distributed environment.
UR - http://www.scopus.com/inward/record.url?scp=85041712878&partnerID=8YFLogxK
UR - http://www.mendeley.com/research/posdb-distributed-columnstore-engine
U2 - 10.1007/978-3-319-74313-4_7
DO - 10.1007/978-3-319-74313-4_7
M3 - Conference contribution
AN - SCOPUS:85041712878
SN - 9783319743127
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 88
EP - 94
BT - Perspectives of System Informatics - 11th International Andrei P. Ershov Informatics Conference, PSI 2017, Revised Selected Papers
A2 - Petrenko, Alexander K.
A2 - Voronkov, Andrei
PB - Springer Nature
T2 - 11th International Andrei Ershov Memorial Conference on Perspectives of System Informatics, PSI 2017
Y2 - 27 June 2017 through 29 June 2017
ER -
ID: 35272725