Performing external join operator on PostgreSQL with data transfer approach

Ryota Takizawa, Hideyuki Kawashima, Ryuya Mitsuhashi, Osamu Tatebe

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

With the development of sensing devices, the size of data managed by human being has been rapidly increasing. To manage such huge data, relational database management system (RDBMS) plays a key role. RDBMS models the real world data as n-ary relational tables. Join operator is one of the most important relational operators, and its acceleration has been studied widely and deeply. How can an RDBMS provide such an efficient join operator? The performance improvement of join operator has been deeply studied for a decade, and many techniques are proposed already. The problem that we face is how to actually use such excellent techniques in real RDBMSs. We propose to implement an efficient join technique by the data transfer approach. The approach makes a hook point inside an RDBMS internal, and pulls data streams from the operator pipeline in the RDBMS, and applies our original join operator to the data, and finally returns the result to the operator pipeline in the RDBMS. The result of the experiment showed that our proposed method achieved 1.42x speedup compared with PostgreSQL. Our code is available on GitHub.

Original languageEnglish
Title of host publicationProceedings of International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2018
PublisherAssociation for Computing Machinery
Pages271-277
Number of pages7
ISBN (Electronic)9781450353724
DOIs
Publication statusPublished - 2018 Jan 28
Externally publishedYes
Event2018 International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2018 - Tokyo, Japan
Duration: 2018 Jan 282018 Jan 31

Publication series

NameACM International Conference Proceeding Series

Other

Other2018 International Conference on High Performance Computing in Asia-Pacific Region, HPC Asia 2018
Country/TerritoryJapan
CityTokyo
Period18/1/2818/1/31

Keywords

  • Parallel Hash Join
  • PostgreSQL
  • Relational database

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Performing external join operator on PostgreSQL with data transfer approach'. Together they form a unique fingerprint.

Cite this