Fast Accurate Discovery of Tuple Inclusion Dependencies

Mengfei Shen, Hideyuki Kawashima, Kazuhiro Saito

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Inclusion dependencies (IND) is an important problem in relational database, relevant to data integration, query optimization and various data management tasks. The discovery of IND has been addressed by many studies following different strategies, while IND detection still needs improvement as the complexity and diversity of real-life data increase. Conventional IND is only for column-to-column dimension, which is not applicable to lots of data processing tasks. The concept of dependency can be expanded. Based on the understanding of the conventional IND and approximate approach FAIDA, we present our algorithm for detecting tuple IND, converting column-to-column detection to row-to-row dimension, more in line with real-world data retrieval tasks in distributed system. Through probabilistic and accurate detection and the use of multi-threading, both accuracy and performance are guaranteed and IND detection performance is taken to a new level.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE International Conference on Smart Computing, SMARTCOMP 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages246-251
Number of pages6
ISBN (Electronic)9781665481526
DOIs
Publication statusPublished - 2022
Event8th IEEE International Conference on Smart Computing, SMARTCOMP 2022 - Espoo, Finland
Duration: 2022 Jun 202022 Jun 24

Publication series

NameProceedings - 2022 IEEE International Conference on Smart Computing, SMARTCOMP 2022

Conference

Conference8th IEEE International Conference on Smart Computing, SMARTCOMP 2022
Country/TerritoryFinland
CityEspoo
Period22/6/2022/6/24

Keywords

  • Data Integration
  • Data Profiling
  • Inclusion Dependencies

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Fast Accurate Discovery of Tuple Inclusion Dependencies'. Together they form a unique fingerprint.

Cite this