Discovering and merging related analytic datasets

作者:

Highlights:

• Attribute graphs for literal functional dependencies in hierarchical dimensions.

• Schema augmentation and reduction operations to eliminate tuple multiplication.

• Quality criteria and automatic repair operations for merging schema augmentations.

• Detailed description of the implementations within SAP HANA platform.

• Experimental evaluations on real datasets and usage scenarios.

摘要

•Attribute graphs for literal functional dependencies in hierarchical dimensions.•Schema augmentation and reduction operations to eliminate tuple multiplication.•Quality criteria and automatic repair operations for merging schema augmentations.•Detailed description of the implementations within SAP HANA platform.•Experimental evaluations on real datasets and usage scenarios.

论文关键词:Schema augmentation,Schema complement,Data quality,SAP HANA

论文评审过程:Received 30 October 2019, Revised 6 January 2020, Accepted 12 January 2020, Available online 17 January 2020, Version of Record 29 January 2020.

论文官网地址:https://doi.org/10.1016/j.is.2020.101495