AuMixDw: Towards an automated hybrid approach for building XML data warehouses

作者:

Highlights:

摘要

In this paper, we present a mixed approach for building XML data warehouses from both XML data sources and user requirements. Our proposed approach aims at obtaining a unique multidimensional schema of theXML data warehouse. The approach follows three steps. During the first step, an intermediate SBVR model extended with template rules is used to accommodate a data warehousing system and to facilitate the automatic identification of facts and dimensions from the user requirements. After modelling XML data sources in UML, the second step corresponds to identifying candidate DW schemata from such data sources. The third step compares these candidate schemata with the reference model obtained from the user requirements. In this step, we propose to adapt similarity metric-extended Boolean models (BIR) and to use them in order to measure, rank and select the most appropriate data warehouse schema. Such a schema should best describe the data sources and exhaustively cover all the needed user requirements. To demonstrate our approach, we present a case study of the bibliographic database dblp.

论文关键词:XML,SBVR,BIR,Data warehouse,User requirement,Dimensional modelling

论文评审过程:Received 16 May 2018, Revised 22 January 2019, Accepted 30 January 2019, Available online 20 February 2019, Version of Record 8 April 2019.

论文官网地址:https://doi.org/10.1016/j.datak.2019.01.004