An approach for the extensional integration of data sources with heterogeneous representation formats

作者:

Highlights:

摘要

In this paper we propose an approach for the extensional integration of data sources with heterogeneous representation formats. The proposed approach is based on the exploitation of a new model, called E-SDR-Network, for representing and handling, at the extensional level, heterogeneous data sources, ranging from databases to XML documents, object exchange model graphs and other semi-structured data. Due to the specific features of E-SDR-Network, the proposed extensional integration methodology is capable of: (i) easily handling null or unknown values, (ii) producing consistent query answers from possibly inconsistent data and (iii) reconstructing, at the extensional level, the content of each data source involved in the integration task. Finally, we show that E-SDR-Network and the proposed extensional integration algorithm are the counterpart, at the extensional level, of the SDR-Network conceptual model and the associated intensional integration algorithm, already proposed in the literature. Therefore, in the whole, we obtain a complete approach consisting of two components performing synergically both the intensional and the extensional integration of data sources having heterogeneous data representation formats.

论文关键词:Extensional integration,Data sources,Heterogeneous representation formats

论文评审过程:Received 17 July 2002, Revised 24 September 2002, Accepted 23 October 2002, Available online 19 November 2002.

论文官网地址:https://doi.org/10.1016/S0169-023X(02)00192-1