On reconciling data integration, data exchange, and peer data management

Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Riccardo Rosati.
In Proceedings of the Twentysixth ACM SIGACT SIGMOD SIGART Symposium on Principles of Database Systems (PODS 2007), pages 133-142, ACM Press, 2007. ISBN 978-1-59593-685-1.

 

Abstract:

Data exchange and virtual data integration have been the subject of several investigations in the recent literature. At the same time, the notion of peer data management has emerged as a powerful abstraction of many forms of flexible and dynamic data-centered distributed systems. Although research on the above issues has progressed considerably in the last years, a clear understanding on how to combine data exchange and data integration in peer data management is still missing. This is the subject of the present paper. We start our investigation by first proposing a novel framework for peer data exchange, showing that it is a generalization of the classical data exchange setting. We also present algorithms for all the relevant data exchange tasks, and show that they can all be done in polynomial time with respect to data complexity. Based on the motivation that typical mappings and integrity constraints found in data integration are not captured by peer data exchange, we extend the framework to incorporate these features. One of the main difficulties is that the constraints of this new class are not amenable to materialization. We address this issue by resorting to a suitable combination of virtual and materialized data exchange, showing that the resulting framework is a generalization of both classical data exchange and classical data integration, and that the new setting incorporates the most expressive types of mapping and constraints considered in the two contexts. Finally, we present algorithms for all the relevant data management tasks also in the new setting, and show that, again, their data complexity is polynomial.

Bibtex entry:

@String{PODS-07 = "Proceedings of the Twentysixth ACM SIGACT SIGMOD SIGART Symposium on Principles of Database Systems (PODS~2007)"}

@String{ACM = "{ACM} Press"}

@Inproceedings{DLLR07,
author = "De Giacomo, Giuseppe and Domenico Lembo and Maurizio Lenzerini and Riccardo Rosati",
title = "On reconciling data integration, data exchange, and peer data management",
booktitle = PODS-07,
pages = "133--142",
publisher = ACM,
year = 2007,
isbn = "978-1-59593-685-1",
}

Link to electronic version of published paper