Data Exchange: Semantics and Query Answering

  • Conference paper
  • First Online:
Database Theory — ICDT 2003 (ICDT 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2572))

Included in the following conference series:

Abstract

Data exchange is the problem of taking data structured under a source schema and creating an instance of a target schema that reflects the source data as accurately as possible. In this paper, we address foundational and algorithmic issues related to the semantics of data exchange and to query answering in the context of data exchange. These issues arise because, given a source instance, there may be many target instances that satisfy the constraints of the data exchange problem. We give an algebraic specification that selects, among all solutions to the data exchange problem, a special class of solutions that we call universal. A universal solution has no more and no less data than required for data exchange and it represents the entire space of possible solutions. We then identify fairly general, and practical, conditions that guarantee the existence of a universal solution and yield algorithms to compute a canonical universal solution efficiently.We adopt the notion of “certain answers” in indefinite databases for the semantics for query answering in data exchange. We investigate the computational complexity of computing the certain answers in this context and also study the problem of computing the certain answers of target queries by simply evaluating them on a canonical universal solution.

Research carried out while these authors were visiting scientists at the IBM Almaden Research Center. Kolaitis was also partially supported by NSF Grant IIS-9907419. Miller was also partially supported by a research grant from NSERC.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 42.79
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 53.49
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. S. Abiteboul, S. Cluet, and T. Milo. Correspondence and Translation for Heterogeneous Data. In ICDT, pages 351–363, 1997.

    Google Scholar 

  2. S. Abiteboul and O. M. Duschka. Complexity of Answering Queries Using Materialized Views. In PODS, pages 254–263, 1998.

    Google Scholar 

  3. S. Abiteboul and O. M. Duschka. Complexity of Answering Queries Using Materialized Views. Unpublished full version of [2], 2000.

    Google Scholar 

  4. C. Beeri and M.Y. Vardi. A Proof Procedure for Data Dependencies. JACM, 31(4):718–741, 1984.

    Article  MATH  MathSciNet  Google Scholar 

  5. A. Calì, D. Calvanese, G. D. Giacomo, and M. Lenzerini. Data Integration under Integrity Constraints. In CAiSE, pages 262–279, 2002.

    Google Scholar 

  6. M. A. Casanova, R. Fagin, and C. H. Papadimitriou. Inclusion Dependencies and their Interaction with Functional Dependencies. JCSS, 28(1):29–59, 1984.

    MATH  MathSciNet  Google Scholar 

  7. S. S. Cosmadakis and P. C. Kanellakis. Functional and Inclusion Dependencies: A Graph Theoretic Approach. In Advances in Computing Research, volume 3, pages 163–184. 1986.

    Google Scholar 

  8. R. Fagin. Horn Clauses and Database Dependencies. JACM, 29(4):952–985, Oct. 1982.

    Article  MATH  MathSciNet  Google Scholar 

  9. R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa. Data Exchange: Semantics and Query Answering. IBM Research Report, Nov. 2002.

    Google Scholar 

  10. M. Friedman, A. Y. Levy, and T. D. Millstein. Navigational Plans For Data Integration. In AAAI, pages 67–73, 1999.

    Google Scholar 

  11. A. Halevy. Answering Queries UsingViews:A Survey. VLDB Journal, pages 270–294, 2001.

    Google Scholar 

  12. R. Hull and M. Yoshikawa. ILOG: Declarative Creation and Manipulation of Object Identifiers. In VLDB, pages 455–468, 1990.

    Google Scholar 

  13. M. Lenzerini. Data Integration: A Theoretical Perspective. In PODS, pages 233–246, 2002.

    Google Scholar 

  14. A.Y. Levy, A. O. Mendelzon, Y. Sagiv, and D. Srivastava. Answering Queries Using Views. In PODS, pages 95–104, May 1995.

    Google Scholar 

  15. D. Maier, A. O. Mendelzon, and Y. Sagiv. Testing Implications of Data Dependencies. ACM TODS, 4(4):455–469, Dec. 1979.

    Article  Google Scholar 

  16. D. Maier, J. D. Ullman, and M.Y. Vardi. On the Foundations of the Universal Relation Model. ACM TODS, 9(2):283–308, June 1984.

    Article  MATH  MathSciNet  Google Scholar 

  17. J. A. Makowsky. Why Horn Formulas Matter in Computer Science: Initial Structures and Generic Examples. JCSS, 34(2/3):266–292, April/June 1987.

    MATH  MathSciNet  Google Scholar 

  18. R. J. Miller, L.M. Haas, and M. Hernández. Schema Map** as Query Discovery. In VLDB, pages 77–88, 2000.

    Google Scholar 

  19. L. Popa, Y. Velegrakis, R. J. Miller, M. A. Hernandez, and R. Fagin. Translating Web Data. In VLDB, pages 598–609, 2002.

    Google Scholar 

  20. N. C. Shu, B. C. Housel, and V. Y. Lum. CONVERT: A High Level Translation Definition Language for Data Conversion. Communications of the ACM, 18(10):557–567, 1975.

    Article  MATH  Google Scholar 

  21. N. C. Shu, B. C. Housel, R. W. Taylor, S. P. Ghosh, and V. Y. Lum. EXPRESS: A Data EXtraction, Processing, amd REStructuring System. TODS, 2(2):134–174, 1977.

    Article  Google Scholar 

  22. R. van der Meyden. The Complexity of Querying Indefinite Data about Linearly Ordered Domains. JCSS, 54:113–135, 1997.

    MATH  Google Scholar 

  23. R. van der Meyden. Logical Approaches to Incomplete Information:A Survey. In Logics for Databases and Information Systems, pages 307–356. Kluwer, 1998.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fagin, R., Kolaitis, P.G., Miller, R.J., Popa, L. (2003). Data Exchange: Semantics and Query Answering. In: Calvanese, D., Lenzerini, M., Motwani, R. (eds) Database Theory — ICDT 2003. ICDT 2003. Lecture Notes in Computer Science, vol 2572. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36285-1_14

Download citation

  • DOI: https://doi.org/10.1007/3-540-36285-1_14

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00323-6

  • Online ISBN: 978-3-540-36285-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Navigation