A New Approach for Ontology Generation from Relational Data Design Patterns

  • Conference paper
  • First Online:
Data Mining and Big Data (DMBD 2023)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2018))

Included in the following conference series:

  • 149 Accesses

Abstract

Knowledge graphs can effectively manage domain knowledge such as entities, properties, relations and events. Recent years have witnessed quite a few successful AI applications with the help of knowledge graphs. High-quality knowledge graphs are often built with a domain ontology, which is known to be a tedious and manual intensive task. Well-designed Relational Databases (RDBs) is an important source for obtaining ontologies and knowledge graphs. For this reason, designing automatic algorithms to generate ontologies from data has been a hot research topic. However, existing methods only consider basic ontology elements directly encoded in the RDB metadata, such as table names and foreign keys. This not only leads to erroneous or meaningless ontology classes, but also overlooks a large portion of hidden relations. As we know, database design patterns (DDP) often contain hidden semantics, which should be reconstructed into the ontology. This paper studies how typical DDPs could be recognized and how to extract ontologies from such DDPs. We focus on the JC3IEDM database, a rigorously-designed RDB in the field of joint operation, which is a widely-accepted industrial standard. Through analysis, we summarize 5 representative relational DDPs and identify their metadata characteristics. Based on these characteristics, we propose a new approach to automatically recognize and classify these patterns. Finally, we derive a rule-based method to generate OWL-format ontologies according to the identified DDPs. Empirically, we verify our approach using the whole JC3IEDM database, which contains more than 200 tables and thousands of data columns. The experimental results show that our method can automatically generate the ontology for this large-scale database, and the resultant ontology has improved quality compared to existing methods.

X. Li and R. Luo—Contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Germany)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 58.84
Price includes VAT (Germany)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 74.89
Price includes VAT (Germany)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Gamma, E., Helm, R., Johnson, R.E., Vlissides, J.M.: Design patterns: elements of reuseable object-oriented software (1994). https://api.semanticscholar.org/CorpusID:54007465

  2. Giese, M., et al.: Optique: zooming in on big data. Computer 48, 60–67 (2015). https://api.semanticscholar.org/CorpusID:10348732

  3. Huang, G., Luo, C., Wu, K., Ma, Y., Zhang, Y., Liu, X.: Software-defined infrastructure for decentralized data lifecycle governance: principled design and open challenges. In: 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), pp. 1674–1683 (2019). https://api.semanticscholar.org/CorpusID:207758658

  4. Jiménez-Ruiz, E., et al.: BOOTOX: practical map** of RDBs to OWL 2. In: International Workshop on the Semantic Web (2015). https://api.semanticscholar.org/CorpusID:2287280

  5. Karwin, B.: SQL antipatterns: avoiding the pitfalls of database programming (2010). https://api.semanticscholar.org/CorpusID:63072457

  6. (MIP), M.I.P.: The joint c3 information exchange data model (jc3iedm main ipt3 v3.1.4) (2012)

    Google Scholar 

  7. Musen, M.A.: The protégé project: a look back and a look forward. AI Matters 14, 4–12 (2015). https://api.semanticscholar.org/CorpusID:13208034

  8. Pinkel, C., et al.: Incmap: a journey towards ontology-based data integration. In: Datenbanksysteme für Business, Technologie und Web (2017). https://api.semanticscholar.org/CorpusID:38803389

  9. **ao, G.: The virtual knowledge graph system Ontop. In: International Workshop on the Semantic Web (2020). https://api.semanticscholar.org/CorpusID:221317578

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qiang Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, X., Luo, R., Liu, K., Li, F., Wang, C., Wang, Q. (2024). A New Approach for Ontology Generation from Relational Data Design Patterns. In: Tan, Y., Shi, Y. (eds) Data Mining and Big Data. DMBD 2023. Communications in Computer and Information Science, vol 2018. Springer, Singapore. https://doi.org/10.1007/978-981-97-0844-4_12

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0844-4_12

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0843-7

  • Online ISBN: 978-981-97-0844-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation