Abstract
Knowledge graphs can effectively manage domain knowledge such as entities, properties, relations and events. Recent years have witnessed quite a few successful AI applications with the help of knowledge graphs. High-quality knowledge graphs are often built with a domain ontology, which is known to be a tedious and manual intensive task. Well-designed Relational Databases (RDBs) is an important source for obtaining ontologies and knowledge graphs. For this reason, designing automatic algorithms to generate ontologies from data has been a hot research topic. However, existing methods only consider basic ontology elements directly encoded in the RDB metadata, such as table names and foreign keys. This not only leads to erroneous or meaningless ontology classes, but also overlooks a large portion of hidden relations. As we know, database design patterns (DDP) often contain hidden semantics, which should be reconstructed into the ontology. This paper studies how typical DDPs could be recognized and how to extract ontologies from such DDPs. We focus on the JC3IEDM database, a rigorously-designed RDB in the field of joint operation, which is a widely-accepted industrial standard. Through analysis, we summarize 5 representative relational DDPs and identify their metadata characteristics. Based on these characteristics, we propose a new approach to automatically recognize and classify these patterns. Finally, we derive a rule-based method to generate OWL-format ontologies according to the identified DDPs. Empirically, we verify our approach using the whole JC3IEDM database, which contains more than 200 tables and thousands of data columns. The experimental results show that our method can automatically generate the ontology for this large-scale database, and the resultant ontology has improved quality compared to existing methods.
X. Li and R. Luo—Contributed equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Gamma, E., Helm, R., Johnson, R.E., Vlissides, J.M.: Design patterns: elements of reuseable object-oriented software (1994). https://api.semanticscholar.org/CorpusID:54007465
Giese, M., et al.: Optique: zooming in on big data. Computer 48, 60–67 (2015). https://api.semanticscholar.org/CorpusID:10348732
Huang, G., Luo, C., Wu, K., Ma, Y., Zhang, Y., Liu, X.: Software-defined infrastructure for decentralized data lifecycle governance: principled design and open challenges. In: 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), pp. 1674–1683 (2019). https://api.semanticscholar.org/CorpusID:207758658
Jiménez-Ruiz, E., et al.: BOOTOX: practical map** of RDBs to OWL 2. In: International Workshop on the Semantic Web (2015). https://api.semanticscholar.org/CorpusID:2287280
Karwin, B.: SQL antipatterns: avoiding the pitfalls of database programming (2010). https://api.semanticscholar.org/CorpusID:63072457
(MIP), M.I.P.: The joint c3 information exchange data model (jc3iedm main ipt3 v3.1.4) (2012)
Musen, M.A.: The protégé project: a look back and a look forward. AI Matters 14, 4–12 (2015). https://api.semanticscholar.org/CorpusID:13208034
Pinkel, C., et al.: Incmap: a journey towards ontology-based data integration. In: Datenbanksysteme für Business, Technologie und Web (2017). https://api.semanticscholar.org/CorpusID:38803389
**ao, G.: The virtual knowledge graph system Ontop. In: International Workshop on the Semantic Web (2020). https://api.semanticscholar.org/CorpusID:221317578
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Li, X., Luo, R., Liu, K., Li, F., Wang, C., Wang, Q. (2024). A New Approach for Ontology Generation from Relational Data Design Patterns. In: Tan, Y., Shi, Y. (eds) Data Mining and Big Data. DMBD 2023. Communications in Computer and Information Science, vol 2018. Springer, Singapore. https://doi.org/10.1007/978-981-97-0844-4_12
Download citation
DOI: https://doi.org/10.1007/978-981-97-0844-4_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0843-7
Online ISBN: 978-981-97-0844-4
eBook Packages: Computer ScienceComputer Science (R0)