Abstract
The development of knowledge graph needs the support of a vast quantity of data. However, the amount of data increases rapidly is placing increasing demands on machines. Centralized data storage requires high-performance hosts to store data, which is costly and have single point of failure. Distributed data storage can reduce the cost of the machine greatly, and there is no single point of failure, but it has requirements for partition and storage of data collection. In the knowledge storage of specific domain, the way of graph data partition and storage vary from the different domain knowledge. To solve the above problems, a scheme of graph partition and distributed storage for domain-specific knowledge graphs is proposed. The proposed graph partition scheme pays attention to the correlation between the data, and divides the nodes affiliated each other into the same or similar partition. A distributed aggregation storage scheme is designed, which makes full use of cluster performance and solves the problem of data consistency during data insertion and update. The proposed distributed storage scheme based on HBase combines Neo4j to realize visual query effectively. Experimental results show the efficiency and the effectiveness of the proposed method in partition time, the number of edge-cut and update time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
**n, H., Jiangli, D., Jiangli, D.: Scalable aggregate keyword query over knowledge graph. Future Gener. Comput. Syst. 107, 588–600 (2020)
Li, W., Song, M., Tian, Y.: An ontology-driven cyberinfrastructure for intelligent spatiotemporal question answering and open knowledge discovery. ISPRS Int. J. Geo-Inf. 8(11), 496 (2019)
Barnawi, A., Alharbi, M., Chen, M.: Intelligent search and find system for robotic platform based on smart edge computing service. IEEE Access 8, 108821–108834 (2020)
Fensel, A., Akbar, Z., Kärle, E., et al.: Knowledge graphs for online marketing and sales of touristic services. Information 11(5), 253 (2020)
Sousa, R.T., Silva, S., Pesquita, C.: Evolving knowledge graph similarity for supervised learning in complex biomedical domains. BMC Bioinform. 21(1), 6 (2020)
Yuan, L., Huang, Z., Zhao, W., et al.: Interpreting and predicting social commerce intention based on knowledge graph analysis. Electron. Commer. Res. 20(1), 197–222 (2020)
Fan, T., Yan, L., Ma, Z.: Storing and querying fuzzy RDF(S) in HBase databases. Int. J. Intell. Syst. 35(4), 751–780 (2020)
Zhu, Z., Zhou, X., Shao, K.: A novel approach based on Neo4j for multi-constrained flexible job shop scheduling problem. Comput. Ind. Eng. 130, 671–686 (2019)
Fernandes, D., Bernardino, J.: Graph databases comparison: AllegroGraph, ArangoDB, InfiniteGraph, Neo4J, and OrientDB. In: DATA 2018, pp. 373–380 (2018)
Soudani, N.M., Fatemi, A., Nematbakhsh, M.: An investigation of big graph partitioning methods for distribution of graphs in vertex-centric systems. Distrib. Parallel Databases 38(1), 1–29 (2019)
Filippidou, I., Kotid, Y.: Online partitioning of multi-labeled graphs. In: Proceedings of the GRADES 2015, pp. 1–6. ACM (2015)
Lotfifar, F., Johnson, M.: A serial multilevel hypergraph partitioning algorithm. Comput. Res. Repository (2016)
Preen, R.J., Smith, J.: Evolutionary n-level hypergraph partitioning with adaptive coarsening. IEEE Trans. Evol. Comput. 23(6), 962–971 (2019)
Rahimian, F., Payberah, A.H., Girdzijauskas, S., et al.: JA-BE-JA: a distributed algorithm for balanced graph partitioning. In: IEEE International Conference on Self-adaptive & Self-organizing Systems, pp. 51–60. IEEE (2013)
Guerrieri, A., Montresor, A.: DFEP: distributed funding-based edge partitioning. In: Träff, J.L., Hunold, S., Versaci, F. (eds.) Euro-Par 2015. LNCS, vol. 9233, pp. 346–358. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-48096-0_27
Mirrokni, V., Thorup, M., Zadimoghaddam, M.: Consistent hashing with bounded loads. In: Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms. Society for Industrial and Applied Mathematics, pp. 587–604 (2018)
Acknowledgements
This work was supported by National Natural Science Foundation of China under Grant (No. 61472169, 61502215, 62072220, 61702381, U1811261); China Postdoctoral Science Foundation Funded Project (2020M672134); Science Research Fund of Liaoning Province Education Department (LJC201913); Liaoning Public Opinion and Network Security Big Data System Engineering Laboratory (No. 04-2016-0089013).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shan, X., Shi, X., Ma, W., Wang, J. (2021). Distributed Storage and Query for Domain Knowledge Graphs. In: Chen, Q., Li, J. (eds) Web and Big Data. APWeb-WAIM 2020 International Workshops. APWeb-WAIM 2020. Communications in Computer and Information Science, vol 1373. Springer, Singapore. https://doi.org/10.1007/978-981-16-0479-9_10
Download citation
DOI: https://doi.org/10.1007/978-981-16-0479-9_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0478-2
Online ISBN: 978-981-16-0479-9
eBook Packages: Computer ScienceComputer Science (R0)