Distributed Storage and Query for Domain Knowledge Graphs

  • Conference paper
  • First Online:
Web and Big Data. APWeb-WAIM 2020 International Workshops (APWeb-WAIM 2020)

Abstract

The development of knowledge graph needs the support of a vast quantity of data. However, the amount of data increases rapidly is placing increasing demands on machines. Centralized data storage requires high-performance hosts to store data, which is costly and have single point of failure. Distributed data storage can reduce the cost of the machine greatly, and there is no single point of failure, but it has requirements for partition and storage of data collection. In the knowledge storage of specific domain, the way of graph data partition and storage vary from the different domain knowledge. To solve the above problems, a scheme of graph partition and distributed storage for domain-specific knowledge graphs is proposed. The proposed graph partition scheme pays attention to the correlation between the data, and divides the nodes affiliated each other into the same or similar partition. A distributed aggregation storage scheme is designed, which makes full use of cluster performance and solves the problem of data consistency during data insertion and update. The proposed distributed storage scheme based on HBase combines Neo4j to realize visual query effectively. Experimental results show the efficiency and the effectiveness of the proposed method in partition time, the number of edge-cut and update time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. **n, H., Jiangli, D., Jiangli, D.: Scalable aggregate keyword query over knowledge graph. Future Gener. Comput. Syst. 107, 588–600 (2020)

    Article  Google Scholar 

  2. Li, W., Song, M., Tian, Y.: An ontology-driven cyberinfrastructure for intelligent spatiotemporal question answering and open knowledge discovery. ISPRS Int. J. Geo-Inf. 8(11), 496 (2019)

    Article  Google Scholar 

  3. Barnawi, A., Alharbi, M., Chen, M.: Intelligent search and find system for robotic platform based on smart edge computing service. IEEE Access 8, 108821–108834 (2020)

    Article  Google Scholar 

  4. Fensel, A., Akbar, Z., Kärle, E., et al.: Knowledge graphs for online marketing and sales of touristic services. Information 11(5), 253 (2020)

    Article  Google Scholar 

  5. Sousa, R.T., Silva, S., Pesquita, C.: Evolving knowledge graph similarity for supervised learning in complex biomedical domains. BMC Bioinform. 21(1), 6 (2020)

    Article  Google Scholar 

  6. Yuan, L., Huang, Z., Zhao, W., et al.: Interpreting and predicting social commerce intention based on knowledge graph analysis. Electron. Commer. Res. 20(1), 197–222 (2020)

    Article  Google Scholar 

  7. Fan, T., Yan, L., Ma, Z.: Storing and querying fuzzy RDF(S) in HBase databases. Int. J. Intell. Syst. 35(4), 751–780 (2020)

    Article  Google Scholar 

  8. Zhu, Z., Zhou, X., Shao, K.: A novel approach based on Neo4j for multi-constrained flexible job shop scheduling problem. Comput. Ind. Eng. 130, 671–686 (2019)

    Article  Google Scholar 

  9. Fernandes, D., Bernardino, J.: Graph databases comparison: AllegroGraph, ArangoDB, InfiniteGraph, Neo4J, and OrientDB. In: DATA 2018, pp. 373–380 (2018)

    Google Scholar 

  10. Soudani, N.M., Fatemi, A., Nematbakhsh, M.: An investigation of big graph partitioning methods for distribution of graphs in vertex-centric systems. Distrib. Parallel Databases 38(1), 1–29 (2019)

    Article  Google Scholar 

  11. Filippidou, I., Kotid, Y.: Online partitioning of multi-labeled graphs. In: Proceedings of the GRADES 2015, pp. 1–6. ACM (2015)

    Google Scholar 

  12. Lotfifar, F., Johnson, M.: A serial multilevel hypergraph partitioning algorithm. Comput. Res. Repository (2016)

    Google Scholar 

  13. Preen, R.J., Smith, J.: Evolutionary n-level hypergraph partitioning with adaptive coarsening. IEEE Trans. Evol. Comput. 23(6), 962–971 (2019)

    Article  Google Scholar 

  14. Rahimian, F., Payberah, A.H., Girdzijauskas, S., et al.: JA-BE-JA: a distributed algorithm for balanced graph partitioning. In: IEEE International Conference on Self-adaptive & Self-organizing Systems, pp. 51–60. IEEE (2013)

    Google Scholar 

  15. Guerrieri, A., Montresor, A.: DFEP: distributed funding-based edge partitioning. In: Träff, J.L., Hunold, S., Versaci, F. (eds.) Euro-Par 2015. LNCS, vol. 9233, pp. 346–358. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-48096-0_27

    Chapter  Google Scholar 

  16. Mirrokni, V., Thorup, M., Zadimoghaddam, M.: Consistent hashing with bounded loads. In: Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms. Society for Industrial and Applied Mathematics, pp. 587–604 (2018)

    Google Scholar 

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China under Grant (No. 61472169, 61502215, 62072220, 61702381, U1811261); China Postdoctoral Science Foundation Funded Project (2020M672134); Science Research Fund of Liaoning Province Education Department (LJC201913); Liaoning Public Opinion and Network Security Big Data System Engineering Laboratory (No. 04-2016-0089013).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Junlu Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shan, X., Shi, X., Ma, W., Wang, J. (2021). Distributed Storage and Query for Domain Knowledge Graphs. In: Chen, Q., Li, J. (eds) Web and Big Data. APWeb-WAIM 2020 International Workshops. APWeb-WAIM 2020. Communications in Computer and Information Science, vol 1373. Springer, Singapore. https://doi.org/10.1007/978-981-16-0479-9_10

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-0479-9_10

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-0478-2

  • Online ISBN: 978-981-16-0479-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation