A Dynamic Compression Method for Database Backup Files in Cloud Environments

  • Conference paper
  • First Online:
Data Mining and Big Data (DMBD 2021)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1454))

Included in the following conference series:

Abstract

With the progress of society and the improvement of the degree of information technology, the data storage capacity of enterprise business is becoming larger. More and more enterprises choose to hand over the storage business to specialized cloud manufacturers. Generally, for the security and reliability of user data, cloud service providers (CSP) will conduct incremental or full backups of user data frequently. However, with the expansion of business and the increase of user data volume, the burden of data backup required by CSPs is becoming heavier. At present, most CSPs use open-source Xtrabackup for hot database backup and recovery, but Xtrabackup has shortcomings in compression rate and decompression speed.

To better improve Xtrabackup, we analyzed the backup and recovery process of Xtrabackup and fully explored the factors that affect its compression rate and decompression speed. Finally, an optimization strategy is proposed to optimize the original Xtrabackup. Experiments show that the proposed optimization scheme can increase the compression rate by 30%–50%, reduce the recovery time by 40%, and the total compression time by 25%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zhu, D., et al.: Massive files prefetching model based on LSTM neural network with cache transaction strategy. Comput. Mater. Continua 63(2), 979–993 (2020)

    Google Scholar 

  2. Wu, J., **, L., Ge, X., Wang, Y., Fu, J.: Cloud storage as the infrastructure of cloud computing. In: 2010 International Conference on Intelligent Computing and Cognitive Informatics, pp. 380–383. IEEE (2010)

    Google Scholar 

  3. Zhu, D., Du, H., Wang, Y., Peng, X.: An IoT-oriented real-time storage mechanism for massive small files based on Swift. Int. J. Embedded Syst. 12(1), 72–80 (2020)

    Article  Google Scholar 

  4. Zhu, D., Haiwen, D., Cao, N., Qiao, X., Liu, Y.: SP-TSRM: a data grou** strategy in distributed storage system. In: Vaidya, J., Li, J. (eds.) ICA3PP 2018. LNCS, vol. 11334, pp. 524–531. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-05051-1_36

    Chapter  Google Scholar 

  5. Dzhagaryan, A., Milenkovic, A.: On effectiveness of compressed file transfers to/from the cloud: an experimental evaluation. In: PECCS, pp. 173–184 (2018)

    Google Scholar 

  6. Kim, H., Yeom, H.Y., Son, Y.: An efficient database backup and recovery scheme using write-ahead logging. In: 2020 IEEE 13th International Conference on Cloud Computing (CLOUD), pp. 405–413. IEEE (2020)

    Google Scholar 

  7. Wang, R., Wang, C., Zha, L.: PACM: A prediction-based auto-adaptive compression model for HDFS. In: 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp. 1617–1626. IEEE (2016)

    Google Scholar 

  8. Facebook/Zstd: Facebook, GitHub. https://github.com/facebook/zstd (2015)

Download references

Acknowledgement

This research is based upon works supported by the Sanming University (19YG02), the Fundamental Research Funds for the Central Universities (Grant No. HIT.NSRIF.201714), Weihai Science and Technology Development Program (2016DXGJMS15), Key Research and Development Program in Shandong Provincial (2017GGX90103) and Weihai Scientific Research and Innovation Fund (2020).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shaozai Yu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhu, D. et al. (2021). A Dynamic Compression Method for Database Backup Files in Cloud Environments. In: Tan, Y., Shi, Y., Zomaya, A., Yan, H., Cai, J. (eds) Data Mining and Big Data. DMBD 2021. Communications in Computer and Information Science, vol 1454. Springer, Singapore. https://doi.org/10.1007/978-981-16-7502-7_32

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-7502-7_32

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-7501-0

  • Online ISBN: 978-981-16-7502-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Navigation