Log in

A deep learning based multi-image compression technique

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

A multi-image compression technique compresses multiple images of the same or various sizes together to generate a common codebook. In multi-image compression, the size of the common codebook or code vector matrix formed from multiple images is crucial to the algorithm's compression ratio performance. This codebook comprises codewords created by the multi-Image compression technique after various tuning settings have been modified. The compression ratio of the multi-image compression approach can be improved even more by lowering the size of the common code vector matrix. The common codebook or code vector matrix is reduced in size in this study using deep learning based auto-encoder technology. The encoded matrix is substantially smaller than the matrix formed using standard encoding techniques. For decoding purposes, information on the number of neurons and layers employed during encoding is also stored. The suggested approach is tested on a large number of standard photos and images from the UCID version 2 database. The experimental results are examined using compression ratio, PSNR, and SSIM. The results demonstrate that the suggested technique decreases the size of the common code vector matrix or codebook by 20%, improving overall algorithm performance by about 1.5% while maintaining the visual quality of the decompressed images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price includes VAT (France)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Data availability

Not applicable.

References

  1. Gonzalez, R.C., Woods, R.E., Eddins, S.L.: Digital Image processing using MATLB. Mc-Graw Hill, New York (2011)

    Google Scholar 

  2. Gan, G., Ma, C., Wu, J.: Data clustering theory, algorithms and applications. SIAM, Philadelphia (2007)

    Book  Google Scholar 

  3. Jain, A.K., Dubes, R.C.: Algorithms for clustering data. Prentice-Hall, New Jersey (2004)

    Google Scholar 

  4. Kil, D.H., Shin, F.B.: Reduced Dimension Image Compression and its Applications. In: Proc. of Int. Conference Image Processing, 3, (1995), pp. 500-503

  5. Li, C.K., Yuen, H.: A high-performance image compression technique for multimedia applications. IEEE Trans. Consum. Electron. 42(2), 239–243 (1996)

    Article  Google Scholar 

  6. Avcibas, I., Memon, N., Sayood, K.: A progressive lossless/near lossless image compression algorithm. IEEE Signal Proc. Lett. 9(10), 312–314 (2002). https://doi.org/10.1109/LSP.2002.804129

    Article  Google Scholar 

  7. Liao, X., Qin, Z., Ding, L.: Data embedding in digital images using critical functions. Signal Proc. Image Commun. 58, 146–156 (2017). https://doi.org/10.1016/j.image.2017.07.006

    Article  Google Scholar 

  8. Hussain, A.J., Fayadh, A.A., Radi, N.: Image compression techniques: a survey in lossless and lossy algorithm. Neurocomputing 300, 44–69 (2018). https://doi.org/10.1016/j.neucom.2018.02.094

    Article  Google Scholar 

  9. Kim S., Cho, N.I.: A Lossless Color Image Compression Method based on a new Reversible Color Transform. In: Proc. of IEEE Int. Conference on Visual Communications and Image Processing (2012). https://doi.org/10.1109/VCIP.2012.6410808

  10. Kumar, M., Anand, A.: An introduction to image compression. Int. J. Comp. Sci. Inf. Technol. Res. 2(2), 77–81 (2014)

    Google Scholar 

  11. Singh, V., Singh, O.P., Mishra, G.R.: A brief introduction on image compression techniques and standards. Int. J. Technol. Res. Adv. 2(2), 15–21 (2013)

    Google Scholar 

  12. Miklos A.: Comparison of spatial and frequency domain image compression methods. In: 4th International Conference and Workshop Mechatronics in Practice and Education(MECHEDU 2017), pp. 57–60, (2017)

  13. Zhang, X., Wandell, B.A.: A spatial extension of CIELAB for digital color-image reproduction. J. Soc. Inform. Display 5(1), 61–63 (1997)

    Article  Google Scholar 

  14. Shen, M.Y., Kuo, C.C.J.: Review of postprocessing techniques for compression artifact removal. J. Visual Commun. Image Represent. 9(1), 2–14 (1998). https://doi.org/10.1006/jvci.1997.0378

    Article  Google Scholar 

  15. Yang, S.: Vector Quantization of Deep Convolutional Neural Networks with Learned Codebook. In: 17th Canadian Workshop on Information Theory (CWIT) (2022). https://doi.org/10.1109/CWIT55308.2022.9817671

  16. Vali, M.H., Bäckström, T.: NSVQ: noise substitution in vector quantization for machine learning. IEEE Access 10, 13598–13610 (2022). https://doi.org/10.1109/ACCESS.2022.3147670

    Article  Google Scholar 

  17. Wong, T. Gargour, C.S., Batani, N.: Fuzzy learning vector quantization generation of codebooks. In: Proceedings 1995 Canadian Conference on Electrical and Computer Engineering, (1995). https://doi.org/10.1109/CCECE.1995.526673

  18. Omaima, N.A., AL-Allaf: Codebook enhancement in vector quantization image compression using backpropagation neural network. J. Appl. Sci. 11(17), 3152–3160 (2011). https://doi.org/10.3923/jas.2011.3152.3160

    Article  Google Scholar 

  19. Pandey, P., Kumar, R., Shah, P.K.: Vector Quantization with codebook and index compression. In: IEEE International Conference System Modelling & Advancement in Research Trends (SMART) (2016). https://doi.org/10.1109/SYSMART.2016.7894488

  20. Wang, L., Lu, Z., Ma, L., Feng, Y.: VQ codebook design using modified K-means algorithm with feature classification and grou** based initialization. Multimed. Tools Appl. 77, 8495–8510 (2018). https://doi.org/10.1007/s11042-017-4747-1

    Article  Google Scholar 

  21. Han, C., Chenb, Y.N., Loc, C.C., Wangd, C.T.: A novel approach for vector quantization using a neural network, mean shift, and principal component analysis-based seed re-initialization. Signal Process. 87(5), 799–810 (2007). https://doi.org/10.1016/j.sigpro.2006.08.006

    Article  Google Scholar 

  22. Vimala, S., Dev, K.K., Sathya, M.: Codebook generation for vector quantization using interpolations to compress gray scale images. Int. J. Comp. Appl. (2012). https://doi.org/10.5120/5719-7780

    Article  Google Scholar 

  23. Jaffery, Z.A., Singh, L., Ahmad, N.: Improved codebook design for vector quantization on orthogonal polynomials based transform coding. Int. J. Innova. Res. Comp. Commun. Eng. 6(2), 63–69 (2017). https://doi.org/10.17148/IJARCCE

    Article  Google Scholar 

  24. Barman, D., Hasnat, A., Barman, B.: Development of multi image compression technique based on common code vector. SN Comp. Sci. (2022). https://doi.org/10.1007/s42979-022-01450-0

    Article  Google Scholar 

  25. Hasnat, A., Barman, D.: A proposed multi-image compression technique. J. Intell. Fuzzy Syst. IOS Press 36(4), 3177–3193 (2019). https://doi.org/10.3233/JIFS-18360

    Article  Google Scholar 

  26. Tang, X., Yan, J., Song, Z., Zhang, X.: Deep Learning of Process Data with Supervised Variational Auto-encoder for Soft Sensor. In: IEEE 11th Data Driven Control and Learning Systems Conference (DDCLS), China, (2022). https://doi.org/10.1109/DDCLS55054.2022.9858451

  27. Zhou, J., Ju, L., Zhang, X.: A hybrid learning model based on auto-encoders. In: 12th IEEE Conference on Industrial Electronics and Applications (ICIEA), (2017) https://doi.org/10.1109/ICIEA.2017.8282900

  28. Sara, U., Akter, M., Uddin, M.S.: Image quality assessment through FSIM, SSIM, MSE and PSNR—A comparative study. J. Comp. Commun. 7(3), 8–18 (2019). https://doi.org/10.4236/jcc.2019.73002

    Article  Google Scholar 

  29. Mandal, K.: Reversible Steganography and Authentication via Transform Encoding. Springer, Berlin (2020). https://doi.org/10.1007/978-981-15-4397-5

    Book  Google Scholar 

  30. Al-Najjar, Y.A.Y., Soong, D.C.: Comparison of image quality assessment: PSNR, HVS, SSIM, UIQI. Int. J. Sci. Eng. Res. 3(8), 2229–5518 (2012)

    Google Scholar 

  31. Sirisha, B.L., Kumar, S.S., Mohan, B.C.: Steganography based information security with high embedding capacity. In: Proc of IEEE Int. Conference Recent Advances in Electronics & Computer Engineering (RAECE), (2015). https://doi.org/10.1109/RAECE.2015.7510218

  32. Schaefer, G., Stich, M.: UCID- An uncompressed color image database. SPIE Storage and Retrieval Methods and Applications for Multimedia, San Jose (2004). https://doi.org/10.1117/12.525375

    Book  Google Scholar 

Download references

Funding

This research receives no specific grant from any funding agency in the public, commercial, or not-for-profit sector.

Author information

Authors and Affiliations

Authors

Contributions

The study was conceived and designed by all of the authors. DB designed the experiments, collected and analyzed the data, and wrote the paper. AH and BB who are PhD guides of DB and SB helped to revise the manuscript. All authors agreed to be held accountable for the content of the final version of the manuscript.

Corresponding author

Correspondence to Dibyendu Barman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Barman, D., Hasnat, A., Begum, S. et al. A deep learning based multi-image compression technique. SIViP 18 (Suppl 1), 407–416 (2024). https://doi.org/10.1007/s11760-024-03163-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-024-03163-8

Keywords

Navigation