A deep learning based multi-image compression technique

Barman, Dibyendu; Hasnat, Abul; Begum, Shemim; Barman, Bandana

doi:10.1007/s11760-024-03163-8

A deep learning based multi-image compression technique

Original Paper
Published: 22 April 2024

Volume 18, pages 407–416, (2024)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Dibyendu Barman¹,
Abul Hasnat¹,
Shemim Begum¹ &
…
Bandana Barman²

124 Accesses
Explore all metrics

Abstract

A multi-image compression technique compresses multiple images of the same or various sizes together to generate a common codebook. In multi-image compression, the size of the common codebook or code vector matrix formed from multiple images is crucial to the algorithm's compression ratio performance. This codebook comprises codewords created by the multi-Image compression technique after various tuning settings have been modified. The compression ratio of the multi-image compression approach can be improved even more by lowering the size of the common code vector matrix. The common codebook or code vector matrix is reduced in size in this study using deep learning based auto-encoder technology. The encoded matrix is substantially smaller than the matrix formed using standard encoding techniques. For decoding purposes, information on the number of neurons and layers employed during encoding is also stored. The suggested approach is tested on a large number of standard photos and images from the UCID version 2 database. The experimental results are examined using compression ratio, PSNR, and SSIM. The results demonstrate that the suggested technique decreases the size of the common code vector matrix or codebook by 20%, improving overall algorithm performance by about 1.5% while maintaining the visual quality of the decompressed images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price includes VAT (France)

Instant access to the full article PDF.

Institutional subscriptions

2C-Net: integrate image compression and classification via deep neural network

Article 01 December 2022

Neural Multi-scale Image Compression

Neural Codes for Image Retrieval

Data availability

Not applicable.

References

Gonzalez, R.C., Woods, R.E., Eddins, S.L.: Digital Image processing using MATLB. Mc-Graw Hill, New York (2011)
Google Scholar
Gan, G., Ma, C., Wu, J.: Data clustering theory, algorithms and applications. SIAM, Philadelphia (2007)
Book Google Scholar
Jain, A.K., Dubes, R.C.: Algorithms for clustering data. Prentice-Hall, New Jersey (2004)
Google Scholar
Kil, D.H., Shin, F.B.: Reduced Dimension Image Compression and its Applications. In: Proc. of Int. Conference Image Processing, 3, (1995), pp. 500-503
Li, C.K., Yuen, H.: A high-performance image compression technique for multimedia applications. IEEE Trans. Consum. Electron. 42(2), 239–243 (1996)
Article Google Scholar
Avcibas, I., Memon, N., Sayood, K.: A progressive lossless/near lossless image compression algorithm. IEEE Signal Proc. Lett. 9(10), 312–314 (2002). https://doi.org/10.1109/LSP.2002.804129
Article Google Scholar
Liao, X., Qin, Z., Ding, L.: Data embedding in digital images using critical functions. Signal Proc. Image Commun. 58, 146–156 (2017). https://doi.org/10.1016/j.image.2017.07.006
Article Google Scholar
Hussain, A.J., Fayadh, A.A., Radi, N.: Image compression techniques: a survey in lossless and lossy algorithm. Neurocomputing 300, 44–69 (2018). https://doi.org/10.1016/j.neucom.2018.02.094
Article Google Scholar
Kim S., Cho, N.I.: A Lossless Color Image Compression Method based on a new Reversible Color Transform. In: Proc. of IEEE Int. Conference on Visual Communications and Image Processing (2012). https://doi.org/10.1109/VCIP.2012.6410808
Kumar, M., Anand, A.: An introduction to image compression. Int. J. Comp. Sci. Inf. Technol. Res. 2(2), 77–81 (2014)
Google Scholar
Singh, V., Singh, O.P., Mishra, G.R.: A brief introduction on image compression techniques and standards. Int. J. Technol. Res. Adv. 2(2), 15–21 (2013)
Google Scholar
Miklos A.: Comparison of spatial and frequency domain image compression methods. In: 4th International Conference and Workshop Mechatronics in Practice and Education(MECHEDU 2017), pp. 57–60, (2017)
Zhang, X., Wandell, B.A.: A spatial extension of CIELAB for digital color-image reproduction. J. Soc. Inform. Display 5(1), 61–63 (1997)
Article Google Scholar
Shen, M.Y., Kuo, C.C.J.: Review of postprocessing techniques for compression artifact removal. J. Visual Commun. Image Represent. 9(1), 2–14 (1998). https://doi.org/10.1006/jvci.1997.0378
Article Google Scholar
Yang, S.: Vector Quantization of Deep Convolutional Neural Networks with Learned Codebook. In: 17th Canadian Workshop on Information Theory (CWIT) (2022). https://doi.org/10.1109/CWIT55308.2022.9817671
Vali, M.H., Bäckström, T.: NSVQ: noise substitution in vector quantization for machine learning. IEEE Access 10, 13598–13610 (2022). https://doi.org/10.1109/ACCESS.2022.3147670
Article Google Scholar
Wong, T. Gargour, C.S., Batani, N.: Fuzzy learning vector quantization generation of codebooks. In: Proceedings 1995 Canadian Conference on Electrical and Computer Engineering, (1995). https://doi.org/10.1109/CCECE.1995.526673
Omaima, N.A., AL-Allaf: Codebook enhancement in vector quantization image compression using backpropagation neural network. J. Appl. Sci. 11(17), 3152–3160 (2011). https://doi.org/10.3923/jas.2011.3152.3160
Article Google Scholar
Pandey, P., Kumar, R., Shah, P.K.: Vector Quantization with codebook and index compression. In: IEEE International Conference System Modelling & Advancement in Research Trends (SMART) (2016). https://doi.org/10.1109/SYSMART.2016.7894488
Wang, L., Lu, Z., Ma, L., Feng, Y.: VQ codebook design using modified K-means algorithm with feature classification and grou** based initialization. Multimed. Tools Appl. 77, 8495–8510 (2018). https://doi.org/10.1007/s11042-017-4747-1
Article Google Scholar
Han, C., Chenb, Y.N., Loc, C.C., Wangd, C.T.: A novel approach for vector quantization using a neural network, mean shift, and principal component analysis-based seed re-initialization. Signal Process. 87(5), 799–810 (2007). https://doi.org/10.1016/j.sigpro.2006.08.006
Article Google Scholar
Vimala, S., Dev, K.K., Sathya, M.: Codebook generation for vector quantization using interpolations to compress gray scale images. Int. J. Comp. Appl. (2012). https://doi.org/10.5120/5719-7780
Article Google Scholar
Jaffery, Z.A., Singh, L., Ahmad, N.: Improved codebook design for vector quantization on orthogonal polynomials based transform coding. Int. J. Innova. Res. Comp. Commun. Eng. 6(2), 63–69 (2017). https://doi.org/10.17148/IJARCCE
Article Google Scholar
Barman, D., Hasnat, A., Barman, B.: Development of multi image compression technique based on common code vector. SN Comp. Sci. (2022). https://doi.org/10.1007/s42979-022-01450-0
Article Google Scholar
Hasnat, A., Barman, D.: A proposed multi-image compression technique. J. Intell. Fuzzy Syst. IOS Press 36(4), 3177–3193 (2019). https://doi.org/10.3233/JIFS-18360
Article Google Scholar
Tang, X., Yan, J., Song, Z., Zhang, X.: Deep Learning of Process Data with Supervised Variational Auto-encoder for Soft Sensor. In: IEEE 11th Data Driven Control and Learning Systems Conference (DDCLS), China, (2022). https://doi.org/10.1109/DDCLS55054.2022.9858451
Zhou, J., Ju, L., Zhang, X.: A hybrid learning model based on auto-encoders. In: 12th IEEE Conference on Industrial Electronics and Applications (ICIEA), (2017) https://doi.org/10.1109/ICIEA.2017.8282900
Sara, U., Akter, M., Uddin, M.S.: Image quality assessment through FSIM, SSIM, MSE and PSNR—A comparative study. J. Comp. Commun. 7(3), 8–18 (2019). https://doi.org/10.4236/jcc.2019.73002
Article Google Scholar
Mandal, K.: Reversible Steganography and Authentication via Transform Encoding. Springer, Berlin (2020). https://doi.org/10.1007/978-981-15-4397-5
Book Google Scholar
Al-Najjar, Y.A.Y., Soong, D.C.: Comparison of image quality assessment: PSNR, HVS, SSIM, UIQI. Int. J. Sci. Eng. Res. 3(8), 2229–5518 (2012)
Google Scholar
Sirisha, B.L., Kumar, S.S., Mohan, B.C.: Steganography based information security with high embedding capacity. In: Proc of IEEE Int. Conference Recent Advances in Electronics & Computer Engineering (RAECE), (2015). https://doi.org/10.1109/RAECE.2015.7510218
Schaefer, G., Stich, M.: UCID- An uncompressed color image database. SPIE Storage and Retrieval Methods and Applications for Multimedia, San Jose (2004). https://doi.org/10.1117/12.525375
Book Google Scholar

Download references

Funding

This research receives no specific grant from any funding agency in the public, commercial, or not-for-profit sector.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Government College of Engineering and Textile Technology, Berhampore, West Bengal, India
Dibyendu Barman, Abul Hasnat & Shemim Begum
Department of Electronics and Communication Engineering, Kalyani Government Engineering College, Kalyani, Nadia, West Bengal, India
Bandana Barman

Authors

Dibyendu Barman
View author publications
You can also search for this author in PubMed Google Scholar
Abul Hasnat
View author publications
You can also search for this author in PubMed Google Scholar
Shemim Begum
View author publications
You can also search for this author in PubMed Google Scholar
Bandana Barman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The study was conceived and designed by all of the authors. DB designed the experiments, collected and analyzed the data, and wrote the paper. AH and BB who are PhD guides of DB and SB helped to revise the manuscript. All authors agreed to be held accountable for the content of the final version of the manuscript.

Corresponding author

Correspondence to Dibyendu Barman.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Barman, D., Hasnat, A., Begum, S. et al. A deep learning based multi-image compression technique. SIViP 18 (Suppl 1), 407–416 (2024). https://doi.org/10.1007/s11760-024-03163-8

Download citation

Received: 23 February 2023
Revised: 24 February 2024
Accepted: 17 March 2024
Published: 22 April 2024
Issue Date: August 2024
DOI: https://doi.org/10.1007/s11760-024-03163-8

Keywords

Access this article

Log in via an institution

Price includes VAT (France)

Instant access to the full article PDF.

Institutional subscriptions

A deep learning based multi-image compression technique

Abstract

Access this article

Similar content being viewed by others

2C-Net: integrate image compression and classification via deep neural network

Neural Multi-scale Image Compression

Neural Codes for Image Retrieval

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A deep learning based multi-image compression technique

Abstract

Access this article

Similar content being viewed by others

2C-Net: integrate image compression and classification via deep neural network

Neural Multi-scale Image Compression

Neural Codes for Image Retrieval

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation