Abstract
A multi-image compression technique compresses multiple images of the same or various sizes together to generate a common codebook. In multi-image compression, the size of the common codebook or code vector matrix formed from multiple images is crucial to the algorithm's compression ratio performance. This codebook comprises codewords created by the multi-Image compression technique after various tuning settings have been modified. The compression ratio of the multi-image compression approach can be improved even more by lowering the size of the common code vector matrix. The common codebook or code vector matrix is reduced in size in this study using deep learning based auto-encoder technology. The encoded matrix is substantially smaller than the matrix formed using standard encoding techniques. For decoding purposes, information on the number of neurons and layers employed during encoding is also stored. The suggested approach is tested on a large number of standard photos and images from the UCID version 2 database. The experimental results are examined using compression ratio, PSNR, and SSIM. The results demonstrate that the suggested technique decreases the size of the common code vector matrix or codebook by 20%, improving overall algorithm performance by about 1.5% while maintaining the visual quality of the decompressed images.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03163-8/MediaObjects/11760_2024_3163_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03163-8/MediaObjects/11760_2024_3163_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03163-8/MediaObjects/11760_2024_3163_Fig3a_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03163-8/MediaObjects/11760_2024_3163_Fig3b_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs11760-024-03163-8/MediaObjects/11760_2024_3163_Fig4_HTML.png)
Similar content being viewed by others
Data availability
Not applicable.
References
Gonzalez, R.C., Woods, R.E., Eddins, S.L.: Digital Image processing using MATLB. Mc-Graw Hill, New York (2011)
Gan, G., Ma, C., Wu, J.: Data clustering theory, algorithms and applications. SIAM, Philadelphia (2007)
Jain, A.K., Dubes, R.C.: Algorithms for clustering data. Prentice-Hall, New Jersey (2004)
Kil, D.H., Shin, F.B.: Reduced Dimension Image Compression and its Applications. In: Proc. of Int. Conference Image Processing, 3, (1995), pp. 500-503
Li, C.K., Yuen, H.: A high-performance image compression technique for multimedia applications. IEEE Trans. Consum. Electron. 42(2), 239–243 (1996)
Avcibas, I., Memon, N., Sayood, K.: A progressive lossless/near lossless image compression algorithm. IEEE Signal Proc. Lett. 9(10), 312–314 (2002). https://doi.org/10.1109/LSP.2002.804129
Liao, X., Qin, Z., Ding, L.: Data embedding in digital images using critical functions. Signal Proc. Image Commun. 58, 146–156 (2017). https://doi.org/10.1016/j.image.2017.07.006
Hussain, A.J., Fayadh, A.A., Radi, N.: Image compression techniques: a survey in lossless and lossy algorithm. Neurocomputing 300, 44–69 (2018). https://doi.org/10.1016/j.neucom.2018.02.094
Kim S., Cho, N.I.: A Lossless Color Image Compression Method based on a new Reversible Color Transform. In: Proc. of IEEE Int. Conference on Visual Communications and Image Processing (2012). https://doi.org/10.1109/VCIP.2012.6410808
Kumar, M., Anand, A.: An introduction to image compression. Int. J. Comp. Sci. Inf. Technol. Res. 2(2), 77–81 (2014)
Singh, V., Singh, O.P., Mishra, G.R.: A brief introduction on image compression techniques and standards. Int. J. Technol. Res. Adv. 2(2), 15–21 (2013)
Miklos A.: Comparison of spatial and frequency domain image compression methods. In: 4th International Conference and Workshop Mechatronics in Practice and Education(MECHEDU 2017), pp. 57–60, (2017)
Zhang, X., Wandell, B.A.: A spatial extension of CIELAB for digital color-image reproduction. J. Soc. Inform. Display 5(1), 61–63 (1997)
Shen, M.Y., Kuo, C.C.J.: Review of postprocessing techniques for compression artifact removal. J. Visual Commun. Image Represent. 9(1), 2–14 (1998). https://doi.org/10.1006/jvci.1997.0378
Yang, S.: Vector Quantization of Deep Convolutional Neural Networks with Learned Codebook. In: 17th Canadian Workshop on Information Theory (CWIT) (2022). https://doi.org/10.1109/CWIT55308.2022.9817671
Vali, M.H., Bäckström, T.: NSVQ: noise substitution in vector quantization for machine learning. IEEE Access 10, 13598–13610 (2022). https://doi.org/10.1109/ACCESS.2022.3147670
Wong, T. Gargour, C.S., Batani, N.: Fuzzy learning vector quantization generation of codebooks. In: Proceedings 1995 Canadian Conference on Electrical and Computer Engineering, (1995). https://doi.org/10.1109/CCECE.1995.526673
Omaima, N.A., AL-Allaf: Codebook enhancement in vector quantization image compression using backpropagation neural network. J. Appl. Sci. 11(17), 3152–3160 (2011). https://doi.org/10.3923/jas.2011.3152.3160
Pandey, P., Kumar, R., Shah, P.K.: Vector Quantization with codebook and index compression. In: IEEE International Conference System Modelling & Advancement in Research Trends (SMART) (2016). https://doi.org/10.1109/SYSMART.2016.7894488
Wang, L., Lu, Z., Ma, L., Feng, Y.: VQ codebook design using modified K-means algorithm with feature classification and grou** based initialization. Multimed. Tools Appl. 77, 8495–8510 (2018). https://doi.org/10.1007/s11042-017-4747-1
Han, C., Chenb, Y.N., Loc, C.C., Wangd, C.T.: A novel approach for vector quantization using a neural network, mean shift, and principal component analysis-based seed re-initialization. Signal Process. 87(5), 799–810 (2007). https://doi.org/10.1016/j.sigpro.2006.08.006
Vimala, S., Dev, K.K., Sathya, M.: Codebook generation for vector quantization using interpolations to compress gray scale images. Int. J. Comp. Appl. (2012). https://doi.org/10.5120/5719-7780
Jaffery, Z.A., Singh, L., Ahmad, N.: Improved codebook design for vector quantization on orthogonal polynomials based transform coding. Int. J. Innova. Res. Comp. Commun. Eng. 6(2), 63–69 (2017). https://doi.org/10.17148/IJARCCE
Barman, D., Hasnat, A., Barman, B.: Development of multi image compression technique based on common code vector. SN Comp. Sci. (2022). https://doi.org/10.1007/s42979-022-01450-0
Hasnat, A., Barman, D.: A proposed multi-image compression technique. J. Intell. Fuzzy Syst. IOS Press 36(4), 3177–3193 (2019). https://doi.org/10.3233/JIFS-18360
Tang, X., Yan, J., Song, Z., Zhang, X.: Deep Learning of Process Data with Supervised Variational Auto-encoder for Soft Sensor. In: IEEE 11th Data Driven Control and Learning Systems Conference (DDCLS), China, (2022). https://doi.org/10.1109/DDCLS55054.2022.9858451
Zhou, J., Ju, L., Zhang, X.: A hybrid learning model based on auto-encoders. In: 12th IEEE Conference on Industrial Electronics and Applications (ICIEA), (2017) https://doi.org/10.1109/ICIEA.2017.8282900
Sara, U., Akter, M., Uddin, M.S.: Image quality assessment through FSIM, SSIM, MSE and PSNR—A comparative study. J. Comp. Commun. 7(3), 8–18 (2019). https://doi.org/10.4236/jcc.2019.73002
Mandal, K.: Reversible Steganography and Authentication via Transform Encoding. Springer, Berlin (2020). https://doi.org/10.1007/978-981-15-4397-5
Al-Najjar, Y.A.Y., Soong, D.C.: Comparison of image quality assessment: PSNR, HVS, SSIM, UIQI. Int. J. Sci. Eng. Res. 3(8), 2229–5518 (2012)
Sirisha, B.L., Kumar, S.S., Mohan, B.C.: Steganography based information security with high embedding capacity. In: Proc of IEEE Int. Conference Recent Advances in Electronics & Computer Engineering (RAECE), (2015). https://doi.org/10.1109/RAECE.2015.7510218
Schaefer, G., Stich, M.: UCID- An uncompressed color image database. SPIE Storage and Retrieval Methods and Applications for Multimedia, San Jose (2004). https://doi.org/10.1117/12.525375
Funding
This research receives no specific grant from any funding agency in the public, commercial, or not-for-profit sector.
Author information
Authors and Affiliations
Contributions
The study was conceived and designed by all of the authors. DB designed the experiments, collected and analyzed the data, and wrote the paper. AH and BB who are PhD guides of DB and SB helped to revise the manuscript. All authors agreed to be held accountable for the content of the final version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Ethical approval
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Barman, D., Hasnat, A., Begum, S. et al. A deep learning based multi-image compression technique. SIViP 18 (Suppl 1), 407–416 (2024). https://doi.org/10.1007/s11760-024-03163-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-024-03163-8