Image category classification using 12-Layer deep convolutional neural network

Veeramani, Vijayaraghavan; Mohan, Laavanya

doi:10.1007/s11042-023-15631-3

Image category classification using 12-Layer deep convolutional neural network

Published: 20 May 2023

Volume 83, pages 4017–4036, (2024)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

154 Accesses
Explore all metrics

Abstract

In comparison to human vision, it’s hard for systems to understand images and figure them out on their own. In the modern world, image processing is mostly done by convolutional neural networks that have been learned over time. As a result, our system categorizes real-time natural colour photographs using deep learning. In the majority of convolutional networks, the activation function is often a form of the rectified linear unit, which is prone to vanishing and exploding gradient problems. Though numerous studies have proposed solutions for resolving this issue, there has yet to be an efficient and feasible approach. To overcome this issue in our method, we combined Rectified Linear Unit (ReLU) variations without altering the activation functions or adding more layers. There are 12 hidden layers in the proposed network, and ten separate image categories were made from the CIFAR10 data set. In the experiment, we got 89.13 percent accuracy which is best when we compared our model to Alex net, Google net, and Resnet18. This shows that our model is better than convolutional neural networks with rectified linear units at categorizing natural pictures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 8

Classification of Images as Photographs or Paintings by Using Convolutional Neural Networks

An Efficient Deep Convolutional Neural Network for Visual Image Classification

Image Classification with Convolutional Neural Networks

Data availability

CIFAR-10 Matlab version dataset is used in this work and accessed from the link https://www.cs.toronto.edu/~kriz/cifar.html for comparison of the proposed work Imagenet dataset is also used.

Code availability

Use our own code with Matlab 2022a software in Windows 10 OS and compared with state of art networks like Alex net, Google net and Resnet18.

References

Ba JL, Kiros JR, Hinton GE (2016) Layer Normalization, Machine Learning. https://doi.org/10.48550/ar**v.1607.06450
Bharadi V, Panchbhai MN, Mukadam AI, Rode NN (2017) Image classification using deep learning. Int J Eng Res Technol 6(11):17–19
Google Scholar
Bryson AE, Ho YC, Siouris GM (1979) Applied optimal control: optimization, estimation and control. IEEE Trans Syst Man Cybern 9(6):366–368. https://doi.org/10.1109/TSMC.1979.4310229
Article Google Scholar
Cireşan D, Meier U, Masci J, Gambardella LM, Schmidhuber J (2011) Flexible, High performance convolutional neural networks for image classification, in Proc. 22nd international joint conference on Artificial Intelligence, Barcelona, Catalonia, Spain, 1237–1242. https://doi.org/10.5591/978-1-57735-516-8/IJCAI11-210
Diwakar M, Kumar P (2019) Wavelet Packet Based CT Image Denoising Using Bilateral Method and Bayes Shrinkage Rule, in Handbook of Multimedia Information Security: Techniques and Applications, A. K. Singh and A. Mohan, Eds. Cham: Springer International Publishing. 501–511. https://doi.org/10.1007/978-3-030-15887-3_24
Diwakar M, Kumar M (2018) A review on CT image noise and its denoising. Biomed Signal Process Control 42:73–88. https://doi.org/10.1016/j.bspc.2018.01.010
Article Google Scholar
Diwakar M, Kumar P, Singh AK (2020) CT image denoising using NLM and its method noise thresholding. Multimed Tools Appl 79(21):14449–14464. https://doi.org/10.1007/s11042-018-6897-1
Article Google Scholar
Diwakar M, Singh P (2020) CT image denoising using multivariate model and its method noise thresholding in non-subsampled shearlet domain. Biomed Signal Process Control 57:101754. https://doi.org/10.1016/j.bspc.2019.101754
Article Google Scholar
Diwakar M, Sonam, and Kumar M (2015) CT image denoising based on complex wavelet transform using local adaptive thresholding and Bilateral filtering, in Proceedings of the Third International Symposium on Women in Computing and Informatics, New York, NY, USA. 297–302. https://doi.org/10.1145/2791405.2791430
Diwakar M, Verma A, Lamba S, Gupta H (2019) Inter- and Intra-scale Dependencies-Based CT Image Denoising in Curvelet Domain: Proceedings of SoCTA 2017. 343–350. https://doi.org/10.1007/978-981-13-0589-4_32
Elangovan P, Nath MK (2020) Glaucoma Assessment from Color Fundus Images using Convolutional Neural Network. International Journal of Imaging Systems and Technology 31(02):955–971. https://doi.org/10.1002/ima.22494
Article Google Scholar
Hao W, Yizhou W, Yaqin L, Zhili S (2020) The Role of Activation Function in CNN, in Proc. 2nd IEEE International Conference on Information Technology and Computer Application. https://doi.org/10.1109/ITCA52113.2020.00096
He K, Zhang X, Ren S, Sun J (2015) Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, in Proc. 2015 IEEE International Conference on Computer Vision. https://doi.org/10.1109/ICCV.2015.123
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto A, Adam H, (2017) MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications, in Proc. IEEE International Conference on Computer Vision and Pattern Recognition. https://doi.org/10.48550/ar**v.1704.04861
Ioffe S, Szegedy C (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, in Proc. 32nd International Conference on International Conference on Machine Learning, 37:448–456. http://proceedings.mlr.press/v37/ioffe15.pdf. Accessed 26 April 2022
Juneja M, Vedaldi A, Jawahar CV, Zisserman A (2013) Blocks that shout: Distinctive parts for scene classification, in Proc. IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA. 923–930. https://doi.org/10.1109/CVPR.2013.124
Kar MK, Nath MK, Neog DR (2021) A Review on Progress in Semantic Image Segmentation and Its Application to Medical Images, SN Computer Science, Springer Nature 2:397. https://doi.org/10.1007/s42979-021-00784-5
Kohonen T (1988) An introduction to neural computing. Neural Netw-Elsevier 1(1):3–16. https://doi.org/10.1016/0893-6080(88)90020-2
Article Google Scholar
Krishna MM, Neelima M, Harshali M, Rao MVG (2018) Image classification using deep learning. Int J Eng Technol 7(2): 614–617. https://www.sciencepubco.com/index.php/ijet/article/view/10892. Accessed 10 May 2023
Krizhevsky A (2010) Convolutional deep belief networks on CIFAR-10, Department of Computer Science, University of Toronto. https://www.cs.toronto.edu/~kriz/conv-cifar10-aug2010.pdf. Accessed 10 May 2023
Laavanya M, Vijayaraghavan V (2020) Residual Learning of Transfer Learned Alex Net for Image Denoising. IEIE Trans Smart Process Comput 9(2):135–141. https://doi.org/10.5573/IEIESPC.2020.9.2.135
Article Google Scholar
Laavanya M, Vijayaraghavan V (2021) Image Denoising with Convolution Neural Network using Gaussian Filtered Residuals. IEIE Trans Smart Process Comput 10(2):96–100. https://doi.org/10.5573/IEIESPC.2021.10.2.096
Article Google Scholar
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Back propagation applied to handwritten zip code recognition. IEEE J Neural Comput 1(4):541–551. https://doi.org/10.1162/neco.1989.1.4.541
Article Google Scholar
Lee H, Grosse R, Ranganath R, Ng AY (2009) Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations, in Proc. 26th ACM Annual International Conference on Machine Learning, Montreal, Quebec, Canada, 609–616. https://doi.org/10.1145/1553374.1553453
Li W, Wu G, Zhang F, Du Q (2017) Hyper spectral image classification using deep pixel-pair features. IEEE Trans Geosci Remote Sens 55(2):844–853. https://doi.org/10.1109/TGRS.2016.2616355
Article Google Scholar
Lin G, Shen W (2018) Research on convolutional neural network based on improved Relu piecewise activation function. Procedia Comput Sci 131:977–984. https://doi.org/10.1016/j.procs.2018.04.239
Article Google Scholar
Lu C-T, Chen R-H, Wang L-L, Lin J-A (2020) Image enhancement using convolutional neural network to identify similar patterns. IET Image Processing 14(15):3880–3889. https://doi.org/10.1049/iet-ipr.2020.0560
Article Google Scholar
Maas AL, Awni HY, Andrew NY (2013) Rectifier nonlinearities improve neural network acoustic models, in Proc. ICML Workshop on Deep Learning for Audio, Speech, and Language Processing, 1–9. https://awnihannun.com/papers/relu_hybrid_icml2013_final.pdf. Accessed 10 May 2023
Mitchell TM (1997) Machine Learning, 1st edn. McGraw-Hill Inc, New York
Google Scholar
Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines, in Proc. 27th International Conference on International Conference on Machine Learning, Haifa, Israel, 807–814. https://www.cs.toronto.edu/~fritz/absps/reluICML.pdf. Accessed 10 May 2023
Ramprasath M, Anand MV, Hariharan S (2018) Image classification using convolutional neural networks, Int. J Pure Appl Math 119(17):1307–1319. https://www.acadpubl.eu/hub/2018-119-17/4/419.pdf. Accessed 10 May 2023
Rawat W, Wang Z (2017) Deep convolutional neural networks for image classification: A comprehensive review. Neural Comput 29:2352–2449. https://doi.org/10.1162/NECO_a_00990
Article MathSciNet Google Scholar
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Int J Sci Nature 323:533–536
Google Scholar
Sharif M, Kausar A, Park J, Shin DR (2019) Tiny image classification using four-block convolutional neural network, in Proc. 2019 International Conference on Information and Communication Technology Convergence. https://doi.org/10.1109/ictc46691.2019.8940002
Simard PY, Steinkraus D, Platt JC (2003) Best practices for convolutional neural networks applied to visual document analysis, in Proc. 7th IEEE International Conference on Document Analysis and Recognition, Edinburgh, UK 2: 958–963. https://doi.org/10.1109/ICDAR.2003.1227801
Uk I (2017) A Review on Image Enhancement Techniques. Int J Eng Appl Comput Sci IJEACS 02:232–235. https://doi.org/10.24032/ijeacs/0207/05
Article Google Scholar
Vijayaraghavan V, Laavanya M (2019) Vehicle classification and detection using deep learning. Int J Eng Adv Technol 9(1S5):24–28. https://doi.org/10.35940/ijeat.A1006.1291S52019
Article Google Scholar
Werbos PJ (1974) Beyond regression: New tools for prediction and analysis in the behavioral sciences, Ph. D. Dissertation, University of Harvard, Washington, USA
Wu Y, He K (2018) Group Normalization, Facebook AI Research, in proc. European Conference on Computer Vision. https://openaccess.thecvf.com/content_ECCV_2018/papers/Yuxin_Wu_Group_Normalization_ECCV_2018_paper.pdf. Accessed 26 April 2022
Zheng H, Jiaojiao Z, Yun G (2021) Handling Vanishing Gradient Problem Using Artificial Derivative. IEEE J Mag 9:22371–22377. https://doi.org/10.1109/ACCESS.2021.3054915
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics and Communication Engineering, Vignan’s Foundation for Science Technology and Research (Deemed to Be University), Vadlamudi, Guntur, Andhra Pradesh, 522213, India
Vijayaraghavan Veeramani & Laavanya Mohan

Authors

Vijayaraghavan Veeramani
View author publications
You can also search for this author in PubMed Google Scholar
Laavanya Mohan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vijayaraghavan Veeramani.

Ethics declarations

Conflicts of interest

There is no conflict of interest with any person or body regarding this work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Veeramani, V., Mohan, L. Image category classification using 12-Layer deep convolutional neural network. Multimed Tools Appl 83, 4017–4036 (2024). https://doi.org/10.1007/s11042-023-15631-3

Download citation

Received: 16 January 2022
Revised: 27 April 2022
Accepted: 22 April 2023
Published: 20 May 2023
Issue Date: January 2024
DOI: https://doi.org/10.1007/s11042-023-15631-3

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Image category classification using 12-Layer deep convolutional neural network

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Classification of Images as Photographs or Paintings by Using Convolutional Neural Networks

An Efficient Deep Convolutional Neural Network for Visual Image Classification

Image Classification with Convolutional Neural Networks

Data availability

Code availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Image category classification using 12-Layer deep convolutional neural network

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Classification of Images as Photographs or Paintings by Using Convolutional Neural Networks

An Efficient Deep Convolutional Neural Network for Visual Image Classification

Image Classification with Convolutional Neural Networks

Data availability

Code availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation