Abstract
Large numbers of people today take part in activities spread over many locations. To keep public gatherings safe, it may be necessary to characterise the people present and their behaviour. Manual observation of a crowd is expensive and not very effective, whereas existing technology makes monitoring both more affordable and more effective. We suggest deploying closed-circuit cameras to monitor a crowd of individuals and their behaviour. In this work, we propose an algorithm for identifying objects using a deep convolutional neural network (CNN) detector. The algorithm is used to extract features from local datasets. A CNN typically combines convolution, pooling, and affine layers. Each convolution layer is built from a number of separate filters, each of which slides over the image to generate a feature map. The pooling layer then condenses this representation. The proposed algorithm achieves higher accuracy and requires less processing time when evaluating object quality.
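The layer types named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' detector: the function names (`conv2d`, `max_pool2d`, `affine`, `forward`), the stride of 1, the 2x2 max pooling, and the absence of any activation function are all assumptions made for illustration, and the weights in the usage lines are arbitrary rather than learned.

```python
def conv2d(image, kernel):
    """Slide `kernel` over `image` (no padding, stride 1) and return the feature map."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def max_pool2d(fmap, size=2):
    """Condense a feature map: keep the maximum of each non-overlapping size x size window."""
    out_h = len(fmap) // size
    out_w = len(fmap[0]) // size
    return [
        [
            max(fmap[i * size + a][j * size + b]
                for a in range(size) for b in range(size))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def affine(vector, weights, bias):
    """Affine (fully connected) layer: one weighted sum plus bias per output unit."""
    return [
        sum(w * x for w, x in zip(row, vector)) + b
        for row, b in zip(weights, bias)
    ]


def forward(image, kernels, weights, bias):
    """Convolution -> pooling for each filter, then flatten and apply the affine layer."""
    features = []
    for kernel in kernels:
        pooled = max_pool2d(conv2d(image, kernel))
        features.extend(value for row in pooled for value in row)
    return affine(features, weights, bias)


# Hypothetical 3x3 "image" and a single 2x2 filter (illustrative values only).
image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
kernel = [[1, 0],
          [0, 1]]
scores = forward(image, [kernel], weights=[[2], [-1]], bias=[1, 0])
```

Each filter yields its own feature map, so adding filters widens the flattened vector that the affine layer receives; a practical detector would also insert non-linear activations and learn the filter and affine weights from data.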
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Vasudevan, I., Nithya, N.S. (2024). Multi-layered Object Identification and Detection Using Deep CNN Detector. In: Reddy, V.S., Wang, J., Reddy, K. (eds) Soft Computing and Signal Processing. ICSCSP 2023. Lecture Notes in Networks and Systems, vol 864. Springer, Singapore. https://doi.org/10.1007/978-981-99-8628-6_23
DOI: https://doi.org/10.1007/978-981-99-8628-6_23
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8627-9
Online ISBN: 978-981-99-8628-6
eBook Packages: Intelligent Technologies and Robotics (R0)