Abstract
Pedestrian detection is a classical problem in computer vision and has been a difficult problem to solve for a long time. Compared with face detection, it is very difficult to accurately detect pedestrians in various scenarios because of the complex posture of human body, larger deformation, and more serious problems such as attachment and occlusion. This paper focuses on the typical pedestrian detection model - YOLO model. Through experiments, the principle of pedestrian detection model algorithm and its model effect are studied to solve the difficulties in pedestrian detection. In YOLO, logistic regression is used to predict the object score of each boundary box, and multi-scale fusion is used to make prediction. By observing the mAP index, it is concluded that the YOLO algorithm has a good effect on single-label pedestrian detection, and the calculation efficiency is high.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mao, J., **ao, T., Jiang, Y., Cao, Z.: What can help pedestrian detection? In: Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition. IEEE (2017)
Dalal, N.: Histograms of oriented gradients for human detection. In: Proceedings of CVPR (2005)
Angelova, A., Krizhevsky, A., Vanhoucke, V., et al.: Real-time pedestrian detection with deep network cascades. In: British Machine Vision Conference (2015)
Li, J., Liang, X., Shen, S.M., et al.: Scale-aware fast R-CNN for pedestrian detection. IEEE Trans. Multimedia 99, 1–1 (2015)
Wang, X., **ao, T., Jiang, Y., et al.: Repulsion loss: detecting pedestrians in a crowd. In: Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2018)
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. ar**v e-prints, 2018
Girshick, T., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, pp. 580–587 (2014). https://doi.org/10.1109/CVPR.2014.81
Ren, U., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017). https://doi.org/10.1109/TPAMI.2016.2577031
Redmon, L., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp. 779–788 (2016). https://doi.org/10.1109/CVPR.2016.91
Redmon, M., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, pp. 6517–6525 (2017). https://doi.org/10.1109/CVPR.2017.690
Peng, R., et al.: Pedestrian detection for transformer substation based on gaussian mixture model and YOLO. In: 2016 8th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), Hangzhou, China, pp. 562–565 (2016). https://doi.org/10.1109/IHMSC.2016.130
Xue, Y., Ju, Z., Li, Y., Zhang, W.: MAF-YOLO: multi-modal attention fusion based YOLO for pedestrian detection. Infrared Phys. Technol. 118, 103906 (2021) ISSN 1350–4495. https://doi.org/10.1016/j.infrared.2021.103906
Acknowledgments
The project is supported by Science and Technology Commission of Shanghai Municipality (Number 22dz1208505, 20DZ1202900, 19DZ1204200).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, Y., Hao, H., Zeng, X., Feng, D. (2023). Pedestrian Detection Model Algorithm Optimization Based on Deep Convolutional Neural Network. In: Zeng, X., **e, X., Sun, J., Ma, L., Chen, Y. (eds) Proceedings of the 5th International Symposium for Intelligent Transportation and Smart City (ITASC). ITASC 2022. Lecture Notes in Electrical Engineering, vol 1042. Springer, Singapore. https://doi.org/10.1007/978-981-99-2252-9_2
Download citation
DOI: https://doi.org/10.1007/978-981-99-2252-9_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-2251-2
Online ISBN: 978-981-99-2252-9
eBook Packages: EngineeringEngineering (R0)