Abstract
The current road damage detection (RDD) algorithms fail to achieve automatic and accurate evaluation and application in the traffic scenarios. In this paper, we propose a RDD algorithm YOLOv7-RDD based on the YOLOv7 model. The data augmentation method CutPaste is introduced for the first time, which can learn the irregularity of damage characteristics, construct pseudo damage samples with high similarity, and create a priori conditions for features extracted. We introduce the CBAM module into the ELAN module to resist the influence of interfering information. And it makes the model focus more on the feature of small objects and reduce the difficulty of the hard objects. In addition, we propose a new dataset RDDBJ, which contains five categories of road damage in 5390 images. And they are high-resolution from a top view, which are more suitable for detection and localization than others. Experiments on the RDDBJ dataset shows that the mAP reaches by 61.9% and is improved by 3.3% compared to the baseline, which is competitive and inspiring.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arya, D., Maeda, H., Ghosh, S.K., Toshniwal, D., Sekimoto, Y.: RDD 2020: an annotated image dataset for automatic road damage detection using deep learning. Data Brief 36, 107133 (2021)
Arya, D., Maeda, H., Ghosh, S.K., Toshniwal, D., Sekimoto, Y.: RDD2022: a multi-national image dataset for automatic road damage detection. ar**v preprint ar**v:2209.08538 (2022)
Arya, D., et al.: Global road damage detection: state-of-the-art solutions. In: 2020 IEEE International Conference on Big Data (Big Data), pp. 5533–5539. IEEE (2020)
Maeda, H., Kashiyama, T., Sekimoto, Y., Seto, T., Omata, H.: Generative adversarial network for road damage detection. Comput. Aided Civ. Infrastruct. Eng. 36(1), 47–60 (2021)
Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. ar**v preprint ar**v:2207.02696 (2022)
Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: YOLOX: exceeding yolo series in 2021. ar**v preprint ar**v:2107.08430 (2021)
Zhu, X., Lyu, S., Wang, X., Zhao, Q.: TPH-YOLOv5: improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2778–2788 (2021)
Guo, Z., Wang, C., Yang, G., Huang, Z., Li, G.: MSFT-YOLO: Improved YOLOv5 based on transformer for detecting defects of steel surface. Sensors 22(9), 3467 (2022)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems 28 (2015)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on computer vision, pp. 2961–2969 (2017)
Cai, Z., Vasconcelos, N.: Cascade R-CNN: delving into high quality object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6154–6162 (2018)
Singh, J., Shekhar, S.: Road damage detection and classification in smartphone captured images using mask R-CNN. ar**v preprint ar**v:1811.04535 (2018)
Li, C.L., Sohn, K., Yoon, J., Pfister, T.: CutPaste: self-supervised learning for anomaly detection and localization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9664–9674 (2021)
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: Mixup: beyond empirical risk minimization. ar**v preprint ar**v:1710.09412 (2017)
Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., Yoo, Y.: CutMix: regularization strategy to train strong classifiers with localizable features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6023–6032 (2019)
Ghiasi, G., et al.: Simple copy-paste is a strong data augmentation method for instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2918–2928 (2021)
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham. (2016) https://doi.org/10.1007/978-3-319-46448-0_2
Cubuk, E.D., Zoph, B., Mane, D., Vasudevan, V., Le, Q.V.: Autoaugment: learning augmentation policies from data. ar**v preprint ar**v:1805.09501 (2018)
Guo, M.H., et al.: Attention mechanisms in computer vision: a survey. Comput. Vis. Media, 1–38 (2022)
Acknowledgments
This study was sponsored by the BUCEA Post Graduate Innovation Project [No. PG2022145].
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, D., Xu, Z. (2023). Improved YOLOv7 for Road Damage Detection. In: Qian, Z., Jabbar, M., Cheung, S.K.S., Li, X. (eds) Proceeding of 2022 International Conference on Wireless Communications, Networking and Applications (WCNA 2022). WCNA 2022. Lecture Notes in Electrical Engineering, vol 1059. Springer, Singapore. https://doi.org/10.1007/978-981-99-3951-0_61
Download citation
DOI: https://doi.org/10.1007/978-981-99-3951-0_61
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-3950-3
Online ISBN: 978-981-99-3951-0
eBook Packages: EngineeringEngineering (R0)