Improved YOLOv7 for Road Damage Detection

Zhang, Dongmei; Xu, Zhijie

doi:10.1007/978-981-99-3951-0_61

Dongmei Zhang⁴⁰ &
Zhijie Xu⁴⁰

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1059))

Included in the following conference series:

INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND APPLICATIONS

304 Accesses

Abstract

The current road damage detection (RDD) algorithms fail to achieve automatic and accurate evaluation and application in the traffic scenarios. In this paper, we propose a RDD algorithm YOLOv7-RDD based on the YOLOv7 model. The data augmentation method CutPaste is introduced for the first time, which can learn the irregularity of damage characteristics, construct pseudo damage samples with high similarity, and create a priori conditions for features extracted. We introduce the CBAM module into the ELAN module to resist the influence of interfering information. And it makes the model focus more on the feature of small objects and reduce the difficulty of the hard objects. In addition, we propose a new dataset RDDBJ, which contains five categories of road damage in 5390 images. And they are high-resolution from a top view, which are more suitable for detection and localization than others. Experiments on the RDDBJ dataset shows that the mAP reaches by 61.9% and is improved by 3.3% compared to the baseline, which is competitive and inspiring.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: EUR 29.95; Price includes VAT (Thailand)

eBook: EUR 32.09; Price includes VAT (Thailand)

Hardcover Book: EUR 39.99; Price excludes VAT (Thailand)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Arya, D., Maeda, H., Ghosh, S.K., Toshniwal, D., Sekimoto, Y.: RDD 2020: an annotated image dataset for automatic road damage detection using deep learning. Data Brief 36, 107133 (2021)
Article Google Scholar
Arya, D., Maeda, H., Ghosh, S.K., Toshniwal, D., Sekimoto, Y.: RDD2022: a multi-national image dataset for automatic road damage detection. ar**v preprint ar**v:2209.08538 (2022)
Arya, D., et al.: Global road damage detection: state-of-the-art solutions. In: 2020 IEEE International Conference on Big Data (Big Data), pp. 5533–5539. IEEE (2020)
Google Scholar
Maeda, H., Kashiyama, T., Sekimoto, Y., Seto, T., Omata, H.: Generative adversarial network for road damage detection. Comput. Aided Civ. Infrastruct. Eng. 36(1), 47–60 (2021)
Article Google Scholar
Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. ar**v preprint ar**v:2207.02696 (2022)
Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: YOLOX: exceeding yolo series in 2021. ar**v preprint ar**v:2107.08430 (2021)
Zhu, X., Lyu, S., Wang, X., Zhao, Q.: TPH-YOLOv5: improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2778–2788 (2021)
Google Scholar
Guo, Z., Wang, C., Yang, G., Huang, Z., Li, G.: MSFT-YOLO: Improved YOLOv5 based on transformer for detecting defects of steel surface. Sensors 22(9), 3467 (2022)
Article Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems 28 (2015)
Google Scholar
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on computer vision, pp. 2961–2969 (2017)
Google Scholar
Cai, Z., Vasconcelos, N.: Cascade R-CNN: delving into high quality object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6154–6162 (2018)
Google Scholar
Singh, J., Shekhar, S.: Road damage detection and classification in smartphone captured images using mask R-CNN. ar**v preprint ar**v:1811.04535 (2018)
Li, C.L., Sohn, K., Yoon, J., Pfister, T.: CutPaste: self-supervised learning for anomaly detection and localization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9664–9674 (2021)
Google Scholar
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Chapter Google Scholar
Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: Mixup: beyond empirical risk minimization. ar**v preprint ar**v:1710.09412 (2017)
Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., Yoo, Y.: CutMix: regularization strategy to train strong classifiers with localizable features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6023–6032 (2019)
Google Scholar
Ghiasi, G., et al.: Simple copy-paste is a strong data augmentation method for instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2918–2928 (2021)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham. (2016) https://doi.org/10.1007/978-3-319-46448-0_2
Cubuk, E.D., Zoph, B., Mane, D., Vasudevan, V., Le, Q.V.: Autoaugment: learning augmentation policies from data. ar**v preprint ar**v:1805.09501 (2018)
Guo, M.H., et al.: Attention mechanisms in computer vision: a survey. Comput. Vis. Media, 1–38 (2022)
Google Scholar

Download references

Acknowledgments

This study was sponsored by the BUCEA Post Graduate Innovation Project [No. PG2022145].

Author information

Authors and Affiliations

School of Science, Bei**g University of Civil Engineering and Architecture, Bei**g, 102616, China
Dongmei Zhang & Zhijie Xu

Authors

Dongmei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhijie Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhijie Xu .

Editor information

Editors and Affiliations

College of Communication Engineering, Jilin University, Jilin, China
Zhihong Qian
Vardhaman College of Engineering, Hyderabad, Telangana, India
M.A. Jabbar
Hong Kong Metropolitan University, Kowloon, Hong Kong
Simon K. S. Cheung
College of Technology, Indiana State University, Terre Haute, IN, USA
**aolong Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, D., Xu, Z. (2023). Improved YOLOv7 for Road Damage Detection. In: Qian, Z., Jabbar, M., Cheung, S.K.S., Li, X. (eds) Proceeding of 2022 International Conference on Wireless Communications, Networking and Applications (WCNA 2022). WCNA 2022. Lecture Notes in Electrical Engineering, vol 1059. Springer, Singapore. https://doi.org/10.1007/978-981-99-3951-0_61

Download citation

DOI: https://doi.org/10.1007/978-981-99-3951-0_61
Published: 27 July 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-3950-3
Online ISBN: 978-981-99-3951-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics