Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1059))

  • 304 Accesses

Abstract

The current road damage detection (RDD) algorithms fail to achieve automatic and accurate evaluation and application in the traffic scenarios. In this paper, we propose a RDD algorithm YOLOv7-RDD based on the YOLOv7 model. The data augmentation method CutPaste is introduced for the first time, which can learn the irregularity of damage characteristics, construct pseudo damage samples with high similarity, and create a priori conditions for features extracted. We introduce the CBAM module into the ELAN module to resist the influence of interfering information. And it makes the model focus more on the feature of small objects and reduce the difficulty of the hard objects. In addition, we propose a new dataset RDDBJ, which contains five categories of road damage in 5390 images. And they are high-resolution from a top view, which are more suitable for detection and localization than others. Experiments on the RDDBJ dataset shows that the mAP reaches by 61.9% and is improved by 3.3% compared to the baseline, which is competitive and inspiring.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (Thailand)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 32.09
Price includes VAT (Thailand)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
EUR 39.99
Price excludes VAT (Thailand)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Arya, D., Maeda, H., Ghosh, S.K., Toshniwal, D., Sekimoto, Y.: RDD 2020: an annotated image dataset for automatic road damage detection using deep learning. Data Brief 36, 107133 (2021)

    Article  Google Scholar 

  2. Arya, D., Maeda, H., Ghosh, S.K., Toshniwal, D., Sekimoto, Y.: RDD2022: a multi-national image dataset for automatic road damage detection. ar**v preprint ar**v:2209.08538 (2022)

  3. Arya, D., et al.: Global road damage detection: state-of-the-art solutions. In: 2020 IEEE International Conference on Big Data (Big Data), pp. 5533–5539. IEEE (2020)

    Google Scholar 

  4. Maeda, H., Kashiyama, T., Sekimoto, Y., Seto, T., Omata, H.: Generative adversarial network for road damage detection. Comput. Aided Civ. Infrastruct. Eng. 36(1), 47–60 (2021)

    Article  Google Scholar 

  5. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. ar**v preprint ar**v:2207.02696 (2022)

  6. Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: YOLOX: exceeding yolo series in 2021. ar**v preprint ar**v:2107.08430 (2021)

  7. Zhu, X., Lyu, S., Wang, X., Zhao, Q.: TPH-YOLOv5: improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2778–2788 (2021)

    Google Scholar 

  8. Guo, Z., Wang, C., Yang, G., Huang, Z., Li, G.: MSFT-YOLO: Improved YOLOv5 based on transformer for detecting defects of steel surface. Sensors 22(9), 3467 (2022)

    Article  Google Scholar 

  9. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems 28 (2015)

    Google Scholar 

  10. He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on computer vision, pp. 2961–2969 (2017)

    Google Scholar 

  11. Cai, Z., Vasconcelos, N.: Cascade R-CNN: delving into high quality object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6154–6162 (2018)

    Google Scholar 

  12. Singh, J., Shekhar, S.: Road damage detection and classification in smartphone captured images using mask R-CNN. ar**v preprint ar**v:1811.04535 (2018)

  13. Li, C.L., Sohn, K., Yoon, J., Pfister, T.: CutPaste: self-supervised learning for anomaly detection and localization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9664–9674 (2021)

    Google Scholar 

  14. Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1

    Chapter  Google Scholar 

  15. Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: Mixup: beyond empirical risk minimization. ar**v preprint ar**v:1710.09412 (2017)

  16. Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., Yoo, Y.: CutMix: regularization strategy to train strong classifiers with localizable features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6023–6032 (2019)

    Google Scholar 

  17. Ghiasi, G., et al.: Simple copy-paste is a strong data augmentation method for instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2918–2928 (2021)

    Google Scholar 

  18. Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham. (2016) https://doi.org/10.1007/978-3-319-46448-0_2

  19. Cubuk, E.D., Zoph, B., Mane, D., Vasudevan, V., Le, Q.V.: Autoaugment: learning augmentation policies from data. ar**v preprint ar**v:1805.09501 (2018)

  20. Guo, M.H., et al.: Attention mechanisms in computer vision: a survey. Comput. Vis. Media, 1–38 (2022)

    Google Scholar 

Download references

Acknowledgments

This study was sponsored by the BUCEA Post Graduate Innovation Project [No. PG2022145].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhijie Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, D., Xu, Z. (2023). Improved YOLOv7 for Road Damage Detection. In: Qian, Z., Jabbar, M., Cheung, S.K.S., Li, X. (eds) Proceeding of 2022 International Conference on Wireless Communications, Networking and Applications (WCNA 2022). WCNA 2022. Lecture Notes in Electrical Engineering, vol 1059. Springer, Singapore. https://doi.org/10.1007/978-981-99-3951-0_61

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-3951-0_61

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-3950-3

  • Online ISBN: 978-981-99-3951-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Navigation