Abstract
Nowadays, with the rapid development of artificial intelligence (AI) technologies, more and more jobs can be fulfilled by underwater drones. Particularly, object detection with deep learning becomes an important issue which can significantly improve the performance of underwater drones. In this work, we analyzed and augmented an existing underwater dataset and proposed a new object detection dataset for inshore aquaculture. The renewal dataset is used for the training of the network for real-time object detection. The simulation results showed that the existing underwater dataset has a class imbalance problem. In addition, it is shown that although large objects are detected with high accuracy, rocks and small shadows are sometimes mis-detected as targets. It is due to the distance between the target object and the camera lenses, simulation of the background, and the object detection model not learning the difference between background and target. This indicates that object detection accuracy could be improved by the dataset that can accurately learn object features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Katayama T, Song T, Shimamoto T, Jiang X (2019) GAN-based color correction for underwater object detection. In: OCEANS19 MTS/IEEE. IEEE Press, Seattle, pp 1–4. https://doi.org/10.23919/OCEANS40490.2019.8962561
Li C, Guo C, Ren W, Cong R, Hou J, Kwong S, Tao D (2019) An underwater image enhancement benchmark dataset and beyond. IEEE Trans Image Process 29:4376–4389. IEEE Press. https://doi.org/10.1109/TIP.2019.2955241
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: CVPR. IEEE, Las Vegas, pp 779–788. https://doi.org/10.48550/ARXIV.1506.02640
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR, Columbus, OH, pp 580–587. https://doi.org/10.1109/CVPR.2014.81
Ge Z, Liu S, Wang F, Li Z, Sun J (2021) YOLOX: exceeding YOLO series in 2021. In: CVPR. https://doi.org/10.48550/ARXIV.2107.08430
Yang Z, Sinnott RO, Bailey J, Ke Q (2022) A survey of automated data augmentation algorithms for deep learning-based image classification tasks. In: CVPR. https://doi.org/10.48550/ARXIV.2206.06544
Tamura Y, Katayama T, Song T, Shimamoto (2022) Object recognition based self-position estimation for underwater robots. In: OCEANS 2022. IEEE Press, Hampton Roads, pp 1–5. https://doi.org/10.1109/OCEANS47191.2022.9977021
Lin TY, Fleet D, Pajdla T, Schiele B, Tuytelaars T (eds) (2014) Microsoft COCO: common objects in context. In: Computer vision—ECCV 2014. LNCS, vol 8693. Springer, Zurich, Switzerland, pp 740–755. https://doi.org/10.48550/ARXIV.1405.0312
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Imada, A., Katayama, T., Song, T., Shimamoto, T. (2024). Underwater Object Detection Through Analysis and Data Augmentation of Underwater Datasets. In: Yang, XS., Sherratt, R.S., Dey, N., Joshi, A. (eds) Proceedings of Eighth International Congress on Information and Communication Technology. ICICT 2023. Lecture Notes in Networks and Systems, vol 696. Springer, Singapore. https://doi.org/10.1007/978-981-99-3236-8_46
Download citation
DOI: https://doi.org/10.1007/978-981-99-3236-8_46
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-3235-1
Online ISBN: 978-981-99-3236-8
eBook Packages: EngineeringEngineering (R0)