Abstract
The identification of container surfaces carries a large amount of crucial information regarding production and logistics. Research on the detection and identification of containers is lacking in both academia and industry, and the efficiency is low due to the need for manual completion of related tasks. In order to tackle this problem, we have created a large-scale text detection dataset for container surface identification called IdentificationText. This dataset consists of 12,000 high-resolution images, providing bounding boxes annotations for text detection tasks. We have discussed downstream applications of the IdentificationText dataset as well as our annotation techniques used in the dataset’s creation. The text in this dataset exhibits challenges such as deformations, multi-direction, and multi-scale. We conducted extensive experiments to evaluate the effectiveness and difficulty of this dataset using advanced text detection methods. In our experiments, we found that repeated textures and vertical text at multiple scales would cause missed detections, which was an extremely serious problem. The experimental results indicate that it is challenging for current text detection methods to locating text on container surfaces. Achieving higher accuracy in detecting text on containers requires more in-depth research. The experimental results serve as the benchmark performance for the IdentificationText dataset, providing reference for future researchers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bengio, Y., Lecun, Y., Hinton, G.: Deep learning for AI. Commun. ACM 64(7), 58–65 (2021)
Karatzas, D., et al.: ICDAR 2013 Robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 1484–1493, Washington, DC, USA (2013)
Karatzas, D., et al.: ICDAR 2015 robust reading competition. In: 13th International Conference on Document Analysis and Recognition, pp. 1156–1160, Tunis, Tunisia (2015)
Gupta, A., Vedaldi, A., Zisserman, A.: Synthetic data for text localisation in natural images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2315–2324 (2016)
Ch’ng, C.K., Chan, C.S., Liu, C.L.: Total-text: towards orientation robustness in scene text detection. Int. J. Doc. Anal. Recogn. 23, 31–52 (2020)
Liu, Y., **, L., Zhang, S., et al.: Curved scene text detection via transverse and longitudinal sequence connection. Pattern Recogn. 90, 337–345 (2019)
Singh, A., Pang, G., Toh, M., et al.: Textocr: towards large-scale end-to-end reasoning for arbitrary-shaped scene text. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8802–8812 (2021)
Nayef, N., Yin, F., Bizid, I., et al.: ICDAR 2017 robust reading challenge on multi-lingual scene text detection and script identification-RRC-MLT. In: 14th International Conference on Document Analysis and Recognition, pp. 1454–1459 (2017)
Nayef, N., Patel, Y., Busta, M., et al.: ICDAR 2019 robust reading challenge on multi-lingual scene text detection and recognition-RRC-MLT-2019. In: 2019 International Conference on Document Analysis and Recognition, pp. 1582–1587(2019)
Rebuffi, S.A., Kolesnikov, A., Sperl, G., et al.: ICARL: Incremental classifier and representation learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2001–2010 (2017)
Long, S., Ruan, J., Zhang, W., et al.: Textsnake: a flexible representation for detecting text of arbitrary shapes. In: Proceedings of the European Conference on Computer Vision, pp. 20–36(2018)
Wang, W., **e, E., Song, X., et al.: Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 8440–8449 (2019)
Wang, W., **e, E., Li, X., et al.: Shape robust text detection with progressive scale expansion network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9336–9345 (2019)
Liao, M., Wan, Z., Yao, C., et al.: Real-time scene text detection with differentiable binarization. Proc. AAAI Conf. Artif. Intell. 34(07), 11474–11481 (2020)
Liao, M., Zou, Z., Wan, Z., et al.: Real-time scene text detection with differentiable binarization and adaptive scale fusion. IEEE Trans. Pattern Anal. Mach. Intell. 45(1), 919–931 (2022)
Acknowledgements
This work was supported by Guangdong Basic and Applied Basic Research Foundation (No. 2021A1515011576), Guangdong, Hong Kong, Macao and the Greater Bay Area International Science and Technology Innovation Cooperation Project (No. 2021A050530080, No. 2021A0505060011), Key Research Projects for the Universities of Guangdong Provincial Education Department (No. 2020ZDZX3031, No. 2022ZDZX1032), Jiangmen Basic and Applied Basic Research Key Project (2021030103230006670), Jiangmen Science and Technology Plan Project (2220002000246), and Key Laboratory of Public Big Data in Guizhou Province (No. 2019BDKFJJ015), Development of a Container Intelligent Panoramic Trademark Quality Inspection System Based on Machine Vision(HX22105).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Liang, Z. et al. (2024). Performance Evaluation of Container Identification Detection Algorithm. In: Kountchev, R., Patnaik, S., Nakamatsu, K., Kountcheva, R. (eds) Proceedings of International Conference on Artificial Intelligence and Communication Technologies (ICAICT 2023). ICAICT 2023. Smart Innovation, Systems and Technologies, vol 368. Springer, Singapore. https://doi.org/10.1007/978-981-99-6641-7_21
Download citation
DOI: https://doi.org/10.1007/978-981-99-6641-7_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-6640-0
Online ISBN: 978-981-99-6641-7
eBook Packages: EngineeringEngineering (R0)