Performance Evaluation of Container Identification Detection Algorithm

  • Conference paper
  • First Online:
Proceedings of International Conference on Artificial Intelligence and Communication Technologies (ICAICT 2023) (ICAICT 2023)

Abstract

The identification of container surfaces carries a large amount of crucial information regarding production and logistics. Research on the detection and identification of containers is lacking in both academia and industry, and the efficiency is low due to the need for manual completion of related tasks. In order to tackle this problem, we have created a large-scale text detection dataset for container surface identification called IdentificationText. This dataset consists of 12,000 high-resolution images, providing bounding boxes annotations for text detection tasks. We have discussed downstream applications of the IdentificationText dataset as well as our annotation techniques used in the dataset’s creation. The text in this dataset exhibits challenges such as deformations, multi-direction, and multi-scale. We conducted extensive experiments to evaluate the effectiveness and difficulty of this dataset using advanced text detection methods. In our experiments, we found that repeated textures and vertical text at multiple scales would cause missed detections, which was an extremely serious problem. The experimental results indicate that it is challenging for current text detection methods to locating text on container surfaces. Achieving higher accuracy in detecting text on containers requires more in-depth research. The experimental results serve as the benchmark performance for the IdentificationText dataset, providing reference for future researchers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 159.50
Price includes VAT (United Kingdom)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
GBP 199.99
Price includes VAT (United Kingdom)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bengio, Y., Lecun, Y., Hinton, G.: Deep learning for AI. Commun. ACM 64(7), 58–65 (2021)

    Article  Google Scholar 

  2. Karatzas, D., et al.: ICDAR 2013 Robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 1484–1493, Washington, DC, USA (2013)

    Google Scholar 

  3. Karatzas, D., et al.: ICDAR 2015 robust reading competition. In: 13th International Conference on Document Analysis and Recognition, pp. 1156–1160, Tunis, Tunisia (2015)

    Google Scholar 

  4. Gupta, A., Vedaldi, A., Zisserman, A.: Synthetic data for text localisation in natural images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2315–2324 (2016)

    Google Scholar 

  5. Ch’ng, C.K., Chan, C.S., Liu, C.L.: Total-text: towards orientation robustness in scene text detection. Int. J. Doc. Anal. Recogn. 23, 31–52 (2020)

    Google Scholar 

  6. Liu, Y., **, L., Zhang, S., et al.: Curved scene text detection via transverse and longitudinal sequence connection. Pattern Recogn. 90, 337–345 (2019)

    Article  Google Scholar 

  7. Singh, A., Pang, G., Toh, M., et al.: Textocr: towards large-scale end-to-end reasoning for arbitrary-shaped scene text. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8802–8812 (2021)

    Google Scholar 

  8. Nayef, N., Yin, F., Bizid, I., et al.: ICDAR 2017 robust reading challenge on multi-lingual scene text detection and script identification-RRC-MLT. In: 14th International Conference on Document Analysis and Recognition, pp. 1454–1459 (2017)

    Google Scholar 

  9. Nayef, N., Patel, Y., Busta, M., et al.: ICDAR 2019 robust reading challenge on multi-lingual scene text detection and recognition-RRC-MLT-2019. In: 2019 International Conference on Document Analysis and Recognition, pp. 1582–1587(2019)

    Google Scholar 

  10. Rebuffi, S.A., Kolesnikov, A., Sperl, G., et al.: ICARL: Incremental classifier and representation learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2001–2010 (2017)

    Google Scholar 

  11. Long, S., Ruan, J., Zhang, W., et al.: Textsnake: a flexible representation for detecting text of arbitrary shapes. In: Proceedings of the European Conference on Computer Vision, pp. 20–36(2018)

    Google Scholar 

  12. Wang, W., **e, E., Song, X., et al.: Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 8440–8449 (2019)

    Google Scholar 

  13. Wang, W., **e, E., Li, X., et al.: Shape robust text detection with progressive scale expansion network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9336–9345 (2019)

    Google Scholar 

  14. Liao, M., Wan, Z., Yao, C., et al.: Real-time scene text detection with differentiable binarization. Proc. AAAI Conf. Artif. Intell. 34(07), 11474–11481 (2020)

    Google Scholar 

  15. Liao, M., Zou, Z., Wan, Z., et al.: Real-time scene text detection with differentiable binarization and adaptive scale fusion. IEEE Trans. Pattern Anal. Mach. Intell. 45(1), 919–931 (2022)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by Guangdong Basic and Applied Basic Research Foundation (No. 2021A1515011576), Guangdong, Hong Kong, Macao and the Greater Bay Area International Science and Technology Innovation Cooperation Project (No. 2021A050530080, No. 2021A0505060011), Key Research Projects for the Universities of Guangdong Provincial Education Department (No. 2020ZDZX3031, No. 2022ZDZX1032), Jiangmen Basic and Applied Basic Research Key Project (2021030103230006670), Jiangmen Science and Technology Plan Project (2220002000246), and Key Laboratory of Public Big Data in Guizhou Province (No. 2019BDKFJJ015), Development of a Container Intelligent Panoramic Trademark Quality Inspection System Based on Machine Vision(HX22105).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ying Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liang, Z. et al. (2024). Performance Evaluation of Container Identification Detection Algorithm. In: Kountchev, R., Patnaik, S., Nakamatsu, K., Kountcheva, R. (eds) Proceedings of International Conference on Artificial Intelligence and Communication Technologies (ICAICT 2023). ICAICT 2023. Smart Innovation, Systems and Technologies, vol 368. Springer, Singapore. https://doi.org/10.1007/978-981-99-6641-7_21

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-6641-7_21

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-6640-0

  • Online ISBN: 978-981-99-6641-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Navigation