Overview of 3D Object Detection for Robot Environment Perception

  • Conference paper
  • First Online:
Proceedings of International Conference on Image, Vision and Intelligent Systems 2023 (ICIVIS 2023) (ICIVIS 2023)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1163))

Included in the following conference series:

  • 330 Accesses

Abstract

Environmental perception is one of the key issues in robot research. To complete more precise tasks, robots need a recognition ability to recognize all dynamic and static objects in the environment accurately. Currently, the prevailing approach in robotics is to employ object detection methods for perceiving stationary objects in the robot's vicinity. This primarily entails the use of 2D object detection based on images and 3D object detection based on depth images or point cloud data. From an input data perspective, it is evident that depth images and point cloud data offer a substantially richer set of three-dimensional spatial information compared to conventional image data. Therefore, 3D object detection is more suitable for robot environmental perception. This article provides an overview of 3D object detection methods for facial robots. This article mainly reviews methods based on radar point cloud information, introduces the historical evolution of 3D object detection, and elaborates on the performance and limitations of different methods. At the same time, it lists commonly used datasets for related tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ko, D.-K., Lee, K.-W., Lee, D.H., Lim, S.-C.: Vision-based interaction force estimation for robot grip motion without tactile/force sensor. Expert Syst. Appl. 211, 118441 (2023). https://doi.org/10.1016/j.eswa.2022.118441

    Article  Google Scholar 

  2. Wang, J., et al.: Field effect transistor-based tactile sensors: from sensor configurations to advanced applications. InfoMat. 5, e12376 (2023). https://doi.org/10.1002/inf2.12376

    Article  Google Scholar 

  3. Liu, L., et al.: MixTeacher: Mining promising labels with mixed scale teacher for semi-supervised object detection. In: Presented at the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2023)

    Google Scholar 

  4. Gonçalves, C.B., Souza, J.R., Fernandes, H.: CNN architecture optimization using bio-inspired algorithms for breast cancer detection in infrared images. Comput. Biol. Med. 142, 105205 (2022). https://doi.org/10.1016/j.compbiomed.2021.105205

    Article  Google Scholar 

  5. Abumalloh, R.A., et al.: Medical image processing and COVID-19: aA literature review and bibliometric analysis. J. Infect. Public Health 15, 75–93 (2022). https://doi.org/10.1016/j.jiph.2021.11.013

    Article  Google Scholar 

  6. He, C., Li, R., Zhang, Y., Li, S., Zhang, L.: MSF: motion-guided sequential fusion for efficient 3d object detection from point cloud sequences. In: Presented at the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2023)

    Google Scholar 

  7. Qi, C.R., Su, H., Mo, K., Guibas, L.J.: PointNet: deep learning on point sets for 3d classification and segmentation. In: Presented at the Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)

    Google Scholar 

  8. Feng, C., Jie, Z., Zhong, Y., Chu, X., Ma, L.: AeDet: azimuth-invariant multi-view 3d object detection. In: Presented at the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2023)

    Google Scholar 

  9. Qi, C.R., Yi, L., Su, H., Guibas, L.J.: PointNet++: dDeep hierarchical feature learning on point sets in a metric space. In: Advances in Neural Information Processing Systems. Curran Associates, Inc. (2017)

    Google Scholar 

  10. Shi, S., Wang, Z., Shi, J., Wang, X., Li, H.: From points to parts: 3D object detection from point cloud with part-aware and part-aggregation network. IEEE Trans. Pattern Anal. Mach. Intell. 43, 2647–2664 (2021). https://doi.org/10.1109/TPAMI.2020.2977026

    Article  Google Scholar 

  11. Yang, Z., Sun, Y., Liu, S., Shen, X., Jia, J.: STD: sparse-to-dense 3D object detector for point cloud. In: Presented at the Proceedings of the IEEE/CVF International Conference on Computer Vision (2019)

    Google Scholar 

  12. Vaswani, A., et al/: Attention is all you need. In: Advances in Neural Information Processing Systems. Curran Associates, Inc. (2017)

    Google Scholar 

  13. Pan, X., **a, Z., Song, S., Li, L.E., Huang, G.: 3D object detection with pointformer. In: Presented at the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2021)

    Google Scholar 

  14. Shi, S., Wang, Z., Wang, X., Li, H.: Part-A^2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud. (2019)

    Google Scholar 

  15. Lang, A.H., Vora, S., Caesar, H., Zhou, L., Yang, J., Beijbom, O.: PointPillars: fast encoders for object detection from point clouds. In: Presented at the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019)

    Google Scholar 

  16. Li, Y., et al.: Voxel Field Fusion for 3D Object Detection. In: Presented at the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2022)

    Google Scholar 

Download references

Acknowledgement

This work is supported by the Bei**g Natural Science Foundation (No. 4222025), the National Natural Science Foundation of China (No. 61931012), the Bei**g Municipal Science and Technology (No. Z221100000222016), and QIYUAN LAB Innovation Foundation (Innovation Research) Project (No. S20210201107).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nan Ma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, M., Ma, N. (2024). Overview of 3D Object Detection for Robot Environment Perception. In: You, P., Liu, S., Wang, J. (eds) Proceedings of International Conference on Image, Vision and Intelligent Systems 2023 (ICIVIS 2023). ICIVIS 2023. Lecture Notes in Electrical Engineering, vol 1163. Springer, Singapore. https://doi.org/10.1007/978-981-97-0855-0_64

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0855-0_64

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0854-3

  • Online ISBN: 978-981-97-0855-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Navigation