A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment

Fan, Duyao; Yao, Yazhou; Cai, Yunfei; Shu, Xiangbo; Huang, Pu; Yang, Wankou

doi:10.1007/978-3-030-60633-6_17

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 12305)

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

Abstract

To address the disorder, sparseness, and floating that affect 3D LiDAR point clouds in road environments, we propose a novel deep CNN architecture for real-time point cloud feature extraction. Specifically, we first encode the 3D position of each point by its index in the vertical and horizontal directions. In this way, the 3D point cloud is converted into a multi-channel point feature map. Then, through multi-level feature extraction and fusion on the point feature map, semantic segmentation of the point cloud scene is realized. Comprehensive experiments and ablation studies on publicly available point cloud datasets demonstrate the superiority of our approach. More importantly, our approach has been successfully applied to perception in a real-world self-driving system. The source code has been made publicly available at: https://github.com/Lab1028-19/A-Novel-CNN.
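
The indexing step described in the abstract can be illustrated with a short sketch. The abstract does not give the map resolution, the sensor field of view, or the exact channel set, so everything below is an assumption: the function name `point_cloud_to_feature_map`, the 64 x 512 grid, the vertical field of view (chosen to resemble a 64-beam sensor), and the five channels (x, y, z, intensity, range). It shows a common spherical projection of a point cloud onto a 2D grid, which may differ from the encoding used in the paper.

```python
import numpy as np

def point_cloud_to_feature_map(points, n_rows=64, n_cols=512,
                               fov_up_deg=2.0, fov_down_deg=-24.8):
    """Project an (N, 4) array of [x, y, z, intensity] onto a 2D grid.

    Returns an (n_rows, n_cols, 5) float32 map with channels
    [x, y, z, intensity, range] and a boolean mask of occupied cells.
    All default parameter values are illustrative assumptions.
    """
    points = np.asarray(points, dtype=np.float64)
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    rng = np.sqrt(x**2 + y**2 + z**2)

    # Drop points at the sensor origin, where the angles are undefined.
    keep = rng > 0
    points, x, y, z, rng = points[keep], x[keep], y[keep], z[keep], rng[keep]

    azimuth = np.arctan2(y, x)        # horizontal angle in [-pi, pi]
    elevation = np.arcsin(z / rng)    # vertical angle

    fov_up = np.deg2rad(fov_up_deg)
    fov_down = np.deg2rad(fov_down_deg)

    # Horizontal index: azimuth mapped linearly onto [0, n_cols).
    col = np.floor((azimuth + np.pi) / (2 * np.pi) * n_cols).astype(int)
    col = np.clip(col, 0, n_cols - 1)

    # Vertical index: row 0 is the top of the field of view.
    row = np.floor((fov_up - elevation) / (fov_up - fov_down) * n_rows).astype(int)

    # Discard points outside the assumed vertical field of view.
    in_fov = (row >= 0) & (row < n_rows)
    row, col, rng, points = row[in_fov], col[in_fov], rng[in_fov], points[in_fov]

    feats = np.column_stack([points[:, :4], rng]).astype(np.float32)

    # Write far points first so the nearest point in each cell is kept.
    order = np.argsort(-rng)
    feature_map = np.zeros((n_rows, n_cols, 5), dtype=np.float32)
    mask = np.zeros((n_rows, n_cols), dtype=bool)
    feature_map[row[order], col[order]] = feats[order]
    mask[row[order], col[order]] = True
    return feature_map, mask
```

Under these assumptions the unordered point set becomes a dense image-like tensor, so ordinary 2D convolutions can be applied to it. Cells with no return stay at zero and are flagged in the mask.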

This work was supported by the National Natural Science Foundation of China (No. 61976116, 61773117), Fundamental Research Funds for the Central Universities (No. 30920021135), and the Primary Research & Development Plan of Jiangsu Province - Industry Prospects and Common Key Technologies (No. BE2017157).

Author information

Authors and Affiliations

Nanjing University of Science and Technology, Nanjing, 210094, China
Duyao Fan, Yazhou Yao, Yunfei Cai & Xiangbo Shu
Nanjing Audit University, Nanjing, 211815, China
Pu Huang
Southeast University, Nanjing, 210096, China
Wankou Yang

Corresponding author

Correspondence to Yunfei Cai.

Editor information

Editors and Affiliations

Peking University, Beijing, China
Yuxin Peng
Nanjing University of Information Science and Technology, Nanjing, China
Qingshan Liu
Dalian University of Technology, Dalian, China
Huchuan Lu
Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Chinese Academy of Sciences, Beijing, China
Chenglin Liu
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Xilin Chen
Peking University, Beijing, China
Hongbin Zha
Nanjing University of Science and Technology, Nanjing, China
Jian Yang

About this paper

Cite this paper

Fan, D., Yao, Y., Cai, Y., Shu, X., Huang, P., Yang, W. (2020). A Novel CNN Architecture for Real-Time Point Cloud Recognition in Road Environment. In: Peng, Y., et al. Pattern Recognition and Computer Vision. PRCV 2020. Lecture Notes in Computer Science, vol 12305. Springer, Cham. https://doi.org/10.1007/978-3-030-60633-6_17

DOI: https://doi.org/10.1007/978-3-030-60633-6_17
Published: 11 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60632-9
Online ISBN: 978-3-030-60633-6
eBook Packages: Computer Science, Computer Science (R0)
