Semi-direct Sparse Odometry with Robust and Accurate Pose Estimation for Dynamic Scenes

Wang, Wufan; Zhang, Lei

doi:10.1007/978-981-99-9666-7_9

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14250)

Included in the following conference series:

International Conference on Computer-Aided Design and Computer Graphics


Abstract

The localization accuracy and robustness of visual odometry systems designed for static scenes can degrade significantly in complex real-world environments that contain moving objects. This paper addresses the problem by proposing a semi-direct sparse visual odometry (SDSO) method designed for dynamic scenes. With the aid of pixel-level semantic information, the system can not only eliminate dynamic points but also construct more accurate photometric errors for subsequent optimization. To obtain an accurate and robust camera pose in dynamic scenes, we propose a dual error optimization strategy that minimizes the reprojection and photometric errors consecutively. The proposed method has been extensively evaluated on public datasets such as the TUM dynamic dataset and the KITTI dataset. The results demonstrate the effectiveness of our method in terms of localization accuracy and robustness compared with both the original direct sparse odometry (DSO) method and state-of-the-art methods for dynamic scenes.
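
The abstract does not give implementation details, so the following is only a minimal sketch of the general idea of eliminating dynamic points with a pixel-level semantic mask. It is not the authors' implementation. The function name `filter_static_points`, the mask layout (one integer class label per pixel), and the label value `15` for a dynamic class are all assumptions made for illustration.

```python
import numpy as np


def filter_static_points(points_uv, semantic_mask, dynamic_labels):
    """Return a boolean array marking points that lie on non-dynamic pixels.

    points_uv      : (N, 2) array of (u, v) pixel coordinates (u = column, v = row).
    semantic_mask  : (H, W) integer array, one class label per pixel (assumed layout).
    dynamic_labels : iterable of class labels treated as dynamic (hypothetical values).
    """
    # Round sub-pixel coordinates to the nearest pixel centre.
    uv = np.rint(np.asarray(points_uv, dtype=float).reshape(-1, 2)).astype(int)
    h, w = semantic_mask.shape
    u, v = uv[:, 0], uv[:, 1]

    # Points that project outside the image are discarded.
    in_bounds = (u >= 0) & (u < w) & (v >= 0) & (v < h)

    keep = np.zeros(len(uv), dtype=bool)
    labels = semantic_mask[v[in_bounds], u[in_bounds]]
    # Keep a point only if its pixel label is not in the dynamic set.
    keep[in_bounds] = ~np.isin(labels, list(dynamic_labels))
    return keep


# Toy example: a 4x6 mask with a 2x2 region labelled 15 (assumed dynamic class).
mask = np.zeros((4, 6), dtype=int)
mask[1:3, 2:4] = 15
points = np.array([[0.2, 0.1], [2.0, 1.0], [3.4, 2.2], [5.0, 3.0], [7.0, 1.0]])
keep = filter_static_points(points, mask, {15})
static_points = points[keep]
```

In this toy case the two points inside the labelled region and the one outside the image are dropped, leaving two static points that a later pose optimization could use.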



Acknowledgement

This work is supported by the National Natural Science Foundation of China (No. 62132012 and No. 62002020).

Author information

Authors and Affiliations

Beijing Institute of Technology, Beijing, 100081, China
Wufan Wang & Lei Zhang


Corresponding author

Correspondence to Lei Zhang.

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Shi-Min Hu
Nanyang Technological University, Singapore, Singapore
Yiyu Cai
Cardiff University, Cardiff, UK
Paul Rosin


Cite this paper

Wang, W., Zhang, L. (2024). Semi-direct Sparse Odometry with Robust and Accurate Pose Estimation for Dynamic Scenes. In: Hu, SM., Cai, Y., Rosin, P. (eds) Computer-Aided Design and Computer Graphics. CADGraphics 2023. Lecture Notes in Computer Science, vol 14250. Springer, Singapore. https://doi.org/10.1007/978-981-99-9666-7_9


DOI: https://doi.org/10.1007/978-981-99-9666-7_9
Published: 07 February 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9665-0
Online ISBN: 978-981-99-9666-7
eBook Packages: Computer Science, Computer Science (R0)
