ResRnnNet: Learning to Reconstruct the 3D Room Layout from a Single RGB Panorama

Zhao, Shaonan; Li, Wei

doi:10.1007/978-981-19-1057-9_31

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 277)

Abstract

Reconstructing the 3D room layout from a single RGB panoramic image has been an emerging research topic in recent years. To improve prediction accuracy, we propose a new approach to predicting the 3D room layout from a single panoramic image. Our reconstruction pipeline follows the same general framework as LayoutNet [9] and HorizonNet [4]; however, we design a new deep learning architecture that adds a recurrent neural network (RNN) encoder–decoder as an extension for keypoint refinement, and we use a gradient ascent optimization algorithm to optimize the similarity loss. Experiments on both cuboid-shaped and general Manhattan layouts show that the proposed method outperforms recent algorithms in prediction accuracy.
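
As a rough illustration of the optimization idea mentioned in the abstract, the sketch below runs plain gradient ascent on a toy one-dimensional similarity score. It is not the authors' implementation: the names `gradient_ascent`, `score`, and `score_grad`, the quadratic score, the learning rate, and the step count are all assumptions chosen for illustration. It also makes explicit that ascending a score s(x) is the same as descending the loss -s(x).

```python
def gradient_ascent(grad, x0, lr=0.1, steps=200):
    """Repeatedly step in the direction of the gradient to increase a score.

    grad  : function returning the derivative of the score at x (assumed given)
    x0    : starting point
    lr    : step size (illustrative value, not taken from the paper)
    steps : number of update iterations (illustrative value)
    """
    x = x0
    for _ in range(steps):
        x = x + lr * grad(x)  # move uphill on the score
    return x


def score(x):
    # Toy similarity score with a single maximum at x = 3 (hypothetical).
    return -(x - 3.0) ** 2


def score_grad(x):
    # Analytic derivative of the toy score above.
    return -2.0 * (x - 3.0)


# Ascending the score is equivalent to descending the loss -score(x).
x_star = gradient_ascent(score_grad, x0=0.0)
print(x_star)
```

In a layout setting, `x` would stand for the layout parameters being refined and the score for how well the projected layout agrees with the network's predictions; those details are not specified in the abstract and are left out here.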

References

1. Fernandez-Labrador, C., Facil, J.M., Perez-Yus, A., Demonceaux, C., Guerrero, J.: Corners for layout: end-to-end layout recovery from 360 images. IEEE Robot. Autom. Lett. 5(2), 1255–1262 (2020)
2. Gioi, R., Jakubowicz, J., Morel, J.M., Randall, G.: LSD: a fast line segment detector with a false detection control (2008)
3. Lee, C.Y., Badrinarayanan, V., Malisiewicz, T., Rabinovich, A.: RoomNet: end-to-end room layout estimation. In: 2017 IEEE International Conference on Computer Vision (ICCV) (2017)
4. Sun, C., Hsiao, C.W., Sun, M., Chen, H.T.: HorizonNet: learning room layout with 1D representation and pano stretch data augmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1047–1056 (2019)
5. Yang, S.T., Wang, F.E., Peng, C.H., Wonka, P., Sun, M., Chu, H.K.: DuLa-Net: a dual-projection network for estimating room layouts from a single RGB panorama. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3363–3372 (2019)
6. Zhang, Y., Song, S., Tan, P., Xiao, J.: PanoContext: a whole-room 3D context model for panoramic scene understanding. In: European Conference on Computer Vision (2014)
7. Zhang, Y., Song, S., Tan, P., Xiao, J.: PanoContext: a whole-room 3D context model for panoramic scene understanding. In: European Conference on Computer Vision, pp. 668–686. Springer (2014)
8. Zou, C., Su, J.W., Peng, C.H., Colburn, A., Shan, Q., Wonka, P., Chu, H.K., Hoiem, D.: 3D Manhattan room layout reconstruction from a single 360 image (2019)
9. Zou, C., Colburn, A., Shan, Q., Hoiem, D.: LayoutNet: reconstructing the 3D room layout from a single RGB image. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2051–2059 (2018)

Acknowledgements

Wei Li is supported by SCUT XK2060021005.

Author information

Authors and Affiliations

School of Civil Engineering and Environment, The University of New South Wales, Sydney, NSW, 2052, Australia
Shaonan Zhao
College of Computer Science and Technology, Harbin Engineering University, Harbin, 150001, China
Wei Li

Corresponding author

Correspondence to Wei Li.

Editor information

Editors and Affiliations

College of Computer Science and Engineering, Shandong University of Science and Technology, Shandong, China
Shu-Chuan Chu
Shu-Te University, Kaohsiung, Taiwan
Shi-Huang Chen
Fujian University of Technology, Fuzhou, China
Zhenyu Meng
Database, Bioinformatics Lab, Chungbuk National University, Cheongju, Korea (Republic of)
Keun Ho Ryu
University of Piraeus, Piraeus, Greece
George A. Tsihrintzis

About this paper

Cite this paper

Zhao, S., Li, W. (2022). ResRnnNet: Learning to Reconstruct the 3D Room Layout from a Single RGB Panorama. In: Chu, SC., Chen, SH., Meng, Z., Ryu, K.H., Tsihrintzis, G.A. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. Smart Innovation, Systems and Technologies, vol 277. Springer, Singapore. https://doi.org/10.1007/978-981-19-1057-9_31

DOI: https://doi.org/10.1007/978-981-19-1057-9_31
Published: 06 July 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-1056-2
Online ISBN: 978-981-19-1057-9
eBook Packages: Intelligent Technologies and Robotics (R0)
