Abstract
In this paper we use the Proximal Policy Optimization (PPO) deep reinforcement learning algorithm to train a neural network to control a four-legged robot in simulation. Reinforcement learning in general can learn complex behavior policies from simple datasets of state-reward tuples, and PPO in particular has proven effective at solving complex tasks with continuous state and action spaces. Moreover, since it is model-free, the approach is general and can adapt to changes in the environment or in the robot itself.
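The core of PPO is its clipped surrogate objective, which limits how far each policy update can move from the data-collecting policy. A minimal sketch in NumPy (the function name and the toy inputs are illustrative, not from the paper):

```python
import numpy as np

def ppo_clip_loss(ratio, advantage, eps=0.2):
    """Clipped surrogate loss to be minimized.

    ratio     -- pi_new(a|s) / pi_old(a|s), per sample
    advantage -- advantage estimates (e.g. from GAE), per sample
    eps       -- clipping range; 0.2 is the value suggested by Schulman et al.
    """
    unclipped = ratio * advantage
    # Clipping removes the incentive to push the ratio outside [1-eps, 1+eps].
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantage
    # Take the pessimistic (lower) bound, then negate for gradient descent.
    return -np.mean(np.minimum(unclipped, clipped))
```

For example, with a positive advantage a ratio of 2.0 contributes only 1.2 to the objective rather than 2.0, so the gradient stops rewarding further policy divergence.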
The virtual environment used to train the agent was modeled with our physics engine, Project Chrono. Chrono can handle nonsmooth dynamics simulation, allowing us to introduce stiff leg-ground contacts, and through its Python interface PyChrono it can easily be coupled with the machine learning framework TensorFlow. We trained the neural network until it learned to control the motor torques, and then compared various choices of policy-network input state.
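Coupling a multibody simulator with an RL framework typically means exposing the simulation through a Gym-style step/reset interface. A self-contained sketch of that pattern follows; the class name, the effort-penalty reward, and the toy integrator are assumptions for illustration — the real environment would advance a PyChrono system instead:

```python
import numpy as np

class QuadrupedEnv:
    """Gym-style environment sketch; a real version would step a
    PyChrono multibody system with stiff leg-ground contacts."""

    def __init__(self, n_joints=8, dt=0.01):
        self.n_joints = n_joints
        self.dt = dt
        # Observation: joint angles followed by joint velocities.
        self.state = np.zeros(2 * n_joints)

    def reset(self):
        self.state = np.zeros(2 * self.n_joints)
        return self.state.copy()

    def step(self, torques):
        # Placeholder point-mass dynamics standing in for the
        # physics engine's time-stepping call.
        q, qd = np.split(self.state, 2)
        qd = qd + self.dt * np.asarray(torques, dtype=float)
        q = q + self.dt * qd
        self.state = np.concatenate([q, qd])
        # Hypothetical reward: penalize actuation effort.
        reward = -float(np.sum(np.square(torques)))
        done = False
        return self.state.copy(), reward, done, {}
```

With this interface the training loop only sees `(state, reward)` tuples, so the same PPO code runs unchanged whichever state representation is fed to the policy network.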
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Benatti, S., Tasora, A., Mangoni, D. (2020). Training a Four Legged Robot via Deep Reinforcement Learning and Multibody Simulation. In: Kecskeméthy, A., Geu Flores, F. (eds) Multibody Dynamics 2019. ECCOMAS 2019. Computational Methods in Applied Sciences, vol 53. Springer, Cham. https://doi.org/10.1007/978-3-030-23132-3_47
Print ISBN: 978-3-030-23131-6
Online ISBN: 978-3-030-23132-3