Controlling a Dynamic System Through Reinforcement Learning

  • Conference paper
  • First Online:
Control and Inverse Problems (CIP 2022)

Part of the book series: Trends in Mathematics ((TM))

Included in the following conference series:

  • 178 Accesses

Abstract

Control theory deals with the problem of finding a control law for a given dynamic system on which one can act by means of a command. The goal is then to bring the system from a given initial state to a certain final state, possibly respecting certain criteria. In this paper, we are interested in the problems of optimal control of dynamical systems, seen under the prism of reinforcement learning (RL). RL is an area of machine learning that enables an agent to learn in an interactive environment by trial and error using feedback from its own actions and experiences. We compare the Q-learning, a model-free reinforcement algorithm, to the feedback controller Linear Quadratic Regulator on the OpenAI Gym Cart-Pole system, a toy example which is quite famous in both the control engineering and RL community.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (France)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 149.79
Price includes VAT (France)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
EUR 189.89
Price includes VAT (France)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free ship** worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. R. Bellman, A markovian decision process. Journal of Mathematics and Mechanics 6(5), 679–684 (1957)

    MathSciNet  MATH  Google Scholar 

  2. R. Bellman, Dynamic programming, in Dover Books on Computer Science (Dover Publications, New York, 2013)

    Google Scholar 

  3. C. Hua, Reinforcement Learning Aided Performance Optimization of Feedback Control Systems (Springer Vieweg, Wiesbaden, 2021)

    Google Scholar 

  4. V. Mnih, K. Kavukcuoglu, D. Silver, A.A. Rusu, J. Veness, M.G. Bellemare, A. Graves, M. Riedmiller, A.K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, D. Hassabis, Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)

    Article  Google Scholar 

  5. OpenAI Gym. Toolkit for develo** and comparing reinforcement learning algorithms. https://gym.openai.com/

  6. N. Savinay, N. Podila, R Ugarakhod, G. Koshy, Comparison of reinforcement learning algorithms applied to the cart-pole problem, in International Conference on Advances in Computing, Communications and Informatics (ICACCI) (IEEE, New York, 2017)

    Google Scholar 

  7. R. Sutton, A. Barto, Reinforcement Learning : An Introduction (MIT Press, New York, 2018)

    MATH  Google Scholar 

  8. C.J. Watkins, P. Dayan, Q-learning. Mach. Learn. 8(3), 279–292 (1992)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ammari, K., Bel Mufti, G. (2023). Controlling a Dynamic System Through Reinforcement Learning. In: Ammari, K., Jammazi, C., Triki, F. (eds) Control and Inverse Problems. CIP 2022. Trends in Mathematics. Birkhäuser, Cham. https://doi.org/10.1007/978-3-031-35675-9_2

Download citation

Publish with us

Policies and ethics

Navigation