Abstract
Control theory deals with the problem of finding a control law for a given dynamic system on which one can act by means of a command. The goal is then to bring the system from a given initial state to a certain final state, possibly respecting certain criteria. In this paper, we are interested in the problems of optimal control of dynamical systems, seen under the prism of reinforcement learning (RL). RL is an area of machine learning that enables an agent to learn in an interactive environment by trial and error using feedback from its own actions and experiences. We compare the Q-learning, a model-free reinforcement algorithm, to the feedback controller Linear Quadratic Regulator on the OpenAI Gym Cart-Pole system, a toy example which is quite famous in both the control engineering and RL community.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
R. Bellman, A markovian decision process. Journal of Mathematics and Mechanics 6(5), 679–684 (1957)
R. Bellman, Dynamic programming, in Dover Books on Computer Science (Dover Publications, New York, 2013)
C. Hua, Reinforcement Learning Aided Performance Optimization of Feedback Control Systems (Springer Vieweg, Wiesbaden, 2021)
V. Mnih, K. Kavukcuoglu, D. Silver, A.A. Rusu, J. Veness, M.G. Bellemare, A. Graves, M. Riedmiller, A.K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, D. Hassabis, Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
OpenAI Gym. Toolkit for develo** and comparing reinforcement learning algorithms. https://gym.openai.com/
N. Savinay, N. Podila, R Ugarakhod, G. Koshy, Comparison of reinforcement learning algorithms applied to the cart-pole problem, in International Conference on Advances in Computing, Communications and Informatics (ICACCI) (IEEE, New York, 2017)
R. Sutton, A. Barto, Reinforcement Learning : An Introduction (MIT Press, New York, 2018)
C.J. Watkins, P. Dayan, Q-learning. Mach. Learn. 8(3), 279–292 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ammari, K., Bel Mufti, G. (2023). Controlling a Dynamic System Through Reinforcement Learning. In: Ammari, K., Jammazi, C., Triki, F. (eds) Control and Inverse Problems. CIP 2022. Trends in Mathematics. Birkhäuser, Cham. https://doi.org/10.1007/978-3-031-35675-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-35675-9_2
Published:
Publisher Name: Birkhäuser, Cham
Print ISBN: 978-3-031-36334-4
Online ISBN: 978-3-031-35675-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)