Abstract
Optimal control is crucial for spacecraft missions, with limited spacecraft fuel or energy storage and few opportunities for refueling after launch. The performance guaranteed control can provide transient or steady-state performance constraints for spacecraft closed-loop systems and even the settling time upper bounds. However, it cannot guarantee the optimization of the control inputs. To optimize the control input under the condition of satisfying the performance constraints, considering the optimal control method based on adaptive dynamic programming (ADP), this chapter proposes the spacecraft performance guaranteed optimal control and its improved algorithm based on experience replay. Detailed theoretical derivation, analysis, and rigorous theoretical proofs of the algorithms are given. By applying these algorithms, the transient and steady-state behavior of the spacecraft closed-loop system can meet the desired constraints, and the control input can be optimized when the constraints are considered. Finally, comprehensive numerical simulation analysis and comparative simulation verify the effectiveness, feasibility, and advantages of the proposed algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abu-Khalaf, M., Lewis, F.L.: Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network hjb approach. Automatica 41(5), 779–791 (2005)
Finlayson, B.A.: The method of weighted residuals and variational principles. SIAM (2013)
Hornik, K., Stinchcombe, M., White, H.: Universal approximation of an unknown map** and its derivatives using multilayer feedforward networks. Neural networks 3(5), 551–560 (1990)
Vamvoudakis, K.G., Lewis, F.L.: Online actor–critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5), 878–888 (2010)
Liu, D., Li, H., Wang, D.: Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics. IEEE Transactions on Systems, Man, and Cybernetics: Systems 44(8), 1015–1027 (2014)
Liu, D., Li, H., Wang, D.: Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics. IEEE Transactions on Systems, Man, and Cybernetics: Systems 44(8), 1015–1027 (2014). 10.1109/TSMC.2013.2295351
Vamvoudakis, K.G., Lewis, F.L.: Multi-player non-zero-sum games: Online adaptive learning solution of coupled hamilton–jacobi equations. Automatica 47(8), 1556–1569 (2011)
Modares, H., Lewis, F.L., Naghibi-Sistani, M.B.: Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems. Automatica 50(1), 193–202 (2014)
Yasini, S., Naghibi Sitani, M.B., Kirampor, A.: Reinforcement learning and neural networks for multi-agent nonzero-sum games of nonlinear constrained-input systems. International Journal of Machine Learning and Cybernetics 7(6), 967–980 (2016)
Ioannou, P., Fidan, B.: Adaptive control tutorial. SIAM (2006)
Tao, G.: Adaptive control design and analysis, vol. 37. John Wiley & Sons (2003)
Zhang, H., Cui, L., Luo, Y.: Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network adp. IEEE transactions on cybernetics 43(1), 206–216 (2012)
Zhao, D., Zhang, Q., Wang, D., Zhu, Y.: Experience replay for optimal control of nonzero-sum game systems with unknown dynamics. IEEE transactions on cybernetics 46(3), 854–865 (2015)
Jiang, B., Li, C., Ma, G.: Finite-time output feedback attitude control for spacecraft using “adding a power integrator” technique. Aerospace Science and Technology 66, 342–354 (2017)
Bechlioulis, C.P., Rovithakis, G.A.: Robust adaptive control of feedback linearizable mimo nonlinear systems with prescribed performance. IEEE Transactions on Automatic Control 53(9), 2090–2099 (2008)
Du, H., Li, S., Qian, C.: Finite-time attitude tracking control of spacecraft with application to attitude synchronization. IEEE Transactions on Automatic Control 56(11), 2711–2717 (2011)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Gao, Y., Li, D. (2023). Spacecraft Performance Guaranteed Optimal Control with Adaptive Dynamic Programming. In: Spacecraft Maneuver with Performance Guaranteed. Springer, Singapore. https://doi.org/10.1007/978-981-99-4653-2_8
Download citation
DOI: https://doi.org/10.1007/978-981-99-4653-2_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-4652-5
Online ISBN: 978-981-99-4653-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)