Abstract
External disturbances and asymmetric input constraints may cause a major problem to the optimal control of the system. Aiming at such problem, this article presents a safe and optimal robust control method based on adaptive dynamic programming (ADP) to ensure the system operated in a safe region and with the optimal performance. Initially, a novel nonquadratic form cost function is imported for the system to address the asymmetric input constraints. Then, to ensure the safety of the system, a control barrier function (CBF) is appended to the cost function to penalize the unsafe behavior. And a damping factor is also introduced to the CBF to balance safety and optimality. Finally, one single critic network is utilized to simplify the complex computational steps, which is different from the traditional actor-critic networks to address the Hamilton-Jacobi-Bellman Equation (HJBE) for obtaining the optimal neural controller. Additionally, based on Lyapunov method, all signals in the closed-loop system are proven to be uniformly ultimately bounded (UUB). At last, the experimental results confirm the effectiveness of the designed approach.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability Statement
The authors can confirm that all relevant data are included in the article.
References
Liu D, Xue S, Zhao B, Luo B, Wei Q (2020) Adaptive dynamic programming for control: a survey and recent advances. IEEE Transactions on Systems, Man, and Cybernetics: Systems 51(1):142–160
Wang D, Ha M, Qiao J (2020) Data-driven iterative adaptive critic control toward an urban wastewater treatment plant. IEEE Trans Industr Electron 68(8):7362–7369
Wang D, Qiao J, Cheng L (2020) An approximate neuro-optimal solution of discounted guaranteed cost control design. IEEE Transactions on Cybernetics 52(1):77–86
Wang D, Li X, Zhao M, Qiao J (2023) Adaptive critic control design with knowledge transfer for wastewater treatment applications. IEEE Transactions on industrial informatics
Wei Q, Zhou T, Lu J, Liu Y, Su S, Xiao J (2023) Continuous-time stochastic policy iteration of adaptive dynamic programming. IEEE Transactions on Systems, Man, and Cybernetics: Systems
Qin C, Wang J, Zhu H, Zhang J, Hu S, Zhang D (2022) Neural network-based safe optimal robust control for affine nonlinear systems with unmatched disturbances. Neurocomputing 506:228–239
Song R, Liu L, Xia L, Lewis FL (2022) Online optimal event-triggered h\(\infty \) control for nonlinear systems with constrained state and input. IEEE Transactions on Systems, Man, and Cybernetics: Systems 53(1):131–141
Bellman R (1966) Dynamic programming. Science 153(3731):34–37
Werbos P (1977) Advanced forecasting methods for global crisis warning and models of intelligence. Gen Syst Yearb 25–38
Chauhan S, Singh M, Aggarwal AK (2023) Designing of optimal digital IIR filter in the multi-objective framework using an evolutionary algorithm. Eng Appl Artif Intell 119:105803
Chauhan S, Singh M, Aggarwal AK (2021) Experimental analysis of effect of tuning parameters on the performance of diversity-driven multi-parent evolutionary algorithm. In:2021 IEEE 2Nd International conference on electrical power and energy systems (ICEPES). IEEE, pp 1–6
Chauhan S, Singh M, Aggarwal AK (2023) Investigative analysis of different mutation on diversity-driven multi-parent evolutionary algorithm and its application in area coverage optimization of WSN. Soft Comput 1–27
Wang N, Gao Y, Yang C, Zhang X (2022) Reinforcement learning-based finite-time tracking control of an unknown unmanned surface vehicle with input constraints. Neurocomputing 484:26–37
Sun J, Zhang H, Yan Y, Xu S, Fan X (2021) Optimal regulation strategy for nonzero-sum games of the immune system using adaptive dynamic programming. IEEE Transactions on cybernetics
Zhang H, Wang H, Niu B, Zhang L, Ahmad AM (2021) Sliding-mode surface-based adaptive actor-critic optimal control for switched nonlinear systems with average dwell time. Inf Sci 580:756–774
Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5):878–888
Li D, Dong J (2022) Fuzzy control based on reinforcement learning and subsystem error derivatives for strict-feedback systems with an observer. IEEE Transactions on Fuzzy Systems
Li D, Dong J (2023) Fuzzy weight-based reinforcement learning for event-triggered optimal backstepping control of fractional-order nonlinear systems. IEEE Transactions on Fuzzy Systems
Huang X, Dong J (2020) ADP-based robust resilient control of partially unknown nonlinear systems via cooperative interaction design. IEEE Transactions on Systems, Man, and Cybernetics: Systems 51(12):7466–7474
Wang K, Mu C, Ni Z, Liu D (2023) Safe reinforcement learning and adaptive optimal control with applications to obstacle avoidance problem. IEEE Transactions on Automation Science and Engineering
Farzanegan B, Jagannathan S (2023) Continual reinforcement learning formulation for zero-sum game-based constrained optimal tracking. IEEE Transactions on Systems, Man, and Cybernetics: Systems
Marvi Z, Kiumarsi B (2021) Safe reinforcement learning: a control barrier function optimization approach. Int J Robust Nonlinear Control 31(6):1923–1940
Qin C, Zhang Z, Shang Z, Zhang J, Zhang D (2023) Adaptive optimal safety tracking control for multiplayer mixed zero-sum games of continuous-time systems. Appl Intell 1–16
Shi L, Wang X, Cheng Y (2023) Safe reinforcement learning-based robust approximate optimal control for hypersonic flight vehicles. IEEE Transactions on vehicular technology
Qin C, Qiao X, Wang J, Zhang D, Hou Y, Hu, S (2023) Barrier-Critic adaptive robust control of nonzero-sum differential games for uncertain nonlinear systems with state constraints. IEEE Transactions on Systems, Man, and Cybernetics: Systems
Zhang Y, Zhao B, Liu D, Zhang S (2022) Adaptive dynamic programming-based event-triggered robust control for multiplayer nonzero-sum games with unknown dynamics1-4mmplease verify and confirm the term “multi-player” has been changed to “multiplayer” in the title of this article. IEEE Transactions on Cybernetics
Zhao J, Na J, Gao G (2022) Robust tracking control of uncertain nonlinear systems with adaptive dynamic programming. Neurocomputing 471:21–30
Yang X, Xu M, Wei Q (2023) Adaptive dynamic programming for nonlinear-constrained h\(\infty \) control. IEEE Transactions on Systems, Man, and Cybernetics: Systems
Ji R, Ge SS, Li D (2023) Saturation-tolerant prescribed control for nonlinear systems with unknown control directions and external disturbances. IEEE Transactions on Cybernetics
Xu S, He B (2023) Robust adaptive fuzzy fault tolerant control of robot manipulators with unknown parameters. IEEE Transactions on fuzzy systems
Yang M, Ma H, Li X, Shang C, Shen Q (2022) Bus bridging for rail disruptions: a distributionally robust fuzzy optimization approach. IEEE Transactions on Fuzzy Systems
Gutierrez-Oribio D, Orlov Y, Stefanou I, Plestan F (2022) Robust tracking for the diffusion equation using sliding-mode boundary control. In: 2022 IEEE 61st Conference on decision and control (CDC). IEEE, pp 6076–6081
Chen J, Lyu L, Fei Z, Xia W, Sun X-M (2023) Event-triggered adaptive robust control for a class of uncertain nonlinear systems with application to mechatronic system. IEEE Transactions on Industrial Informatics
Sun N, Liang D, Wu Y, Chen Y, Qin Y, Fang Y (2019) Adaptive control for pneumatic artificial muscle systems with parametric uncertainties and unidirectional input constraints. IEEE Trans Industr Inf 16(2):969–979
Zhu Y, Zhao D, He H, Ji J (2016) Event-triggered optimal control for partially unknown constrained-input systems via adaptive dynamic programming. IEEE Trans Industr Electron 64(5):4101–4109
Wu Q, Zhao B, Liu D, Polycarpou MM (2023) Event-triggered adaptive dynamic programming for decentralized tracking control of input constrained unknown nonlinear interconnected systems. Neural Netw 157:336–349
Xue S, Luo B, Liu D, Li Y (2020) Adaptive dynamic programming based event-triggered control for unknown continuous-time nonlinear systems with input constraints. Neurocomputing 396:191–200
Yang X, Zhao B (2020) Optimal neuro-control strategy for nonlinear systems with asymmetric input constraints. IEEE/CAA Journal of Automatica Sinica 7(2):575–583
Kong L, He W, Dong Y, Cheng L, Yang C, Li Z (2019) Asymmetric bounded neural control for an uncertain robot by state feedback and output feedback. IEEE Transactions on Systems, Man, and Cybernetics: Systems 51(3):1735–1746
Zhao Y, Wang H, Xu N, Zong G, Zhao X (2023) Reinforcement learning-based decentralized fault tolerant control for constrained interconnected nonlinear systems. Chaos, Solitons & Fractals 167:113034
Qiao J, Li M, Wang D (2022) Asymmetric constrained optimal tracking control with critic learning of nonlinear multiplayer zero-sum games. IEEE Transactions on Neural Networks and Learning Systems
Yang X, Wei Q (2020) Adaptive critic learning for constrained optimal event-triggered control with discounted cost. IEEE Transactions on Neural Networks and Learning Systems 32(1):91–104
Sun Y, Li C, Qin H, Deng Z, Chen Z (2022) Robust neural network-based tracking control for unmanned surface vessels under deferred asymmetric constraints. Int J Robust Nonlinear Control 32(5):2741–2759
Yang X, Zhou Y, Gao Z (2023) Reinforcement learning for robust stabilization of nonlinear systems with asymmetric saturating actuators. Neural Netw 158:132–141
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China under Grant (62001359), in part by the Fundamental Research Funds for the Central Universities (ZYTS23163), in part by the Science and Technology Research Project of the Henan Province under Grant (222102240014, 232102211059, 232102211047).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, D., Wang, Y., Jiang, K. et al. Safe optimal robust control of nonlinear systems with asymmetric input constraints using reinforcement learning. Appl Intell 54, 1–13 (2024). https://doi.org/10.1007/s10489-023-05184-1
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-023-05184-1