Output-feedback H∞ quadratic tracking control of linear systems using reinforcement learning
This paper presents an online learning algorithm based on integral reinforcement learning (IRL) to design an output-feedback (OPFB) H∞ tracking controller for partially unknown linear continuous-time systems. Although reinforcement learning techniques have been successfully applied to find optimal state-feedback controllers, in most control applications, it is not practical to measure the full system states. Therefore, it is desired to design OPFB controllers. To this end, a general bounded L2-gain tracking problem with a discounted performance function is used for the OPFB H∞ tracking. A tracking game algebraic Riccati equation is then developed that gives a Nash equilibrium solution to the associated min-max optimization problem. An IRL algorithm is then developed to solve the game algebraic Riccati equation online without requiring complete knowledge of the system dynamics. The proposed IRL-based algorithm solves an IRL Bellman equation in each iteration online in real time to evaluate an OPFB policy and updates the OPFB gain using the information given by the evaluated policy. An adaptive observer is used to provide the knowledge of the full states for the IRL Bellman equation during learning. However, the observer is not needed after the learning process is finished. A simulation example is provided to verify the convergence of the proposed algorithm to a suboptimal OPFB solution and the performance of the proposed method. © 2017 John Wiley & Sons, Ltd.
International Journal of Adaptive Control and Signal Processing
Moghadam, R. and Lewis, F. L. (2019). Output-feedback H∞ quadratic tracking control of linear systems using reinforcement learning. International Journal of Adaptive Control Signal Processing 33(2): 300– 314. https://doi.org/10.1002/acs.2830