Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics

Zhu Yuanheng Zhao Dongbin Li Xiangjun · 2016

阅读量：135

期刊名称：

IET Control Theory and Applications 2016 年 10 卷 12 期

发表日期：

2016.08.08

摘要：

The optimal tracking of non-linear systems without knowing system dynamics is an important and intractable problem. Based on the framework of reinforcement learning (RL) and adaptive dynamic programming, a model-free adaptive optimal tracking algorithm is proposed in this study. After constructing an augmented system with the tracking errors and the reference states, the tracking problem is converted to a regulation problem with respect to the new system. Several RL techniques are synthesised to form a novel algorithm which learns the optimal solution online in real time without any information of the system dynamics. Continuous adaptation laws are defined by the current observations and the past experience. The convergence is guaranteed by Lyapunov analysis. Two simulations on a linear and a non-linear systems demonstrate the performance of the proposed approach.