Neural Q-Learning Based on Residual Gradient for Nonlinear Control Systems

被引:0
|
作者
Si, Yanna [1 ]
Pu, Jiexin [1 ]
Zang, Shaofei [1 ]
机构
[1] Henan Univ Sci & Technol, Sch Informat Engn, Luoyang, Peoples R China
关键词
Q-learning; feedforward neural network; value function approximation; residual gradient method; nonlinear control systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To solve the control problem of nonlinear system under continuous state space, this paper puts forward a neural Q-learning algorithm based on residual gradient method. Firstly, the multi-layer feedforward neural network is utilized to approximate the Q-value function, overcoming the "dimensional disaster" in the classical reinforcement learning. Then based on the residual gradient method, a mini-batch gradient descent is implemented by the experience replay to update the neural network parameters, which can effectively reduce the iterations number and increase the learning speed. Moreover, the momentum optimization method is introduced to ensure the stability of the training process further and improve the convergence. In order to balance exploration and utilization better, epsilon-decreasing strategy replaces epsilon-greedy for action selection. The simulation results of CartPole control task show the correctness and effectiveness of the proposed algorithm.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Q-learning based fault estimation and fault tolerant iterative learning control for MIMO systems
    Wang, Rui
    Zhuang, Zhihe
    Tao, Hongfeng
    Paszke, Wojciech
    Stojanovic, Vladimir
    ISA TRANSACTIONS, 2023, 142 : 123 - 135
  • [22] Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot
    Xu, X
    He, HG
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2002, : 758 - 763
  • [23] Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems
    Shen, Ziwen
    Dong, Tao
    Huang, Tingwen
    NEURAL NETWORKS, 2024, 180
  • [24] Neural Q-learning for solving PDEs
    Cohen, Samuel N.
    Jiang, Deqing
    Sirignano, Justin
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [25] An optimal control method for hybrid systems based on Q-learning for an intersection traffic signal control
    Zhao, Xiaohua
    Li, Zhenlong
    Chen, Yangzhou
    Li, Yunchi
    Gaojishu Tongxin/Chinese High Technology Letters, 2007, 17 (05): : 498 - 502
  • [26] Power Control Algorithm Based on Q-Learning in Femtocell
    Li Yun
    Tang Ying
    Liu Hanxiao
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2557 - 2564
  • [27] Switching control of morphing aircraft based on Q-learning
    Gong, Ligang
    Wang, Qing
    Hu, Changhua
    Liu, Chen
    CHINESE JOURNAL OF AERONAUTICS, 2020, 33 (02) : 672 - 687
  • [28] Switching control of morphing aircraft based on Q-learning
    Ligang GONG
    Qing WANG
    Changhua HU
    Chen LIU
    Chinese Journal of Aeronautics, 2020, (02) : 672 - 687
  • [29] Ramp Metering Control Based on the Q-Learning Algorithm
    Ivanjko, Edouard
    Necoska, Daniela Koltovska
    Greguric, Martin
    Vujic, Miroslav
    Jurkovic, Goran
    Mandzuka, Sadko
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2015, 15 (05) : 88 - 97
  • [30] Switching control of morphing aircraft based on Q-learning
    Ligang GONG
    Qing WANG
    Changhua HU
    Chen LIU
    Chinese Journal of Aeronautics, 2020, 33 (02) : 672 - 687