Neural Q-Learning Based on Residual Gradient for Nonlinear Control Systems

被引：0

作者：

Si, Yanna ^{[1
]}

Pu, Jiexin ^{[1
]}

Zang, Shaofei ^{[1
]}

机构：

[1] Henan Univ Sci & Technol, Sch Informat Engn, Luoyang, Peoples R China

来源：

ICCAIS 2019: THE 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES | 2019年

关键词：

Q-learning; feedforward neural network; value function approximation; residual gradient method; nonlinear control systems;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To solve the control problem of nonlinear system under continuous state space, this paper puts forward a neural Q-learning algorithm based on residual gradient method. Firstly, the multi-layer feedforward neural network is utilized to approximate the Q-value function, overcoming the "dimensional disaster" in the classical reinforcement learning. Then based on the residual gradient method, a mini-batch gradient descent is implemented by the experience replay to update the neural network parameters, which can effectively reduce the iterations number and increase the learning speed. Moreover, the momentum optimization method is introduced to ensure the stability of the training process further and improve the convergence. In order to balance exploration and utilization better, epsilon-decreasing strategy replaces epsilon-greedy for action selection. The simulation results of CartPole control task show the correctness and effectiveness of the proposed algorithm.

引用

页数：5

共 50 条

[21] Q-learning based fault estimation and fault tolerant iterative learning control for MIMO systems
Wang, Rui
Zhuang, Zhihe
Tao, Hongfeng
Paszke, Wojciech
Stojanovic, Vladimir
ISA TRANSACTIONS, 2023, 142 : 123 - 135
[22] Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot
Xu, X
He, HG
PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2002, : 758 - 763
[23] Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems
Shen, Ziwen
Dong, Tao
Huang, Tingwen
NEURAL NETWORKS, 2024, 180
[24] Neural Q-learning for solving PDEs
Cohen, Samuel N.
Jiang, Deqing
Sirignano, Justin
JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
[25] An optimal control method for hybrid systems based on Q-learning for an intersection traffic signal control
Zhao, Xiaohua
Li, Zhenlong
Chen, Yangzhou
Li, Yunchi
Gaojishu Tongxin/Chinese High Technology Letters, 2007, 17 (05): : 498 - 502
[26] Power Control Algorithm Based on Q-Learning in Femtocell
Li Yun
Tang Ying
Liu Hanxiao
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2557 - 2564
[27] Switching control of morphing aircraft based on Q-learning
Gong, Ligang
Wang, Qing
Hu, Changhua
Liu, Chen
CHINESE JOURNAL OF AERONAUTICS, 2020, 33 (02) : 672 - 687
[28] Switching control of morphing aircraft based on Q-learning
Ligang GONG
Qing WANG
Changhua HU
Chen LIU
Chinese Journal of Aeronautics, 2020, (02) : 672 - 687
[29] Ramp Metering Control Based on the Q-Learning Algorithm
Ivanjko, Edouard
Necoska, Daniela Koltovska
Greguric, Martin
Vujic, Miroslav
Jurkovic, Goran
Mandzuka, Sadko
CYBERNETICS AND INFORMATION TECHNOLOGIES, 2015, 15 (05) : 88 - 97
[30] Switching control of morphing aircraft based on Q-learning
Ligang GONG
Qing WANG
Changhua HU
Chen LIU
Chinese Journal of Aeronautics, 2020, 33 (02) : 672 - 687

← 1 2 3 4 5 →