A hybrid transfer algorithm for reinforcement learning based on spectral method

被引：0

作者：

机构：

[1] Zhu, Mei-Qiang

[2] Cheng, Yu-Hu

[3] Li, Ming

[4] Wang, Xue-Song

[5] Feng, Huan-Ting

来源：

Zhu, M.-Q. (zhumeiqiang@cumt.edu.cn) | 1765年 / Science Press卷 / 38期

关键词：

Fiedler eigenvector - Hierarchical control structure - Hierarchical decompositions - Laplacian eigenmap - Number of iterations - Proto-Value Functions - Spectral graph theory - Spectral methods;

D O I：

10.3724/SP.J.1004.2012.01765

中图分类号：

学科分类号：

摘要：

For scaling up state space transfer underlying the proto-value function framework, only some basis functions corresponding to smaller eigenvalues are transferred effectively, which will result in wrong approximation of value function in the target task. In order to solve the problem, according to the fact that Laplacian eigenmap can preserve the local topology structure of state space, an improved hierarchical decomposition algorithm based on the spectral graph theory is proposed and a hybrid transfer method integrating basis function transfer with subtask optimal polices transfer is designed. At first, the basis functions of the source task are constructed using spectral method. The basis functions of target task are produced through linearly interpolating basis functions of the source task. Secondly, the produced second basis function of the target task (approximating Fiedler eigenvector) is used to decompose the target task. Then the optimal polices of subtasks are obtained using the improved hierarchical decomposition algorithm. At last, the obtained basis functions and optimal subtask polices are transferred to the target task. The proposed hybrid transfer method can directly get optimal policies of some states, reduce the number of iterations and the minimum number of basis functions needed to approximate the value function. The method is suitable for scaling up state space transfer task with hierarchical control structure. Simulation results of grid world have verified the validity of the proposed hybrid transfer method. © 2012 Acta Automatica Sinica.

引用

共 50 条

[31] Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning
Chalmers, Eric
Contreras, Edgar Bermudez
Robertson, Brandon
Luczak, Artur
Gruber, Aaron
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2259 - 2270
[32] VR Scene Detail Enhancement Method Based on Depth Reinforcement Learning Algorithm
Feng, Changbao
Tong, Xin
Zhu, Meili
Qu, Feng
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
[33] Action-Selection Method for Reinforcement Learning Based on Cuckoo Search Algorithm
Bilal H. Abed-alguni
Arabian Journal for Science and Engineering, 2018, 43 : 6771 - 6785
[34] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
Meng, Haitao
Zhang, Hengrui
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
[35] UAVs Maneuver Decision-Making Method Based on Transfer Reinforcement Learning
Zhu, Jindong
Fu, Xiaowei
Qiao, Zhe
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022 : 2399796
[36] Joint Reinforcement Learning Method Based on Roulette Algorithm and Simulated Annealing Strategy
Hu Jin-bo
Yang Rui-jun
Cheng Yan
2020 5TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS 2020), 2020, : 34 - 37
[37] A convergent Reinforcement Learning algorithm in the continuous case based on a Finite Difference method
Munos, R
IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 826 - 831
[38] OPTIMIZATION ALGORITHM FOR INTERPLANETARY TRANSFER TRAJECTORIES OF SOLAR SAILCRAFT BASED ON DEEP REINFORCEMENT LEARNING
Zhou, Chengyang
Cheng, Lin
Zhang, Qingzhen
Fang, Ke
FOURTH IAA CONFERENCE ON DYNAMICS AND CONTROL OF SPACE SYSTEMS 2018, PTS I-III, 2018, 165 : 1265 - 1275
[39] Action-Selection Method for Reinforcement Learning Based on Cuckoo Search Algorithm
Abed-alguni, Bilal H.
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 6771 - 6785
[40] Reinforcement Learning Based on Active Learning Method
Sagha, Hesam
Shouraki, Saeed Bagheri
Khasteh, Hosein
Kiaei, Ali Akbar
2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 598 - +

← 1 2 3 4 5 →