A hybrid transfer algorithm for reinforcement learning based on spectral method

被引：0

作者：

机构：

[1] Zhu, Mei-Qiang

[2] Cheng, Yu-Hu

[3] Li, Ming

[4] Wang, Xue-Song

[5] Feng, Huan-Ting

来源：

Zhu, M.-Q. (zhumeiqiang@cumt.edu.cn) | 1765年 / Science Press卷 / 38期

关键词：

Fiedler eigenvector - Hierarchical control structure - Hierarchical decompositions - Laplacian eigenmap - Number of iterations - Proto-Value Functions - Spectral graph theory - Spectral methods;

D O I：

10.3724/SP.J.1004.2012.01765

中图分类号：

学科分类号：

摘要：

For scaling up state space transfer underlying the proto-value function framework, only some basis functions corresponding to smaller eigenvalues are transferred effectively, which will result in wrong approximation of value function in the target task. In order to solve the problem, according to the fact that Laplacian eigenmap can preserve the local topology structure of state space, an improved hierarchical decomposition algorithm based on the spectral graph theory is proposed and a hybrid transfer method integrating basis function transfer with subtask optimal polices transfer is designed. At first, the basis functions of the source task are constructed using spectral method. The basis functions of target task are produced through linearly interpolating basis functions of the source task. Secondly, the produced second basis function of the target task (approximating Fiedler eigenvector) is used to decompose the target task. Then the optimal polices of subtasks are obtained using the improved hierarchical decomposition algorithm. At last, the obtained basis functions and optimal subtask polices are transferred to the target task. The proposed hybrid transfer method can directly get optimal policies of some states, reduce the number of iterations and the minimum number of basis functions needed to approximate the value function. The method is suitable for scaling up state space transfer task with hierarchical control structure. Simulation results of grid world have verified the validity of the proposed hybrid transfer method. © 2012 Acta Automatica Sinica.

引用

共 50 条

[1] A substructure transfer reinforcement learning method based on metric learning
Chai, Peihua
Chen, Bilian
Zeng, Yifeng
Yu, Shenbao
NEUROCOMPUTING, 2024, 598
[2] Hybrid algorithm based on reinforcement learning for smart inventory management
Cuartas, Carlos
Aguilar, Jose
JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (01) : 123 - 149
[3] Hybrid algorithm based on reinforcement learning for smart inventory management
Carlos Cuartas
Jose Aguilar
Journal of Intelligent Manufacturing, 2023, 34 : 123 - 149
[4] A new path plan method based on hybrid algorithm of reinforcement learning and particle swarm optimization
Liu, Xiaohuan
Zhang, Degan
Zhang, Ting
Zhang, Jie
Wang, Jiaxu
ENGINEERING COMPUTATIONS, 2022, 39 (03) : 993 - 1019
[5] Recommendation algorithm based on improved spectral clustering and transfer learning
Xiang Li
Zhijian Wang
Ronglin Hu
Quanyin Zhu
Liuyang Wang
Pattern Analysis and Applications, 2019, 22 : 633 - 647
[6] Recommendation algorithm based on improved spectral clustering and transfer learning
Li, Xiang
Wang, Zhijian
Hu, Ronglin
Zhu, Quanyin
Wang, Liuyang
PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (02) : 633 - 647
[7] Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning
Duśko M. Katić
Aleksandar D. Rodić
Miomir K. Vukobratović
Journal of Intelligent and Robotic Systems, 2008, 51 : 3 - 30
[8] Hybrid dynamic control algorithm for humanoid robots based on reinforcement learning
Katic, Dusko M.
Rodic, Aleksandar D.
Vukobratovic, Miomir K.
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2008, 51 (01) : 3 - 30
[9] A concept learning method based on a hybrid genetic algorithm
Juan Liu
Weihua Li
Science in China Series E: Technological Sciences, 1998, 41 : 488 - 495
[10] A concept learning method based on a hybrid genetic algorithm
刘娟
李卫华
Science in China(Series E:Technological Sciences), 1998, (05) : 488 - 495

← 1 2 3 4 5 →