A hybrid transfer algorithm for reinforcement learning based on spectral method

Cited by: 0
Authors
[1] Zhu, Mei-Qiang
[2] Cheng, Yu-Hu
[3] Li, Ming
[4] Wang, Xue-Song
[5] Feng, Huan-Ting
Source
Zhu, M.-Q. (zhumeiqiang@cumt.edu.cn) | Acta Automatica Sinica, 2012, Vol. 38, p. 1765 (Science Press)
Keywords
Fiedler eigenvector - Hierarchical control structure - Hierarchical decompositions - Laplacian eigenmap - Number of iterations - Proto-Value Functions - Spectral graph theory - Spectral methods
DOI
10.3724/SP.J.1004.2012.01765
Abstract
When the state space is scaled up for transfer under the proto-value function framework, only the basis functions corresponding to the smaller eigenvalues are transferred effectively, which leads to an incorrect approximation of the value function in the target task. To solve this problem, and exploiting the fact that the Laplacian eigenmap preserves the local topological structure of the state space, an improved hierarchical decomposition algorithm based on spectral graph theory is proposed, and a hybrid transfer method that integrates basis function transfer with the transfer of optimal subtask policies is designed. First, the basis functions of the source task are constructed using the spectral method, and the basis functions of the target task are produced by linearly interpolating those of the source task. Second, the resulting second basis function of the target task (which approximates the Fiedler eigenvector) is used to decompose the target task. Third, the optimal policies of the subtasks are obtained using the improved hierarchical decomposition algorithm. Finally, the obtained basis functions and optimal subtask policies are transferred to the target task. The proposed hybrid transfer method directly yields the optimal policies of some states, and reduces both the number of iterations and the minimum number of basis functions needed to approximate the value function. The method is suited to scaling-up state space transfer tasks with a hierarchical control structure. Simulation results on a grid world verify the validity of the proposed hybrid transfer method. © 2012 Acta Automatica Sinica.
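The spectral steps named in the abstract (constructing basis functions from the graph Laplacian and splitting the task with the Fiedler eigenvector) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the two-room grid layout, the 4-connected adjacency, the use of the combinatorial Laplacian L = D - W, and the hypothetical helper names `grid_laplacian` and `proto_value_functions`. The interpolation and policy-transfer steps are not covered.

```python
import numpy as np


def grid_laplacian(rows, cols, walls=frozenset()):
    """Combinatorial Laplacian L = D - W of a 4-connected grid world.

    `walls` holds (row, col) cells that are not states (assumed layout).
    """
    states = [(r, c) for r in range(rows) for c in range(cols)
              if (r, c) not in walls]
    index = {s: i for i, s in enumerate(states)}
    n = len(states)
    W = np.zeros((n, n))
    for (r, c), i in index.items():
        # Link each state to its right and lower neighbour (undirected edges).
        for dr, dc in ((0, 1), (1, 0)):
            j = index.get((r + dr, c + dc))
            if j is not None:
                W[i, j] = W[j, i] = 1.0
    L = np.diag(W.sum(axis=1)) - W
    return states, L


def proto_value_functions(L, k):
    """Return the k smallest eigenvalues of L and their eigenvectors.

    The eigenvectors (columns) play the role of basis functions.
    """
    eigvals, eigvecs = np.linalg.eigh(L)  # eigenvalues in ascending order
    return eigvals[:k], eigvecs[:, :k]


# Hypothetical source task: two 5x5 rooms joined by a single door cell.
rows, cols, door = 5, 11, (2, 5)
walls = frozenset((r, 5) for r in range(rows) if (r, 5) != door)
states, L = grid_laplacian(rows, cols, walls)

eigvals, basis = proto_value_functions(L, 4)

# The second basis function is the Fiedler eigenvector; its sign pattern
# gives a two-way decomposition of the state space.
fiedler = basis[:, 1]
left_room = [fiedler[i] for i, (r, c) in enumerate(states) if c < 5]
right_room = [fiedler[i] for i, (r, c) in enumerate(states) if c > 5]
```

In this layout the sign of the Fiedler eigenvector separates the two rooms, with the door cell lying on the boundary between them, which is the kind of bottleneck-based split a hierarchical decomposition can build on. The overall sign of an eigenvector is arbitrary, so only the relative signs of the two rooms are meaningful.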
Related papers
  • [1] A substructure transfer reinforcement learning method based on metric learning
    Chai, Peihua
    Chen, Bilian
    Zeng, Yifeng
    Yu, Shenbao
    NEUROCOMPUTING, 2024, 598
  • [2] Hybrid algorithm based on reinforcement learning for smart inventory management
    Cuartas, Carlos
    Aguilar, Jose
    JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (01) : 123 - 149
  • [3] Hybrid algorithm based on reinforcement learning for smart inventory management
    Carlos Cuartas
    Jose Aguilar
    Journal of Intelligent Manufacturing, 2023, 34 : 123 - 149
  • [4] A new path plan method based on hybrid algorithm of reinforcement learning and particle swarm optimization
    Liu, Xiaohuan
    Zhang, Degan
    Zhang, Ting
    Zhang, Jie
    Wang, Jiaxu
    ENGINEERING COMPUTATIONS, 2022, 39 (03) : 993 - 1019
  • [5] Recommendation algorithm based on improved spectral clustering and transfer learning
    Xiang Li
    Zhijian Wang
    Ronglin Hu
    Quanyin Zhu
    Liuyang Wang
    Pattern Analysis and Applications, 2019, 22 : 633 - 647
  • [6] Recommendation algorithm based on improved spectral clustering and transfer learning
    Li, Xiang
    Wang, Zhijian
    Hu, Ronglin
    Zhu, Quanyin
    Wang, Liuyang
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (02) : 633 - 647
  • [7] Hybrid Dynamic Control Algorithm for Humanoid Robots Based on Reinforcement Learning
    Duśko M. Katić
    Aleksandar D. Rodić
    Miomir K. Vukobratović
    Journal of Intelligent and Robotic Systems, 2008, 51 : 3 - 30
  • [8] Hybrid dynamic control algorithm for humanoid robots based on reinforcement learning
    Katic, Dusko M.
    Rodic, Aleksandar D.
    Vukobratovic, Miomir K.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2008, 51 (01) : 3 - 30
  • [9] A concept learning method based on a hybrid genetic algorithm
    Juan Liu
    Weihua Li
    Science in China Series E: Technological Sciences, 1998, 41 : 488 - 495
  • [10] A concept learning method based on a hybrid genetic algorithm
    刘娟
    李卫华
    Science in China (Series E: Technological Sciences), 1998, (05) : 488 - 495