A hybrid transfer algorithm for reinforcement learning based on spectral method

被引:0
|
作者
机构
[1] Zhu, Mei-Qiang
[2] Cheng, Yu-Hu
[3] Li, Ming
[4] Wang, Xue-Song
[5] Feng, Huan-Ting
来源
Zhu, M.-Q. (zhumeiqiang@cumt.edu.cn) | 1765年 / Science Press卷 / 38期
关键词
Fiedler eigenvector - Hierarchical control structure - Hierarchical decompositions - Laplacian eigenmap - Number of iterations - Proto-Value Functions - Spectral graph theory - Spectral methods;
D O I
10.3724/SP.J.1004.2012.01765
中图分类号
学科分类号
摘要
For scaling up state space transfer underlying the proto-value function framework, only some basis functions corresponding to smaller eigenvalues are transferred effectively, which will result in wrong approximation of value function in the target task. In order to solve the problem, according to the fact that Laplacian eigenmap can preserve the local topology structure of state space, an improved hierarchical decomposition algorithm based on the spectral graph theory is proposed and a hybrid transfer method integrating basis function transfer with subtask optimal polices transfer is designed. At first, the basis functions of the source task are constructed using spectral method. The basis functions of target task are produced through linearly interpolating basis functions of the source task. Secondly, the produced second basis function of the target task (approximating Fiedler eigenvector) is used to decompose the target task. Then the optimal polices of subtasks are obtained using the improved hierarchical decomposition algorithm. At last, the obtained basis functions and optimal subtask polices are transferred to the target task. The proposed hybrid transfer method can directly get optimal policies of some states, reduce the number of iterations and the minimum number of basis functions needed to approximate the value function. The method is suitable for scaling up state space transfer task with hierarchical control structure. Simulation results of grid world have verified the validity of the proposed hybrid transfer method. © 2012 Acta Automatica Sinica.
引用
收藏
相关论文
共 50 条
  • [31] Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning
    Chalmers, Eric
    Contreras, Edgar Bermudez
    Robertson, Brandon
    Luczak, Artur
    Gruber, Aaron
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2259 - 2270
  • [32] VR Scene Detail Enhancement Method Based on Depth Reinforcement Learning Algorithm
    Feng, Changbao
    Tong, Xin
    Zhu, Meili
    Qu, Feng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [33] Action-Selection Method for Reinforcement Learning Based on Cuckoo Search Algorithm
    Bilal H. Abed-alguni
    Arabian Journal for Science and Engineering, 2018, 43 : 6771 - 6785
  • [34] Mobile Robot Path Planning Method Based on Deep Reinforcement Learning Algorithm
    Meng, Haitao
    Zhang, Hengrui
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [35] UAVs Maneuver Decision-Making Method Based on Transfer Reinforcement Learning
    Zhu, Jindong
    Fu, Xiaowei
    Qiao, Zhe
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022 : 2399796
  • [36] Joint Reinforcement Learning Method Based on Roulette Algorithm and Simulated Annealing Strategy
    Hu Jin-bo
    Yang Rui-jun
    Cheng Yan
    2020 5TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS 2020), 2020, : 34 - 37
  • [37] A convergent Reinforcement Learning algorithm in the continuous case based on a Finite Difference method
    Munos, R
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 826 - 831
  • [38] OPTIMIZATION ALGORITHM FOR INTERPLANETARY TRANSFER TRAJECTORIES OF SOLAR SAILCRAFT BASED ON DEEP REINFORCEMENT LEARNING
    Zhou, Chengyang
    Cheng, Lin
    Zhang, Qingzhen
    Fang, Ke
    FOURTH IAA CONFERENCE ON DYNAMICS AND CONTROL OF SPACE SYSTEMS 2018, PTS I-III, 2018, 165 : 1265 - 1275
  • [39] Action-Selection Method for Reinforcement Learning Based on Cuckoo Search Algorithm
    Abed-alguni, Bilal H.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 6771 - 6785
  • [40] Reinforcement Learning Based on Active Learning Method
    Sagha, Hesam
    Shouraki, Saeed Bagheri
    Khasteh, Hosein
    Kiaei, Ali Akbar
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 598 - +