A hybrid transfer algorithm for reinforcement learning based on spectral method

Authors
[1] Zhu, Mei-Qiang
[2] Cheng, Yu-Hu
[3] Li, Ming
[4] Wang, Xue-Song
[5] Feng, Huan-Ting
Source
Zhu, M.-Q. (zhumeiqiang@cumt.edu.cn) | Science Press, Vol. 38, p. 1765
Keywords
Fiedler eigenvector - Hierarchical control structure - Hierarchical decompositions - Laplacian eigenmap - Number of iterations - Proto-Value Functions - Spectral graph theory - Spectral methods
DOI
10.3724/SP.J.1004.2012.01765
Abstract
In scaling-up state space transfer under the proto-value function framework, only some basis functions, those corresponding to the smaller eigenvalues, are transferred effectively, which leads to an inaccurate approximation of the value function in the target task. To solve this problem, and exploiting the fact that the Laplacian eigenmap preserves the local topological structure of the state space, an improved hierarchical decomposition algorithm based on spectral graph theory is proposed, and a hybrid transfer method that integrates basis function transfer with the transfer of optimal subtask policies is designed. First, the basis functions of the source task are constructed using the spectral method, and the basis functions of the target task are produced by linearly interpolating those of the source task. Second, the produced second basis function of the target task (which approximates the Fiedler eigenvector) is used to decompose the target task. Then the optimal policies of the subtasks are obtained using the improved hierarchical decomposition algorithm. Finally, the obtained basis functions and optimal subtask policies are transferred to the target task. The proposed hybrid transfer method can directly obtain the optimal policies of some states, and it reduces both the number of iterations and the minimum number of basis functions needed to approximate the value function. The method is suited to scaling-up state space transfer tasks with a hierarchical control structure. Grid-world simulation results verify the validity of the proposed hybrid transfer method. © 2012 Acta Automatica Sinica.
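The first two steps of the abstract (spectral basis construction, linear interpolation to a larger state space, and decomposition by the approximate Fiedler eigenvector) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the obstacle-free 4-connected grid, the 3 x 6 source and 6 x 12 target sizes, and the sign-based split are all assumptions chosen for illustration, and the improved hierarchical decomposition and policy transfer steps are not shown.

```python
import numpy as np


def grid_laplacian(rows, cols):
    """Combinatorial Laplacian L = D - W of a 4-connected rows x cols grid."""
    n = rows * cols
    W = np.zeros((n, n))
    for r in range(rows):
        for c in range(cols):
            i = r * cols + c
            if c + 1 < cols:  # edge to the right neighbour
                W[i, i + 1] = W[i + 1, i] = 1.0
            if r + 1 < rows:  # edge to the neighbour below
                W[i, i + cols] = W[i + cols, i] = 1.0
    return np.diag(W.sum(axis=1)) - W


def spectral_basis(L, k):
    """Return the k smallest eigenvalues and their eigenvectors (as columns)."""
    vals, vecs = np.linalg.eigh(L)  # eigh returns eigenvalues in ascending order
    return vals[:k], vecs[:, :k]


def interpolate_basis(phi, src_shape, dst_shape):
    """Linearly interpolate one basis function from the source grid to the target grid."""
    grid = phi.reshape(src_shape)
    src_r = np.linspace(0.0, 1.0, src_shape[0])
    src_c = np.linspace(0.0, 1.0, src_shape[1])
    dst_r = np.linspace(0.0, 1.0, dst_shape[0])
    dst_c = np.linspace(0.0, 1.0, dst_shape[1])
    # Interpolate along columns first, then along rows (bilinear overall).
    tmp = np.array([np.interp(dst_c, src_c, row) for row in grid])
    out = np.array([np.interp(dst_r, src_r, col) for col in tmp.T]).T
    return out.ravel()


def fiedler_partition(phi2):
    """Split states into two subtasks by the sign of the second basis function."""
    return (phi2 >= 0).astype(int)


# Hypothetical source and target tasks: the target doubles each grid dimension.
src_shape, dst_shape = (3, 6), (6, 12)
vals, basis = spectral_basis(grid_laplacian(*src_shape), k=4)

# Column 1 is the Fiedler eigenvector of the source task; interpolating it
# gives an approximation of the target task's Fiedler eigenvector.
target_phi2 = interpolate_basis(basis[:, 1], src_shape, dst_shape)
labels = fiedler_partition(target_phi2).reshape(dst_shape)
print(labels)
```

For this rectangular grid the printed labels split the target states into a left half and a right half; which half is labelled 1 depends on the arbitrary sign of the eigenvector returned by the solver. A rectangular grid is used because the second eigenvalue of a square grid is repeated, so its Fiedler eigenvector is not unique.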