A hybrid transfer algorithm for reinforcement learning based on spectral method

被引：0

作者：

机构：

[1] Zhu, Mei-Qiang

[2] Cheng, Yu-Hu

[3] Li, Ming

[4] Wang, Xue-Song

[5] Feng, Huan-Ting

来源：

Zhu, M.-Q. (zhumeiqiang@cumt.edu.cn) | 1765年 / Science Press卷 / 38期

关键词：

Fiedler eigenvector - Hierarchical control structure - Hierarchical decompositions - Laplacian eigenmap - Number of iterations - Proto-Value Functions - Spectral graph theory - Spectral methods;

D O I：

10.3724/SP.J.1004.2012.01765

中图分类号：

学科分类号：

摘要：

For scaling up state space transfer underlying the proto-value function framework, only some basis functions corresponding to smaller eigenvalues are transferred effectively, which will result in wrong approximation of value function in the target task. In order to solve the problem, according to the fact that Laplacian eigenmap can preserve the local topology structure of state space, an improved hierarchical decomposition algorithm based on the spectral graph theory is proposed and a hybrid transfer method integrating basis function transfer with subtask optimal polices transfer is designed. At first, the basis functions of the source task are constructed using spectral method. The basis functions of target task are produced through linearly interpolating basis functions of the source task. Secondly, the produced second basis function of the target task (approximating Fiedler eigenvector) is used to decompose the target task. Then the optimal polices of subtasks are obtained using the improved hierarchical decomposition algorithm. At last, the obtained basis functions and optimal subtask polices are transferred to the target task. The proposed hybrid transfer method can directly get optimal policies of some states, reduce the number of iterations and the minimum number of basis functions needed to approximate the value function. The method is suitable for scaling up state space transfer task with hierarchical control structure. Simulation results of grid world have verified the validity of the proposed hybrid transfer method. © 2012 Acta Automatica Sinica.

引用

共 50 条

[21] Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation
Wang, Ze
Liao, Guogang
Shi, Xiaowen
Wu, Xiaoxu
Zhang, Chuheng
Zhu, Bingqi
Wang, Yongkang
Wang, Xingxing
Wang, Dong
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4560 - 4564
[22] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm
Li, Ning
Tang, Jichuan
Li, Zhong-Xian
Gao, Xiuyu
Structural Control and Health Monitoring, 2022, 29 (10)
[23] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm
Li, Ning
Tang, Jichuan
Li, Zhong-Xian
Gao, Xiuyu
STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (10):
[24] Fault diagnosis method based on hybrid immune learning algorithm
Wang, Cunjie
Zhao, Yuhong
ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2007, : 662 - +
[25] Optimization of Energy Management Algorithm for Hybrid Power Systems Based on Deep Reinforcement Learning
Ban, Lan
STUDIES IN INFORMATICS AND CONTROL, 2024, 33 (02): : 15 - 25
[26] Hierarchical decision algorithm for air combat with hybrid action based on deep reinforcement learning
Li, Zuolong
Zhu, Jihong
Kuang, Minchi
Zhang, Jie
Ren, Jie
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (17):
[27] Reinforcement Learning-Based Hybrid Multi-Objective Optimization Algorithm Design
Palm, Herbert
Arndt, Lorin
INFORMATION, 2023, 14 (05)
[28] Novel best path selection approach based on hybrid improved A* algorithm and reinforcement learning
Xiaohuan Liu
Degan Zhang
Ting Zhang
Yuya Cui
Lu Chen
Si Liu
Applied Intelligence, 2021, 51 : 9015 - 9029
[29] Research on Control Strategy of Hybrid Superconducting Energy Storage Based on Reinforcement Learning Algorithm
Liu, Yang
Han, Xingfan
Xing, Zuoxia
Li, Pengtao
Liu, Hengyu
Jiang, Zhanpeng
IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY, 2024, 34 (08) : 1 - 4
[30] Novel best path selection approach based on hybrid improved A* algorithm and reinforcement learning
Liu, Xiaohuan
Zhang, Degan
Zhang, Ting
Cui, Yuya
Chen, Lu
Liu, Si
APPLIED INTELLIGENCE, 2021, 51 (12) : 9015 - 9029

← 1 2 3 4 5 →