A substructure transfer reinforcement learning method based on metric learning

被引:0
|
作者
Chai, Peihua [1 ,2 ]
Chen, Bilian [1 ,2 ]
Zeng, Yifeng [3 ]
Yu, Shenbao [4 ]
机构
[1] Xiamen Univ, Sch Aerosp Engn, Dept Automat, Xiamen 361005, Peoples R China
[2] Xiamen Key Lab Big Data Intelligent Anal & Decis M, Xiamen 361005, Peoples R China
[3] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne NE1 8ST, England
[4] Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
Transfer learning; Reinforcement learning; Distance measure; Markov decision process;
D O I
10.1016/j.neucom.2024.128071
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transfer reinforcement learning has gained significant traction in recent years as a critical research area, focusing on bolstering agents' decision-making prowess by harnessing insights from analogous tasks. The primary transfer learning method involves identifying the appropriate source domains, sharing specific knowledge structures and subsequently transferring the shared knowledge to novel tasks. However, existing transfer methods exhibit a pronounced dependency on high task similarity and an abundance of source data. Consequently, we attempt to formulate a more efficacious approach that optimally exploits the previous learning experiences to direct an agent's exploration as it learns new tasks. Specifically, we introduce a novel transfer learning paradigm rooted within the distance measure in the Markov chain, denoted as Distance Measure Substructure Transfer Reinforcement Learning (DMS-TRL). The core idea involves partitioning the Markov chain into the most basic small Markov units, which contain basic information about the agent's transfer between two states, and then followed by employing a new distance measure technique to find the most similar structure, which is also the most suitable for transfer. Finally, we propose a policy transfer method to transfer knowledge through the Q table from the selected Markov unit to the target task. Through a series of experiments conducted on discrete Gridworld scenarios, we compare our approach with state-of-the-art learning methods. The results clearly illustrate that DMS-TRL can adeptly identify optimal policy in target tasks, exhibiting swifter convergence.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Reinforcement learning based metric filtering for evolutionary distance metric learning
    Ali, Bassel
    Moriyama, Koichi
    Kalintha, Wasin
    Numao, Masayuki
    Fukui, Ken-Ichi
    INTELLIGENT DATA ANALYSIS, 2020, 24 (06) : 1345 - 1364
  • [2] A hybrid transfer algorithm for reinforcement learning based on spectral method
    Zhu, M.-Q. (zhumeiqiang@cumt.edu.cn), 1765, Science Press (38):
  • [3] Reinforcement Learning Based on Active Learning Method
    Sagha, Hesam
    Shouraki, Saeed Bagheri
    Khasteh, Hosein
    Kiaei, Ali Akbar
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 598 - +
  • [4] A transfer learning method for electric vehicles charging strategy based on deep reinforcement learning
    Wang, Kang
    Wang, Haixin
    Yang, Zihao
    Feng, Jiawei
    Li, Yanzhen
    Yang, Junyou
    Chen, Zhe
    APPLIED ENERGY, 2023, 343
  • [5] Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning
    Chalmers, Eric
    Contreras, Edgar Bermudez
    Robertson, Brandon
    Luczak, Artur
    Gruber, Aaron
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2259 - 2270
  • [6] A Transfer Metric Learning Method for Spammer Detection
    Chen, Hao
    Liu, Jun
    Lv, Yanzhang
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 174 - 180
  • [7] Robust transfer learning based on Geometric Mean Metric Learning
    Zhao, Peng
    Wu, Tao
    Zhao, Shiyi
    Liu, Huiting
    KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [8] Review of Metric Learning with Transfer Learning
    Pan, Jiajun
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [9] An Improved Reinforcement Learning Method Based on Unsupervised Learning
    Chang, Xin
    Li, Yanbin
    Zhang, Guanjie
    Liu, Donghui
    Fu, Changjun
    IEEE ACCESS, 2024, 12 : 12295 - 12307
  • [10] Knowledge Reasoning Method Based on Deep Transfer Reinforcement Learning: DTRLpath
    Lin, Shiming
    Ye, Ling
    Zhuang, Yijie
    Lu, Lingyun
    Zheng, Shaoqiu
    Huang, Chenxi
    Kwee, Ng Yin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 299 - 317