Adaptive Model Learning method for Reinforcement Learning

被引:0
|
作者
Hwang, Kao-Shing [1 ]
Jiang, Wei-Cheng [2 ]
Chen, Yu-Jen [2 ]
机构
[1] Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
[2] Natl Chung Cheng Univ, Dept Elect Engn, Chiayi, Taiwan
关键词
adaptive model learning method; Dyna-Q agent; Reinforcement learning;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The original Q-learning method is difficult on achieving sample efficiency such as training a policy to get to a goal with in limited time step. So, the Dyna-Q agent is proposed to speed up the policy learning. However, the Dyna-Q did not specify how to build the model, so the table is used to be the model largely. In this paper, we proposed an adaptive model learning method based on tree structures and combined with Q-Learning to form Tree-Based Dyna-Q agent to enhance the policy learning. When the tree-based model learns an accurate model, a planning method can use the model to produce simulated experiences to accelerate value iterations. Thus, the agent with the proposed method can obtain virtual experiences for updating the policy. The simulation result shows that training time of our method can improve obviously.
引用
收藏
页码:1277 / 1280
页数:4
相关论文
共 50 条
  • [21] An Adaptive Model-Free Control Method for Metro Train Based on Deep Reinforcement Learning
    Lai, Wenzhu
    Chen, Dewang
    Huang, Yunhu
    Huang, Benzun
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 263 - 273
  • [22] A Reinforcement Learning based Adaptive Supervisor for Multiple Model Adaptive Estimation and Control
    Renwick, Zachary
    Tilbury, Dawn
    Atkins, Ella
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 216 - 221
  • [23] Preferential Exploration Method of Transfer Learning for Reinforcement Learning in Same Transition Model
    Takano, Toshiaki
    Takase, Haruhiko
    Kawanaka, Hiroharu
    Tsuruoka, Shinji
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 2099 - 2103
  • [24] Reinforcement Learning Method with Internal World Model Training
    Hirata, Kenji
    Iizuka, Hiroyuki
    Yamamoto, Masahito
    2020 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2020, : 201 - 204
  • [25] An Integrated Design Method using Reinforcement Learning Model
    Hao, Yingjun
    Wan, Shaosong
    Cao, Jian
    Yan, Cong
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND COMPUTING TECHNOLOGY, 2015, 30 : 1077 - 1080
  • [26] Reinforcement learning method based on sample regularization and adaptive learning rate for AGV path planning
    Nie, Jun
    Zhang, Guihua
    Lu, Xiao
    Wang, Haixia
    Sheng, Chunyang
    Sun, Lijie
    NEUROCOMPUTING, 2025, 614
  • [27] Reinforcement Learning Based on Active Learning Method
    Sagha, Hesam
    Shouraki, Saeed Bagheri
    Khasteh, Hosein
    Kiaei, Ali Akbar
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 598 - +
  • [28] The Method of Learning Personal Preference with Reinforcement Learning
    Park, Yong Hee
    Choi, Won Seok
    Choi, Seong Gon
    2022 24TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ARITIFLCIAL INTELLIGENCE TECHNOLOGIES TOWARD CYBERSECURITY, 2022, : 73 - +
  • [29] Computerized Adaptive Testing Method Based on Reinforcement Learning for Series Diagnosis
    Liu Z.
    Wu J.
    Yao F.
    Liu Q.
    Chen E.
    Sha J.
    Wang S.
    Su Y.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (01): : 13 - 26
  • [30] An Adaptive Strategy Selection Method With Reinforcement Learning for Robotic Soccer Games
    Shi, Haobin
    Lin, Zhiqiang
    Hwang, Kao-Shing
    Yang, Shike
    Chen, Jialin
    IEEE ACCESS, 2018, 6 : 8376 - 8386