A Reinforcement Learning Method for Continuous Domains Using Artificial Hydrocarbon Networks

被引:0
|
作者
Ponce, Hiram [1 ]
Gonzalez-Mora, Guillermo [1 ]
Martinez-Villasenor, Lourdes [1 ]
机构
[1] Univ Panamer, Fac Ingn, Augusto Rodin 498, Ciudad De Mexico 03920, Mexico
关键词
reinforcement learning; artificial hydrocarbon networks; artificial organic networks; continuous domain; policy search;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning in continuous states and actions has been limitedly studied in ocassions given difficulties in the determination of the transition function, lack of performance in continuous-to-discrete relaxation problems, among others. For instance, real-world problems, e.g. robotics, require these methods for learning complex tasks. Thus, in this paper, we propose a method for reinforcement learning with continuous states and actions using a model-based approach learned with artificial hydrocarbon networks (AHN). The proposed method considers modeling the dynamics of the continuous task with the supervised AHN method. Initial random rollouts and posterior data collection from policy evaluation improve the training of the AHN-based dynamics model. Preliminary results over the well-known mountain car task showed that artificial hydrocarbon networks can contribute to model-based approaches in continuous RL problems in both estimation efficiency (0.0012 in root mean squared-error) and sub-optimal policy convergence (reached in 357 steps), in just 5 trials over a parameter space theta is an element of R-86. Data from experimental results are available at: http://sites. google.com/up.edu.mx/reinforcement learning/.
引用
收藏
页码:398 / 403
页数:6
相关论文
共 50 条
  • [41] Two Steps Reinforcement Learning in Continuous Reinforcement Learning Tasks
    Lopez-Bueno, Ivan
    Garcia, Javier
    Fernandez, Fernando
    BIO-INSPIRED SYSTEMS: COMPUTATIONAL AND AMBIENT INTELLIGENCE, PT 1, 2009, 5517 : 577 - 584
  • [42] A Logic Optimization Method Using Reinforcement Learning
    Cai, Yuting
    Wu, Yue
    Yang, Xiaoyan
    Chu, Zhufei
    2024 INTERNATIONAL SYMPOSIUM OF ELECTRONICS DESIGN AUTOMATION, ISEDA 2024, 2024, : 312 - 317
  • [43] Study of Lightweighting Method Using Reinforcement Learning
    Harada, Yoshihiro
    Yata, Noriko
    Manabe, Yoshitsugu
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [44] Reinforcement learning with artificial microswimmers
    Muiños-Landin S.
    Fischer A.
    Holubec V.
    Cichos F.
    Science Robotics, 2020, 6 (52):
  • [45] Reinforcement learning and artificial agency
    Butlin, Patrick
    MIND & LANGUAGE, 2024, 39 (01) : 22 - 38
  • [46] Reinforcement learning with artificial microswimmers
    Muinos-Landin, S.
    Fischer, A.
    Holubec, V.
    Cichos, F.
    SCIENCE ROBOTICS, 2021, 6 (52)
  • [47] Artificial Conversational Agent using Robust Adversarial Reinforcement Learning
    Wadekar, Isha
    2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [48] Using Deep Reinforcement Learning for Routing in IP Networks
    Singh, Abhiram
    Sharma, Sidharth
    Gumaste, Ashwin
    30TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2021), 2021,
  • [49] Opinion Shaping in Social Networks Using Reinforcement Learning
    Borkar, Vivek S.
    Reiffers-Masson, Alexandre
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (03): : 1305 - 1316
  • [50] Workflow scheduling using Neural Networks and Reinforcement Learning
    Melnik, Mikhail
    Nasonov, Denis
    8TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE ON COMPUTATIONAL SCIENCE, YSC2019, 2019, 156 : 29 - 36