A Reinforcement Learning Method for Continuous Domains Using Artificial Hydrocarbon Networks

Authors
Ponce, Hiram [1]
Gonzalez-Mora, Guillermo [1]
Martinez-Villasenor, Lourdes [1]
Affiliations
[1] Univ Panamer, Fac Ingn, Augusto Rodin 498, Ciudad De Mexico 03920, Mexico
Keywords
reinforcement learning; artificial hydrocarbon networks; artificial organic networks; continuous domain; policy search;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Reinforcement learning with continuous states and actions has received limited study, owing to difficulties in determining the transition function and the poor performance of continuous-to-discrete relaxations, among other issues. Yet real-world problems, such as those in robotics, require these methods for learning complex tasks. In this paper, we therefore propose a method for reinforcement learning with continuous states and actions using a model-based approach learned with artificial hydrocarbon networks (AHN). The proposed method models the dynamics of the continuous task with the supervised AHN method. Initial random rollouts, followed by data collected during policy evaluation, improve the training of the AHN-based dynamics model. Preliminary results on the well-known mountain car task show that artificial hydrocarbon networks can contribute to model-based approaches in continuous RL problems in both estimation efficiency (a root-mean-square error of 0.0012) and sub-optimal policy convergence (reached in 357 steps), in just 5 trials over a parameter space $\theta \in \mathbb{R}^{86}$. Data from experimental results are available at: http://sites.google.com/up.edu.mx/reinforcement learning/.
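The first stage described in the abstract (collecting random rollouts, fitting a one-step dynamics model by supervised learning, and scoring it by root-mean-square error) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the simulator constants follow a commonly used continuous mountain car formulation rather than the paper's setup, the model is an ordinary least-squares regressor standing in for the AHN (which is not implemented here), and the features are hand-picked to suit this simulator. The function names are hypothetical.

```python
import numpy as np

# Assumed constants of a commonly used continuous mountain car formulation.
POWER, GRAVITY = 0.0015, 0.0025


def step(state, action):
    """One step of an assumed mountain car simulator (not taken from the paper)."""
    pos, vel = state
    force = np.clip(action, -1.0, 1.0)
    vel = np.clip(vel + POWER * force - GRAVITY * np.cos(3.0 * pos), -0.07, 0.07)
    pos = np.clip(pos + vel, -1.2, 0.6)
    if pos <= -1.2 and vel < 0.0:
        vel = 0.0  # the car stops at the left wall
    return np.array([pos, vel])


def random_rollouts(n_rollouts, horizon, rng):
    """Collect inputs (pos, vel, action) and targets (state change) under random actions."""
    X, Y = [], []
    for _ in range(n_rollouts):
        s = np.array([rng.uniform(-0.6, -0.4), 0.0])
        for _ in range(horizon):
            a = rng.uniform(-1.0, 1.0)
            s_next = step(s, a)
            X.append([s[0], s[1], a])
            Y.append(s_next - s)
            s = s_next
    return np.array(X), np.array(Y)


def features(X):
    """Hand-picked features for the stand-in regressor (an AHN would learn its own)."""
    pos, vel, act = X[:, 0], X[:, 1], X[:, 2]
    return np.column_stack([np.ones(len(X)), pos, vel, act, np.cos(3.0 * pos)])


def fit_dynamics(X, Y):
    """Least-squares fit of the one-step state change; a stand-in for AHN training."""
    W, *_ = np.linalg.lstsq(features(X), Y, rcond=None)
    return W


def predict(W, X):
    return features(X) @ W


def rmse(pred, target):
    """Root-mean-square error over all samples and state dimensions."""
    return float(np.sqrt(np.mean((pred - target) ** 2)))


rng = np.random.default_rng(0)
X_train, Y_train = random_rollouts(5, 200, rng)   # initial random rollouts
W = fit_dynamics(X_train, Y_train)                # supervised dynamics model
X_test, Y_test = random_rollouts(2, 200, rng)     # held-out rollouts
model_rmse = rmse(predict(W, X_test), Y_test)
print(f"one-step RMSE on held-out rollouts: {model_rmse:.2e}")
```

In the method as the abstract describes it, data gathered while evaluating the policy would then be appended to the training set and the model refit; that loop and the policy search itself are omitted from this sketch.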
Pages: 398 - 403
Number of pages: 6
Related papers
50 in total
  • [1] A Novel Artificial Hydrocarbon Networks Based Value Function Approximation in Hierarchical Reinforcement Learning
    Ponce, Hiram
    ADVANCES IN SOFT COMPUTING, MICAI 2016, PT II, 2017, 10062 : 211 - 225
  • [2] A multi-agent fuzzy-reinforcement learning method for continuous domains
    Duman, E
    Kaya, M
    Akin, E
    MULTI-AGENT SYSTEMS AND APPLICATIONS IV, PROCEEDINGS, 2005, 3690 : 306 - 315
  • [3] Versatility of Artificial Hydrocarbon Networks for Supervised Learning
    Ponce, Hiram
    Martinez-Villasenor, Ma Lourdes
    ADVANCES IN SOFT COMPUTING, MICAI 2017, PT I, 2018, 10632 : 3 - 16
  • [4] EANT+KALMAN: An Efficient Reinforcement Learning Method for Continuous State Partially Observable Domains
    Kassahun, Yohannes
    de Gea, Jose
    Metzen, Jan Hendrik
    Edgington, Mark
    Kirchner, Frank
    KI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5243 : 241 - +
  • [5] Building Adaptive Tutoring Model using Artificial Neural Networks and Reinforcement Learning
    Fenza, Giuseppe
    Orciuoli, Francesco
    Sampson, Demetrios G.
    2017 IEEE 17TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT), 2017, : 460 - 462
  • [6] A Comparative Analysis of Evolutionary Learning in Artificial Hydrocarbon Networks
    Ponce, Hiram
    Souza, Paulo
    ADVANCES IN SOFT COMPUTING, MICAI 2020, PT I, 2020, 12468 : 223 - 234
  • [7] Modeling a System for Monitoring an Object Using Artificial Neural Networks and Reinforcement Learning
    Peixoto, H. M.
    Diniz, A. A. R.
    Almeida, N. C.
    de Melo, J. D.
    Doria Neto, A. D.
    Guerreiro, A. M. G.
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2327 - 2332
  • [8] Automatic chemical process control using reinforcement learning in artificial neural networks
    Hoskins, J.C.
    Himmelblau, D.M.
    Neural Networks, 1988, 1 (1 SUPPL)
  • [9] Graph based skill acquisition and transfer Learning for continuous reinforcement learning domains
    Shoeleh, Farzaneh
    Asadpour, Masoud
    PATTERN RECOGNITION LETTERS, 2017, 87 : 104 - 116
  • [10] Skill based transfer learning with domain adaptation for continuous reinforcement learning domains
    Farzaneh Shoeleh
    Masoud Asadpour
    Applied Intelligence, 2020, 50 : 502 - 518