Shaped Policy Search for Evolutionary Strategies using Waypoints

被引:0
|
作者
Lekkala, Kiran [1 ]
Itti, Laurent [2 ]
机构
[1] Univ Southern Calif, ILab, Dept Comp Sci, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, ILab, Dept Comp Sci Psychol & NGP, Los Angeles, CA 90089 USA
关键词
D O I
10.1109/ICRA48506.2021.9561607
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we try to improve exploration in Blackbox methods, particularly Evolution strategies (ES), when applied to Reinforcement Learning (RI.) problems where intermediate waypoints/subgoals are available. Since Evolutionary strategies are highly parallelizable, instead of extracting just a scalar cumulative reward, we use the state-action pairs from the trajectories obtained during rollouts/evaluations, to learn the dynamics of the agent. The learnt dynamics are then used in the optimization procedure to speed-up training. Lastly, we show how our proposed approach is universally applicable by presenting results from experiments conducted on Carla driving and UR5 robotic arm simulators.
引用
收藏
页码:9093 / 9100
页数:8
相关论文
共 50 条
  • [31] Optimization of Road Networks Using Evolutionary Strategies
    Schweitzer, Frank
    Ebeling, Werner
    Rose, Helge
    Weiss, Olaf
    EVOLUTIONARY COMPUTATION, 1997, 5 (04) : 419 - 438
  • [32] RELIABILITY OF TLP TETHERS USING EVOLUTIONARY STRATEGIES
    Barranco Cicilia, Federico
    Vazquez Hernandez, Alberto Omar
    OMAE 2008: PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON OFFSHORE MECHANICS AND ARCTIC ENGINEERING - 2008, VOL 2, 2008, : 1001 - 1007
  • [33] Hybridization of cognitive models using evolutionary strategies
    Romero Lopez, Oscar J.
    de Antonio Jimenez, Angelica
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 3213 - 3218
  • [34] Evolutionary Strategies of Supply Chain Finance From the Perspective of a Return Policy
    Zhang, Baojian
    Ye, Yang
    Yue, Xiaohang
    IEEE ACCESS, 2019, 7 : 110761 - 110769
  • [35] Stabilization of nonholonomic system using evolutionary strategies
    Vargas, Hector
    Alexandrov, Vladimir
    Zanella, Vittorio
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4499 - +
  • [36] Estimation of harmonic states using evolutionary strategies
    Arruda, E.F.
    Kagan, N.
    Controle y Automacao, 2009, 20 (02): : 177 - 191
  • [37] Optical fibre design using evolutionary strategies
    Manos, S
    Poladian, L
    ENGINEERING COMPUTATIONS, 2004, 21 (5-6) : 564 - 576
  • [38] Optimal Strategies for Multi Objective Games and Their Search by Evolutionary Multi Objective Optimization
    Avigad, G.
    Eisenstadt, E.
    Cohen, M. Weiss
    2011 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2011, : 166 - 173
  • [39] Proving Theorems by Using Evolutionary Search with Human Involvement
    Huang, Szu-Yi
    Chen, Ying-ping
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 1495 - 1502
  • [40] Using symmetry and evolutionary search to minimize sorting networks
    Valsalam, Vinod K.
    Miikkulainen, Risto
    Valsalam, V.K. (VKV@CS.UTEXAS.EDU), 1600, Microtome Publishing (14): : 303 - 331