Shaped Policy Search for Evolutionary Strategies using Waypoints

被引:0
|
作者
Lekkala, Kiran [1 ]
Itti, Laurent [2 ]
机构
[1] Univ Southern Calif, ILab, Dept Comp Sci, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, ILab, Dept Comp Sci Psychol & NGP, Los Angeles, CA 90089 USA
关键词
D O I
10.1109/ICRA48506.2021.9561607
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we try to improve exploration in Blackbox methods, particularly Evolution strategies (ES), when applied to Reinforcement Learning (RI.) problems where intermediate waypoints/subgoals are available. Since Evolutionary strategies are highly parallelizable, instead of extracting just a scalar cumulative reward, we use the state-action pairs from the trajectories obtained during rollouts/evaluations, to learn the dynamics of the agent. The learnt dynamics are then used in the optimization procedure to speed-up training. Lastly, we show how our proposed approach is universally applicable by presenting results from experiments conducted on Carla driving and UR5 robotic arm simulators.
引用
收藏
页码:9093 / 9100
页数:8
相关论文
共 50 条
  • [1] Evolutionary strategies in environmental policy
    Ring, I
    ECOLOGICAL ECONOMICS, 1997, 23 (03) : 237 - 249
  • [2] Designing a value based niche search engine using evolutionary strategies
    Sengupta, S
    Jansen, BJ
    ITCC 2005: International Conference on Information Technology: Coding and Computing, Vol 1, 2005, : 800 - 805
  • [3] RF Signal Source Search and Localization Using an Autonomous UAV with Predefined Waypoints
    Kwon, Hyeokjun
    Guvenc, Ismail
    2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
  • [4] Evolution Strategies for Direct Policy Search
    Heidrich-Meisner, Verena
    Igel, Christian
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN X, PROCEEDINGS, 2008, 5199 : 428 - 437
  • [5] Bioeconomy and the Common Agricultural Policy: will a strategy in search of policies meet a policy in search of strategies?
    Viaggi, Davide
    BIO-BASED AND APPLIED ECONOMICS, 2018, 7 (02): : 179 - 190
  • [6] Evolutionary computing strategies for preliminary design search and exploration
    Parmee, IC
    OPTIMAIZATION IN INDUSTRY, 2002, : 215 - 227
  • [7] Generalized Early Stopping in Evolutionary Direct Policy Search
    Arza, Etor
    Le Goff, Léni K.
    Hart, Emma
    ACM Transactions on Evolutionary Learning and Optimization, 2024, 4 (03):
  • [8] Quality with Just Enough Diversity in Evolutionary Policy Search
    Templier, Paul
    Grillotti, Luca
    Rachelson, Emmanuel
    Wilson, Dennis G.
    Cully, Antoine
    PROCEEDINGS OF THE 2024 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2024, 2024, : 105 - 113
  • [9] Evolutionary and Principled Search Strategies for Sensornet Protocol Optimization
    Tate, Jonathan
    Woolford-Lim, Benjamin
    Bate, Iain
    Yao, Xin
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (01): : 163 - 180
  • [10] Combining Evolutionary and Sequential Search Strategies for Unsupervised Feature Selection
    Klepaczko, Artur
    Materka, Andrzej
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II, 2010, 6114 : 149 - 156