GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep Reinforcement Learning

被引:2
|
作者
Kovac, Grgur [1 ]
Laversanne-Finot, Adrien [1 ]
Oudeyer, Pierre-Yves [1 ]
机构
[1] INRIA Bordeaux, Flowers Lab, F-33400 Talence, France
关键词
Goal exploration; learning progress; reinforcement learning (RL); INTRINSIC MOTIVATION; EXPLORATION; SYSTEMS;
D O I
10.1109/TCDS.2022.3216911
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autotelic reinforcement learning (RL) agents sample their own goals, and try to reach them. They often prioritize goal sampling according to some intrinsic reward, ex. novelty or absolute learning progress (ALPs). Novelty-based approaches work robustly in unsupervised image-based environments when there are no distractors. However, they construct simple curricula that do not take the agent's performance into account: in complex environments, they often get attracted by impossible tasks. ALP-based approaches, which are often combined with a clustering mechanism, construct complex curricula tuned to the agent's current capabilities. Such curricula sample goals on which the agent is currently learning the most, and do not get attracted by impossible tasks. However, ALP approaches have not so far been applied to DRL agents perceiving complex environments directly in the image space. Goal regions guided intrinsically motivated goal exploration process (GRIMGEP), without using any expert knowledge, combines the ALP clustering approaches with novelty-based approaches and extends them to those complex scenarios. We experiment on a rich 3-D image-based environment with distractors using novelty-based exploration approaches: Skewfit and CountBased. We show that wrapping them with GRIMGEP-using them only in the cluster sampled by ALP-creates a better curriculum. The wrapped approaches are attracted less by the distractors, and achieve drastically better performances.
引用
收藏
页码:1396 / 1407
页数:12
相关论文
共 50 条
  • [21] Two-stage visual navigation by deep neural networks and multi-goal reinforcement learning
    Shantia, Amirhossein
    Timmers, Rik
    Chong, Yiebo
    Kuiper, Cornel
    Bidoia, Francesco
    Schomaker, Lambert
    Wiering, Marco
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 138
  • [22] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Eduardo F. Morales
    Rafael Murrieta-Cid
    Israel Becerra
    Marco A. Esquivel-Basaldua
    Intelligent Service Robotics, 2021, 14 : 773 - 805
  • [23] Robust Deep Reinforcement Learning for Traffic Signal Control
    Kai Liang Tan
    Anuj Sharma
    Soumik Sarkar
    Journal of Big Data Analytics in Transportation, 2020, 2 (3): : 263 - 274
  • [24] Robust deep reinforcement learning for personalized HVAC system
    Lim, Se-Heon
    Kim, Tae-Geun
    Yeom, Dongwoo Jason
    Yoon, Sung-Guk
    ENERGY AND BUILDINGS, 2024, 319
  • [25] Robust quadruped jumping via deep reinforcement learning
    Bellegarda, Guillaume
    Nguyen, Chuong
    Nguyen, Quan
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2024, 182
  • [26] Toward robust and scalable deep spiking reinforcement learning
    Akl, Mahmoud
    Ergene, Deniz
    Walter, Florian
    Knoll, Alois
    FRONTIERS IN NEUROROBOTICS, 2023, 16
  • [27] Robust Deep Reinforcement Learning through Adversarial Loss
    Oikarinen, Tuomas
    Zhang, Wang
    Megretski, Alexandre
    Daniel, Luca
    Weng, Tsui-Wei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [28] Deep Robust Reinforcement Learning for Practical Algorithmic Trading
    Li, Yang
    Zheng, Wanshan
    Zheng, Zibin
    IEEE ACCESS, 2019, 7 : 108014 - 108022
  • [29] Goal Recognition as Reinforcement Learning
    Amado, Leonardo
    Mirsky, Reuth
    Meneguzzi, Felipe
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9644 - 9651
  • [30] The Advance of Reinforcement Learning and Deep Reinforcement Learning
    Lyu, Le
    Shen, Yang
    Zhang, Sicheng
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648