Goal-driven active learning

Cited by: 0
Authors
Nicolas Bougie
Ryutaro Ichise
Affiliations
[1] The Graduate University for Advanced Studies (Sokendai)
[2] National Institute of Informatics
Keywords
Deep reinforcement learning; Imitation learning; Goal-conditioned learning; Active learning
DOI
Not available
Abstract
Deep reinforcement learning methods have achieved significant successes in complex decision-making problems. However, they traditionally rely on well-designed extrinsic rewards, which limits their applicability to many real-world tasks where rewards are naturally sparse. While cloning behaviors provided by an expert is a promising approach to the exploration problem, learning from a fixed set of demonstrations may be impracticable due to lack of state coverage or distribution mismatch, which arises when the learner's goal deviates from the demonstrated behaviors. In addition, we are interested in learning how to reach a wide range of goals from the same set of demonstrations. In this work, we propose a novel goal-conditioned method that leverages very small sets of goal-driven demonstrations to massively accelerate the learning process. Crucially, we introduce the concept of active goal-driven demonstrations to query the demonstrator only in hard-to-learn and uncertain regions of the state space. We further present a strategy for prioritizing sampling of goals where the disagreement between the expert and the policy is maximized. We evaluate our method on a variety of benchmark environments from the MuJoCo domain. Experimental results show that our method outperforms prior imitation learning approaches in most of the tasks in terms of exploration efficiency and average scores.
Related papers
50 in total
  • [1] Goal-driven active learning
    Bougie, Nicolas
    Ichise, Ryutaro
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2021, 35 (02)
  • [2] Goal-Driven Dimensionality Reduction for Reinforcement Learning
    Parisi, Simone
    Ramstedt, Simon
    Peters, Jan
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 4634 - 4639
  • [3] Goal-driven modeling
    Bock, C
    JOOP-JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 2000, 13 (05): : 48 - +
  • [4] Goal-driven modeling
    Bock, Conrad
    JOOP - Journal of Object-Oriented Programming, 2000, 13 (05): : 48 - 56
  • [5] Goal-Driven Optimization
    Chen, Wenqing
    Sim, Melvyn
    OPERATIONS RESEARCH, 2009, 57 (02) : 342 - 357
  • [6] GOAL-DRIVEN LEARNING - FUNDAMENTAL ISSUES - A SYMPOSIUM REPORT
    LEAKE, D
    RAM, A
    AI MAGAZINE, 1993, 14 (04) : 67 - 72
  • [7] LEARNING, GOALS, AND LEARNING-GOALS - A PERSPECTIVE ON GOAL-DRIVEN LEARNING
    LEAKE, DB
    RAM, A
    ARTIFICIAL INTELLIGENCE REVIEW, 1995, 9 (06) : 387 - 422
  • [8] Goal-Driven Learning in the GILA Integrated Intelligence Architecture
    Radhakrishnan, Jainarayan
    Ontanon, Santiago
    Ram, Ashwin
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1205 - 1210
  • [9] Goal-Driven Dynamics Learning via Bayesian Optimization
    Bansal, Somil
    Calandra, Roberto
    Xiao, Ted
    Levine, Sergey
    Tomlin, Claire J.
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [10] Goal-Driven Atari Environment
    Kim, Myeong Hyeon
    Kim, Dongjae
    Jo, Eunsong
    Lee, Sang Wan
    10TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE (BCI2022), 2022,