Solving Robotic Manipulation With Sparse Reward Reinforcement Learning Via Graph-Based Diversity and Proximity

被引:20
|
作者
Bing, Zhenshan [1 ]
Zhou, Hongkuan [1 ]
Li, Rui [2 ]
Su, Xiaojie [2 ]
Morin, Fabrice O. [1 ]
Huang, Kai [3 ]
Knoll, Alois [1 ]
机构
[1] Tech Univ Munich, Dept Informat, D-80333 Munich, Germany
[2] Chongqing Univ, Sch Automat, Chongqing 400044, Peoples R China
[3] Sun Yat sen Univ, Sch Comp Sci, Guangzhou 510275, Peoples R China
基金
欧盟地平线“2020”;
关键词
Hindsight experience replay (HER); path planning; reinforcement learning; robotic arm manipulation;
D O I
10.1109/TIE.2022.3172754
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In multigoal reinforcement learning (RL), algorithms usually suffer from inefficiency in the collection of successful experiences in tasks with sparse rewards. By utilizing the ideas of relabeling hindsight experience and curriculum learning, some prior works have greatly improved the sample efficiency in robotic manipulation tasks, such as hindsight experience replay (HER), hindsight goal generation (HGG), graph-based HGG (G-HGG), and curriculum-guided HER (CHER). However, none of these can learn efficiently to solve challenging manipulation tasks with distant goals and obstacles, since they rely either on heuristic or simple distance-guided exploration. In this article, we introduce graph-curriculum-guided HGG (GC-HGG), an extension of CHER and G-HGG, which works by selecting hindsight goals on the basis of graph-based proximity and diversity. We evaluated GC-HGG in four challenging manipulation tasks involving obstacles in both simulations and real-world experiments, in which significant enhancements in both sample efficiency and overall success rates over prior works were demonstrated. Videos and codes can be viewed at this link: https://videoviewsite.wixsite.com/gc-hgg.
引用
收藏
页码:2759 / 2769
页数:11
相关论文
共 50 条
  • [1] Learning to Decode: Reinforcement Learning for Decoding of Sparse Graph-Based Channel Codes
    Habib, Salman
    Beemer, Allison
    Kliewer, Jorg
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation
    Bing, Zhenshan
    Brucker, Matthias
    Morin, Fabrice O.
    Li, Rui
    Su, Xiaojie
    Huang, Kai
    Knoll, Alois
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7863 - 7876
  • [3] Dexterous robotic manipulation using deep reinforcement learning and knowledge transfer for complex sparse reward-based tasks
    Wang, Qiang
    Sanchez, Francisco Roldan
    McCarthy, Robert
    Bulens, David Cordova
    McGuinness, Kevin
    O'Connor, Noel
    Wuthrich, Manuel
    Widmaier, Felix
    Bauer, Stefan
    Redmond, Stephen J.
    EXPERT SYSTEMS, 2023, 40 (06)
  • [4] Curriculum Learning Algorithms for Reward Weighting in Sparse Reward Robotic Manipulation Tasks
    Fele, Benjamin
    Babic, Jan
    IEEE ACCESS, 2025, 13 : 45544 - 45558
  • [5] A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning
    Fu, Qingxu
    Qiu, Tenghai
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning
    Deng, Yang
    Li, Yaliang
    Sun, Fei
    Ding, Bolin
    Lam, Wai
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1431 - 1441
  • [7] Decentralized graph-based multi-agent reinforcement learning using reward machines
    Hu, Jueming
    Xu, Zhe
    Wang, Weichang
    Qu, Guannan
    Pang, Yutian
    Liu, Yongming
    NEUROCOMPUTING, 2024, 564
  • [8] DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning
    Lee, Seungjae
    Kim, Jigang
    Jang, Inkyu
    Kim, H. Jin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [9] Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning
    Li, Yang
    Luo, Xiangfeng
    Xie, Shaorong
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 709 - 713
  • [10] Graph-Based Skill Acquisition For Reinforcement Learning
    Mendonca, Matheus R. F.
    Ziviani, Artur
    Barreto, Andre M. S.
    ACM COMPUTING SURVEYS, 2019, 52 (01)