Solving Robotic Manipulation With Sparse Reward Reinforcement Learning Via Graph-Based Diversity and Proximity

Cited by: 20

Authors:

Bing, Zhenshan [1]

Zhou, Hongkuan [1]

Li, Rui [2]

Su, Xiaojie [2]

Morin, Fabrice O. [1]

Huang, Kai [3]

Knoll, Alois [1]

Affiliations:

[1] Tech Univ Munich, Dept Informat, D-80333 Munich, Germany

[2] Chongqing Univ, Sch Automat, Chongqing 400044, Peoples R China

[3] Sun Yat sen Univ, Sch Comp Sci, Guangzhou 510275, Peoples R China

Source:

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS | 2023 / Vol. 70 / No. 03

Funding:

European Union Horizon 2020;

Keywords:

Hindsight experience replay (HER); path planning; reinforcement learning; robotic arm manipulation;

DOI:

10.1109/TIE.2022.3172754

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In multigoal reinforcement learning (RL), algorithms usually struggle to collect successful experiences efficiently in tasks with sparse rewards. By combining hindsight experience relabeling with curriculum learning, prior works such as hindsight experience replay (HER), hindsight goal generation (HGG), graph-based HGG (G-HGG), and curriculum-guided HER (CHER) have greatly improved sample efficiency in robotic manipulation tasks. However, none of these can efficiently learn to solve challenging manipulation tasks with distant goals and obstacles, since they rely on either heuristic or simple distance-guided exploration. In this article, we introduce graph-curriculum-guided HGG (GC-HGG), an extension of CHER and G-HGG that selects hindsight goals on the basis of graph-based proximity and diversity. We evaluated GC-HGG on four challenging manipulation tasks involving obstacles, in both simulation and real-world experiments, and it showed significant improvements in both sample efficiency and overall success rate over prior works. Videos and code are available at https://videoviewsite.wixsite.com/gc-hgg.
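To make the idea of graph-based proximity and diversity concrete, the following is a minimal illustrative sketch, not the authors' GC-HGG implementation. It assumes a 2-D occupancy grid, breadth-first search as the graph distance, and a simple greedy score; the names `grid_shortest_paths`, `select_hindsight_goals`, and `diversity_weight` are hypothetical and do not come from the paper.

```python
from collections import deque
import math


def grid_shortest_paths(free, start):
    """BFS on a 4-connected grid: maps each reachable free cell to its hop count from start."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (r + dr, c + dc)
            if nxt in free and nxt not in dist:
                dist[nxt] = dist[(r, c)] + 1
                queue.append(nxt)
    return dist


def select_hindsight_goals(candidates, target, free, k, diversity_weight=0.5):
    """Greedily pick up to k candidates (assumed scoring rule, for illustration only).

    Proximity: a shorter graph distance to the target scores higher.
    Diversity: a larger graph distance to the nearest already-selected goal scores higher.
    """
    to_target = grid_shortest_paths(free, target)
    remaining = [g for g in candidates if g in to_target]  # drop unreachable goals
    selected, dist_from = [], {}

    def score(g):
        proximity = -to_target[g]
        diversity = min((dist_from[s][g] for s in selected), default=0)
        return proximity + diversity_weight * diversity

    while remaining and len(selected) < k:
        best = max(remaining, key=score)
        remaining.remove(best)
        selected.append(best)
        dist_from[best] = grid_shortest_paths(free, best)
    return selected


# A 5x5 workspace with a wall in column 2 that is open only in the bottom row.
wall = {(r, 2) for r in range(4)}
free = {(r, c) for r in range(5) for c in range(5)} - wall
target = (0, 4)

to_target = grid_shortest_paths(free, target)
for goal in [(0, 1), (3, 4)]:
    # Both goals are equally far from the target in Euclidean terms,
    # but the wall makes their graph distances very different.
    print(goal, "euclidean:", math.dist(goal, target), "graph:", to_target[goal])

print(select_hindsight_goals([(0, 1), (3, 4), (2, 4), (4, 0)], target, free, k=2))
```

In this toy layout, a goal behind the wall looks as close to the target as an unobstructed one under Euclidean distance, which is the kind of case where simple distance-guided exploration can mislead. Raising `diversity_weight` pushes the selection toward goals that are spread out over the graph instead of clustered near the target.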


Pages: 2759-2769

Number of pages: 11
