Efficient Bimanual Handover and Rearrangement via Symmetry-Aware Actor-Critic Learning

被引：3

作者：

Li, Yunfei ^{[1
]}

Pan, Chaoyi ^{[2
]}

Xu, Huazhe ^{[1
,4
,5
]}

Wang, Xiaolong ^{[3
]}

Wu, Yi ^{[1
,5
]}

机构：

[1] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing, Peoples R China

[2] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

[3] Univ Calif San Diego, Dept Elect & Comp Engn, San Diego, CA USA

[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China

[5] Shanghai Qi Zhi Inst, Shanghai, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA | 2023年

关键词：

D O I：

10.1109/ICRA48891.2023.10160739

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Bimanual manipulation is important for building intelligent robots that unlock richer skills than single arms. We consider a multi-object bimanual rearrangement task, where a reinforcement learning (RL) agent aims to jointly control two arms to rearrange these objects as fast as possible. Solving this task efficiently is challenging for an RL agent due to the requirement of discovering precise intra-arm coordination in an exponentially large control space. We develop a symmetry-aware actor-critic framework that leverages the interchangeable roles of the two manipulators in the bimanual control setting to reduce the policy search space. To handle the compositionality over multiple objects, we augment training data with an object-centric relabeling technique. The overall approach produces an RL policy that can rearrange up to 8 objects with a success rate of over 70% in simulation. We deploy the policy to two Franka Panda arms and further show a successful demo on human-robot collaboration. Videos can be found at https: //sites.google.com/view/bimanual.

引用

页码：3867 / 3874

页数：8

共 50 条

[11] Learning Locomotion for Quadruped Robots via Distributional Ensemble Actor-Critic
Li, Sicen
Pang, Yiming
Bai, Panju
Li, Jiawei
Liu, Zhaojin
Hu, Shihao
Wang, Liquan
Wang, Gang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1811 - 1818
[12] Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning
Shi, Daming
Guo, Xudong
Liu, Yi
Fan, Wenhui
ENTROPY, 2022, 24 (06)
[13] Multi-actor mechanism for actor-critic reinforcement learning
Li, Lin
Li, Yuze
Wei, Wei
Zhang, Yujia
Liang, Jiye
INFORMATION SCIENCES, 2023, 647
[14] A World Model for Actor-Critic in Reinforcement Learning
Panov, A. I.
Ugadiarov, L. A.
PATTERN RECOGNITION AND IMAGE ANALYSIS, 2023, 33 (03) : 467 - 477
[15] Curious Hierarchical Actor-Critic Reinforcement Learning
Roeder, Frank
Eppe, Manfred
Nguyen, Phuong D. H.
Wermter, Stefan
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 408 - 419
[16] Actor-Critic based Improper Reinforcement Learning
Zaki, Mohammadi
Mohan, Avinash
Gopalan, Aditya
Mannor, Shie
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[17] Integrated Actor-Critic for Deep Reinforcement Learning
Zheng, Jiaohao
Kurt, Mehmet Necip
Wang, Xiaodong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 505 - 518
[18] A fuzzy Actor-Critic reinforcement learning network
Wang, Xue-Song
Cheng, Yu-Hu
Yi, Jian-Qiang
INFORMATION SCIENCES, 2007, 177 (18) : 3764 - 3781
[19] A modified actor-critic reinforcement learning algorithm
Mustapha, SM
Lachiver, G
2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 605 - 609
[20] Research on actor-critic reinforcement learning in RoboCup
Guo, He
Liu, Tianying
Wang, Yuxin
Chen, Feng
Fan, Jianming
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 205 - 205

← 1 2 3 4 5 →