Efficient Bimanual Handover and Rearrangement via Symmetry-Aware Actor-Critic Learning

被引：3

作者：

Li, Yunfei ^{[1
]}

Pan, Chaoyi ^{[2
]}

Xu, Huazhe ^{[1
,4
,5
]}

Wang, Xiaolong ^{[3
]}

Wu, Yi ^{[1
,5
]}

机构：

[1] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing, Peoples R China

[2] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China

[3] Univ Calif San Diego, Dept Elect & Comp Engn, San Diego, CA USA

[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China

[5] Shanghai Qi Zhi Inst, Shanghai, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA | 2023年

关键词：

D O I：

10.1109/ICRA48891.2023.10160739

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Bimanual manipulation is important for building intelligent robots that unlock richer skills than single arms. We consider a multi-object bimanual rearrangement task, where a reinforcement learning (RL) agent aims to jointly control two arms to rearrange these objects as fast as possible. Solving this task efficiently is challenging for an RL agent due to the requirement of discovering precise intra-arm coordination in an exponentially large control space. We develop a symmetry-aware actor-critic framework that leverages the interchangeable roles of the two manipulators in the bimanual control setting to reduce the policy search space. To handle the compositionality over multiple objects, we augment training data with an object-centric relabeling technique. The overall approach produces an RL policy that can rearrange up to 8 objects with a success rate of over 70% in simulation. We deploy the policy to two Franka Panda arms and further show a successful demo on human-robot collaboration. Videos can be found at https: //sites.google.com/view/bimanual.

引用

页码：3867 / 3874

页数：8

共 50 条

[1] Efficient Model Learning Methods for Actor-Critic Control
Grondman, Ivo
Vaandrager, Maarten
Busoniu, Lucian
Babuska, Robert
Schuitema, Erik
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602
[2] Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Abreu, Miguel
Reis, Luis Paulo
Lau, Nuno
NEUROCOMPUTING, 2025, 614
[3] Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning
Zhong, Shan
Liu, Quan
Fu, QiMing
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
[4] Efficient Actor-critic Algorithm with Dual Piecewise Model Learning
Zhong, Shan
Liu, Quan
Gong, Shengrong
Fu, Qiming
Xu, Jin
2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 823 - 830
[5] Granular computing in actor-critic learning
Peters, James F.
2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 59 - 64
[6] Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Wang, Zhihai
Wang, Jie
Zhou, Qi
Li, Bin
Li, Houqiang
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8612 - 8620
[7] Deep Reinforcement Learning in VizDoom via DQN and Actor-Critic Agents
Bakhanova, Maria
Makarov, Ilya
ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 138 - 150
[8] Efficient Sequence Labeling with Actor-Critic Training
Najafi, Saeed
Cherry, Colin
Kondrak, Grzegorz
ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 466 - 471
[9] Hierarchical Multiagent Formation Control Scheme via Actor-Critic Learning
Mu, Chaoxu
Peng, Jiangwen
Sun, Changyin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8764 - 8777
[10] Speed Tracking Control via Online Continuous Actor-Critic learning
Huang, Zhenhua
Xu, Xin
Sun, Zhenping
Tan, Jun
Qian, Lilin
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3172 - 3177

← 1 2 3 4 5 →