Efficient Bimanual Handover and Rearrangement via Symmetry-Aware Actor-Critic Learning

被引:3
|
作者
Li, Yunfei [1 ]
Pan, Chaoyi [2 ]
Xu, Huazhe [1 ,4 ,5 ]
Wang, Xiaolong [3 ]
Wu, Yi [1 ,5 ]
机构
[1] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing, Peoples R China
[2] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[3] Univ Calif San Diego, Dept Elect & Comp Engn, San Diego, CA USA
[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
[5] Shanghai Qi Zhi Inst, Shanghai, Peoples R China
关键词
D O I
10.1109/ICRA48891.2023.10160739
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bimanual manipulation is important for building intelligent robots that unlock richer skills than single arms. We consider a multi-object bimanual rearrangement task, where a reinforcement learning (RL) agent aims to jointly control two arms to rearrange these objects as fast as possible. Solving this task efficiently is challenging for an RL agent due to the requirement of discovering precise intra-arm coordination in an exponentially large control space. We develop a symmetry-aware actor-critic framework that leverages the interchangeable roles of the two manipulators in the bimanual control setting to reduce the policy search space. To handle the compositionality over multiple objects, we augment training data with an object-centric relabeling technique. The overall approach produces an RL policy that can rearrange up to 8 objects with a success rate of over 70% in simulation. We deploy the policy to two Franka Panda arms and further show a successful demo on human-robot collaboration. Videos can be found at https: //sites.google.com/view/bimanual.
引用
收藏
页码:3867 / 3874
页数:8
相关论文
共 50 条
  • [1] Efficient Model Learning Methods for Actor-Critic Control
    Grondman, Ivo
    Vaandrager, Maarten
    Busoniu, Lucian
    Babuska, Robert
    Schuitema, Erik
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (03): : 591 - 602
  • [2] Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
    Abreu, Miguel
    Reis, Luis Paulo
    Lau, Nuno
    NEUROCOMPUTING, 2025, 614
  • [3] Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning
    Zhong, Shan
    Liu, Quan
    Fu, QiMing
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [4] Efficient Actor-critic Algorithm with Dual Piecewise Model Learning
    Zhong, Shan
    Liu, Quan
    Gong, Shengrong
    Fu, Qiming
    Xu, Jin
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 823 - 830
  • [5] Granular computing in actor-critic learning
    Peters, James F.
    2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 59 - 64
  • [6] Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
    Wang, Zhihai
    Wang, Jie
    Zhou, Qi
    Li, Bin
    Li, Houqiang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8612 - 8620
  • [7] Deep Reinforcement Learning in VizDoom via DQN and Actor-Critic Agents
    Bakhanova, Maria
    Makarov, Ilya
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 138 - 150
  • [8] Efficient Sequence Labeling with Actor-Critic Training
    Najafi, Saeed
    Cherry, Colin
    Kondrak, Grzegorz
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 466 - 471
  • [9] Hierarchical Multiagent Formation Control Scheme via Actor-Critic Learning
    Mu, Chaoxu
    Peng, Jiangwen
    Sun, Changyin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8764 - 8777
  • [10] Speed Tracking Control via Online Continuous Actor-Critic learning
    Huang, Zhenhua
    Xu, Xin
    Sun, Zhenping
    Tan, Jun
    Qian, Lilin
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3172 - 3177