Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

被引:0
|
作者
Ghasemipour, Seyed Kamyar Seyed [1 ]
Freeman, Daniel [1 ]
David, Byron [1 ]
Gu, Shixiang Shane [1 ]
Kataoka, Satoshi [1 ]
Mordatch, Igor [1 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Assembly of multi-part physical structures is both a valuable end product for autonomous robotics, as well as a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Despite the simplicity of this objective, the compositional nature of building diverse blueprints from a set of blocks leads to an explosion of complexity in structures that agents encounter. Furthermore, assembly stresses agents' multi-step planning, physical reasoning, and bimanual coordination. We find that the combination of large-scale reinforcement learning and graph-based policies - surprisingly without any additional complexity - is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so. Through extensive experiments, we highlight the importance of largescale training, structured representations, contributions of multi-task vs. single-task learning, as well as the effects of curriculums, and discuss qualitative behaviors of trained agents. Our accompanying project webpage can be found at: sites.google.com/view/learning-direct-assembly
引用
收藏
页数:35
相关论文
共 50 条
  • [41] Large-Scale Machine Learning Cluster Scheduling via Multi-Agent Graph Reinforcement Learning
    Zhao, Xiaoyang
    Wu, Chuan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4962 - 4974
  • [42] Hybrid.AI: A Learning Search Engine for Large-scale Structured Data
    Soderman, Sean
    Kola, Anusha
    Podkorytov, Maksim
    Geyer, Michael
    Gubanov, Michael
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1507 - 1514
  • [43] Weighted mean field reinforcement learning for large-scale UAV swarm confrontation
    Baolai Wang
    Shengang Li
    Xianzhong Gao
    Tao Xie
    Applied Intelligence, 2023, 53 : 5274 - 5289
  • [44] Multi-Objective Distributional Reinforcement Learning for Large-Scale Order Dispatching
    Zhou, Fan
    Lu, Chenfan
    Tang, Xiaocheng
    Zhang, Fan
    Qin, Zhiwei
    Ye, Jieping
    Zhu, Hongtu
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1541 - 1546
  • [45] Cooperative Deep Reinforcement Learning for Large-Scale Traffic Grid Signal Control
    Tan, Tian
    Bao, Feng
    Deng, Yue
    Jin, Alex
    Dai, Qionghai
    Wang, Jie
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2687 - 2700
  • [46] DeepRoute on Chameleon: Experimenting with Large-scale Reinforcement Learning and SDN on Chameleon Testbed
    Mohammed, Bashir
    Kiran, Mariam
    Krishnaswamy, Nandini
    2019 IEEE 27TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (IEEE ICNP), 2019,
  • [47] Weighted mean field reinforcement learning for large-scale UAV swarm confrontation
    Wang, Baolai
    Li, Shengang
    Gao, Xianzhong
    Xie, Tao
    APPLIED INTELLIGENCE, 2023, 53 (05) : 5274 - 5289
  • [48] A multi-swarm optimizer with a reinforcement learning mechanism for large-scale optimization
    Wang, Xujie
    Wang, Feng
    He, Qi
    Guo, Yinan
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 86
  • [49] Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning
    Wang, Xiaoqiang
    Ke, Liangjun
    Qiao, Zhimin
    Chai, Xinghua
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) : 174 - 187
  • [50] Distributed Hierarchical Deep Reinforcement Learning for Large-Scale Grid Emergency Control
    Chen, Yixi
    Zhu, Jizhong
    Liu, Yun
    Zhang, Le
    Zhou, Jialin
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (02) : 4446 - 4458