Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning

Cited by: 0

Authors:

Ghasemipour, Seyed Kamyar Seyed [1]

Freeman, Daniel [1]

David, Byron [1]

Gu, Shixiang Shane [1]

Kataoka, Satoshi [1]

Mordatch, Igor [1]

Affiliations:

[1] Google Res, Mountain View, CA 94043 USA

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Assembly of multi-part physical structures is both a valuable end product for autonomous robotics, as well as a valuable diagnostic task for open-ended training of embodied intelligent agents. We introduce a naturalistic physics-based environment with a set of connectable magnet blocks inspired by children's toy kits. The objective is to assemble blocks into a succession of target blueprints. Despite the simplicity of this objective, the compositional nature of building diverse blueprints from a set of blocks leads to an explosion of complexity in structures that agents encounter. Furthermore, assembly stresses agents' multi-step planning, physical reasoning, and bimanual coordination. We find that the combination of large-scale reinforcement learning and graph-based policies - surprisingly without any additional complexity - is an effective recipe for training agents that not only generalize to complex unseen blueprints in a zero-shot manner, but even operate in a reset-free setting without being trained to do so. Through extensive experiments, we highlight the importance of large-scale training, structured representations, contributions of multi-task vs. single-task learning, as well as the effects of curriculums, and discuss qualitative behaviors of trained agents. Our accompanying project webpage can be found at: sites.google.com/view/learning-direct-assembly


Pages: 35
