Dynamics Learning With Object-Centric Interaction Networks for Robot Manipulation

被引:7
|
作者
Wang, Jiayu [1 ,2 ]
Hu, Chuxiong [1 ,2 ]
Wang, Yunan [1 ,2 ]
Zhu, Yu [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Mech Engn, State Key Lab Tribol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Beijing Key Lab Precis Ultraprecis Mfg Equipments, Beijing 100084, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
北京市自然科学基金;
关键词
Task analysis; Robots; Predictive models; Visualization; Data models; Artificial neural networks; Shape; Deep learning in robotic manipulation; model learning; representation learning; visual learning;
D O I
10.1109/ACCESS.2021.3077117
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding the physical interactions of objects with environments is critical for multi-object robotic manipulation tasks. A predictive dynamics model can predict the future states of manipulated objects, which is used to plan plausible actions that enable the objects to achieve desired goal states. However, most current approaches on dynamics learning from high-dimensional visual observations have limitations. These methods either rely on a large amount of real-world data or build a model with a fixed number of objects, which makes them difficult to generalize to unseen objects. This paper proposes a Deep Object-centric Interaction Network (DOIN) which encodes object-centric representations for multiple objects from raw RGB images and reasons about the future trajectory for each object in latent space. The proposed model is trained only on large amounts of random interaction data collected in simulation. The learned model combined with a model predictive control framework enables a robot to search action sequences that manipulate objects to the desired configurations. The proposed method is evaluated both in simulation and real-world experiments on multi-object pushing tasks. Extensive simulation experiments show that DOIN can achieve high prediction accuracy in different scenes with different numbers of objects and outperform state-of-the-art baselines in the manipulation tasks. Real-world experiments demonstrate that the model trained on simulated data can be transferred to the real robot and can successfully perform multi-object pushing tasks for previously-unseen objects with significant variations in shape and size.
引用
收藏
页码:68277 / 68288
页数:12
相关论文
共 50 条
  • [21] OBJECT-CENTRIC VIDEO PREDICTION VIA DECOUPLING OF OBJECT DYNAMICS AND INTERACTIONS
    Villar-Corrales, Angel
    Wahdan, Ismail
    Behnke, Sven
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 570 - 574
  • [22] Object-Centric Representation Learning for Video Scene Understanding
    Zhou, Yi
    Zhang, Hui
    Park, Seung-In
    Yoo, ByungIn
    Qi, Xiaojuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8410 - 8423
  • [23] Self-supervised Object-Centric Learning for Videos
    Aydemir, Gorkay
    Xie, Weidi
    Guney, Fatma
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [24] Object-Centric Multiple Object Tracking
    Zhao, Zixu
    Wang, Jiaze
    Horn, Max
    Ding, Yizhuo
    He, Tong
    Bai, Zechen
    Zietlow, Dominik
    Simon-Gabriel, Carl-Johann
    Shuai, Bing
    Tu, Zhuowen
    Brox, Thomas
    Schiele, Bernt
    Fu, Yanwei
    Locatello, Francesco
    Zhang, Zheng
    Xiao, Tianjun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16555 - 16565
  • [25] Language-Mediated, Object-Centric Representation Learning
    Wang, Ruocheng
    Mao, Jiayuan
    Gershman, Samuel J.
    Wu, Jiajun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2033 - 2046
  • [26] Object-Centric Representation Learning from Unlabeled Videos
    Gao, Ruohan
    Jayaraman, Dinesh
    Grauman, Kristen
    COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 248 - 263
  • [27] Object-Centric Representation Learning for Video Question Answering
    Long Hoang Dang
    Thao Minh Le
    Vuong Le
    Truyen Tran
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [28] Learning Dexterous Grasping with Object-Centric Visual Affordances
    Mandikal, Priyanka
    Grauman, Kristen
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 6169 - 6176
  • [29] An Object-centric Interaction Framework for Tangible Interfaces in Virtual Environments
    Israel, Johann Habakuk
    Belaifa, Oliver
    Gispen, Adrienne
    Stark, Rainer
    TEI 2011: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON TANGIBLE EMBEDDED AND EMBODIED INTERACTION, 2011, : 325 - 332
  • [30] Object-Centric Slot Diffusion
    Jiang, Jindong
    Deng, Fei
    Singh, Gautam
    Ahn, Sungjin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,