Dynamics Learning with Object-Centric Interaction Networks for Robot Manipulation

被引:0
|
作者
Wang, Jiayu [1 ,2 ]
Hu, Chuxiong [1 ,2 ]
Wang, Yunan [1 ,2 ]
Zhu, Yu [1 ,2 ]
机构
[1] Department of Mechanical Engineering, State Key Laboratory of Tribology, Tsinghua University, Beijing, China
[2] Beijing Key Laboratory of Precision, Ultra-Precision Manufacturing Equipments and Control, Tsinghua University, Beijing,100084, China
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Understanding the physical interactions of objects with environments is critical for multi-object robotic manipulation tasks. A predictive dynamics model can predict the future states of manipulated objects, which is used to plan plausible actions that enable the objects to achieve desired goal states. However, most current approaches on dynamics learning from high-dimensional visual observations have limitations. These methods either rely on a large amount of real-world data or build a model with a fixed number of objects, which makes them difficult to generalize to unseen objects. This paper proposes a Deep Object-centric Interaction Network (DOIN) which encodes object-centric representations for multiple objects from raw RGB images and reasons about the future trajectory for each object in latent space. The proposed model is trained only on large amounts of random interaction data collected in simulation. The learned model combined with a model predictive control framework enables a robot to search action sequences that manipulate objects to the desired configurations. The proposed method is evaluated both in simulation and real-world experiments on multi-object pushing tasks. Extensive simulation experiments show that DOIN can achieve high prediction accuracy in different scenes with different numbers of objects and outperform state-of-the-art baselines in the manipulation tasks. Real-world experiments demonstrate that the model trained on simulated data can be transferred to the real robot and can successfully perform multi-object pushing tasks for previously-unseen objects with significant variations in shape and size. © 2013 IEEE.
引用
收藏
页码:68277 / 68288
相关论文
共 50 条
  • [31] Object-Centric Street Scene Synthesis with Generative Adversarial Networks
    Van den Abeele, Maxim
    Neven, Davy
    De Brabandere, Bert
    Proesmans, Marc
    Van Gool, Luc
    20TH IEEE MEDITERRANEAN ELETROTECHNICAL CONFERENCE (IEEE MELECON 2020), 2020, : 665 - 671
  • [32] Floating Waste Discovery by Request via Object-Centric Learning
    Fu, Bingfei
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1407 - 1424
  • [33] ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation
    Li, Xiaoqi
    Zhang, Mingxu
    Geng, Yiran
    Geng, Haoran
    Long, Yuxing
    Shen, Yan
    Zhang, Renrui
    Liu, Jiaming
    Dong, Hao
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 18061 - 18070
  • [34] Data-efficient learning of object-centric grasp preferences
    Fleytoux, Yoann
    Ma, Anji
    Ivaldi, Serena
    Mouret, Jean-Baptiste
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6337 - 6343
  • [35] Learning object-centric complementary features for zero-shot learning
    Liu, Jie
    Song, Kechen
    He, Yu
    Dong, Hongwen
    Yan, Yunhui
    Meng, Qinggang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 89
  • [36] Object-centric Learning with Cyclic Walks between Parts and Whole
    Wang, Ziyu
    Shou, Mike Zheng
    Zhang, Mengmi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [37] Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos
    Singh, Gautam
    Wu, Yi-Fu
    Ahn, Sungjin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [38] Unsupervised Object-Centric Learning From Multiple Unspecified Viewpoints
    Yuan, Jinyang
    Chen, Tonglin
    Shen, Zhimeng
    Li, Bin
    Xue, Xiangyang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 3897 - 3909
  • [39] Learning to Follow Object-Centric Image Editing Instructions Faithfully
    Chakrabarty, Tuhin
    Singh, Kanishk
    Saakyan, Arkadiy
    Muresan, Smaranda
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 9630 - 9646
  • [40] Object-Centric Predictive Process Monitoring
    Gherissi, Wissam
    El Haddad, Joyce
    Grigori, Daniela
    SERVICE-ORIENTED COMPUTING - ICSOC 2022 WORKSHOPS, 2023, 13821 : 27 - 39