Dynamics Learning With Object-Centric Interaction Networks for Robot Manipulation

被引:7
|
作者
Wang, Jiayu [1 ,2 ]
Hu, Chuxiong [1 ,2 ]
Wang, Yunan [1 ,2 ]
Zhu, Yu [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Mech Engn, State Key Lab Tribol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Beijing Key Lab Precis Ultraprecis Mfg Equipments, Beijing 100084, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
北京市自然科学基金;
关键词
Task analysis; Robots; Predictive models; Visualization; Data models; Artificial neural networks; Shape; Deep learning in robotic manipulation; model learning; representation learning; visual learning;
D O I
10.1109/ACCESS.2021.3077117
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding the physical interactions of objects with environments is critical for multi-object robotic manipulation tasks. A predictive dynamics model can predict the future states of manipulated objects, which is used to plan plausible actions that enable the objects to achieve desired goal states. However, most current approaches on dynamics learning from high-dimensional visual observations have limitations. These methods either rely on a large amount of real-world data or build a model with a fixed number of objects, which makes them difficult to generalize to unseen objects. This paper proposes a Deep Object-centric Interaction Network (DOIN) which encodes object-centric representations for multiple objects from raw RGB images and reasons about the future trajectory for each object in latent space. The proposed model is trained only on large amounts of random interaction data collected in simulation. The learned model combined with a model predictive control framework enables a robot to search action sequences that manipulate objects to the desired configurations. The proposed method is evaluated both in simulation and real-world experiments on multi-object pushing tasks. Extensive simulation experiments show that DOIN can achieve high prediction accuracy in different scenes with different numbers of objects and outperform state-of-the-art baselines in the manipulation tasks. Real-world experiments demonstrate that the model trained on simulated data can be transferred to the real robot and can successfully perform multi-object pushing tasks for previously-unseen objects with significant variations in shape and size.
引用
收藏
页码:68277 / 68288
页数:12
相关论文
共 50 条
  • [41] Recasting Generic Pretrained Vision Transformers As Object-Centric Scene Encoders For Manipulation Policies
    Qian, Jianing
    Panagopoulos, Anastasios
    Jayaraman, Dinesh
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 17544 - 17552
  • [42] Learning Object-Centric Dynamic Modes from Video and Emerging Properties
    Comas, Armand
    Fernandez-Lopez, Christian
    Ghimire, Sandesh
    Li, Haolin
    Sznaier, Mario
    Camps, Octavia
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [43] Learning global object-centric representations via disentangled slot attention
    Chen, Tonglin
    Huang, Yinxuan
    Shen, Zhimeng
    Huang, Jinghao
    Li, Bin
    Xue, Xiangyang
    MACHINE LEARNING, 2025, 114 (02)
  • [44] OPerA: Object-Centric Performance Analysis
    Park, Gyunam
    Adams, Jan Niklas
    van der Aalst, Wil M. P.
    CONCEPTUAL MODELING (ER 2022), 2022, 13607 : 281 - 292
  • [45] OCπ: Object-Centric Process Insights
    Adams, Jan Niklas
    van der Aalst, Wil M. P.
    APPLICATION AND THEORY OF PETRI NETS AND CONCURRENCY (PETRI NETS 2022), 2022, 13288 : 139 - 150
  • [46] Object-Centric Unsupervised Image Captioning
    Meng, Zihang
    Yang, David
    Cao, Xuefei
    Shah, Ashish
    Lim, Ser-Nam
    COMPUTER VISION, ECCV 2022, PT XXXVI, 2022, 13696 : 219 - 235
  • [47] Discovering Object-centric Petri Nets
    van der Aalst, Wil M. P.
    Berti, Alessandro
    FUNDAMENTA INFORMATICAE, 2020, 175 (1-4) : 1 - 40
  • [48] Discovery of Object-Centric Declarative Models
    Christfort, Axel K. F.
    Rivkin, Audrey
    Fahland, Dirk
    Hildebrandt, Thomas T.
    Slaats, Tijs
    2024 6TH INTERNATIONAL CONFERENCE ON PROCESS MINING, ICPM, 2024, : 137 - 144
  • [49] Permission Analysis for Object-Centric Processes
    Breitmayer, Marius
    Arnold, Lisa
    Reichert, Manfred
    INTELLIGENT INFORMATION SYSTEMS, CAISE FORUM 2024, 2024, 520 : 11 - 19
  • [50] Representation learning from videos in-the-wild: An object-centric approach
    Romijnders, Rob
    Mahendran, Aravindh
    Tschannen, Michael
    Djolonga, Josip
    Ritter, Marvin
    Houlsby, Neil
    Lucic, Mario
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 177 - 187