Composable Instructions and Prospection Guided Visuomotor Control for Robotic Manipulation

Cited by: 2
Authors
Shao, Quanquan [1]
Hu, Jie [1]
Wang, Weiming [1]
Fang, Yi [1]
Han, Mingshuo [1]
Qi, Jin [1]
Ma, Jin [1]
Affiliations
[1] Shanghai Jiao Tong University, School of Mechanical Engineering, Institute of Knowledge Based Engineering, 800 Dongchuan Road, Shanghai 200240, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Composable instructions; Motion generation; Prospection; Imitation learning; Visuomotor control; Robotic manipulation
DOI
10.2991/ijcis.d.191017.001
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
End-to-end visuomotor control based on deep neural networks has recently become an active topic in robotic manipulation. In this framework, a one-hot vector is often used to specify the task in multi-task settings. However, a one-hot vector is an inflexible way to describe multiple tasks and to convey human intentions. This paper proposes a framework that combines composable instructions with visuomotor control for multi-task problems. The framework consists mainly of two modules: a variational autoencoder (VAE) network and a long short-term memory (LSTM) network. The VAE encodes perception information about the environment into a small latent space. The LSTM module combines this embedded perception information with the composable instructions to guide robot motion according to different intentions. Prospection is also used to learn the purposes of the instructions: the network predicts not only the next action but also a sequence of future actions at the same time. To evaluate the framework, a series of experiments is conducted in pick-and-place scenarios. On new tasks, the framework achieves a success rate of 91.2%, which indicates good generalization ability. © 2019 The Authors. Published by Atlantis Press SARL.
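The abstract does not give the instruction encoding or the prospection targets in detail, so the following is only a minimal Python sketch of the two ideas under stated assumptions. The vocabularies `OBJECTS` and `TARGETS`, the function names, and the choice to repeat the last action at the end of a trajectory are all hypothetical and are not taken from the paper. The sketch contrasts a single one-hot task vector (one slot per object/target pair) with a composable instruction (one sub-vector per field, concatenated), and shows how targets containing a window of future actions might be built.

```python
from typing import List, Sequence

# Hypothetical instruction fields; the abstract does not list the real ones.
OBJECTS = ["red_cube", "green_cube", "blue_cube"]
TARGETS = ["left_bin", "right_bin"]


def one_hot(index: int, size: int) -> List[float]:
    """Return a vector of length `size` with a single 1.0 at `index`."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def task_one_hot(obj: str, target: str) -> List[float]:
    """One slot per (object, target) pair: length grows multiplicatively."""
    index = OBJECTS.index(obj) * len(TARGETS) + TARGETS.index(target)
    return one_hot(index, len(OBJECTS) * len(TARGETS))


def compose_instruction(obj: str, target: str) -> List[float]:
    """One sub-vector per field, concatenated: length grows additively."""
    return (one_hot(OBJECTS.index(obj), len(OBJECTS))
            + one_hot(TARGETS.index(target), len(TARGETS)))


def prospection_targets(actions: Sequence[Sequence[float]],
                        horizon: int) -> List[List[float]]:
    """For each step t, flatten actions t .. t+horizon-1 into one target.

    Assumption: past the end of the trajectory the last action is repeated.
    """
    last = len(actions) - 1
    targets = []
    for t in range(len(actions)):
        window: List[float] = []
        for k in range(horizon):
            window.extend(actions[min(t + k, last)])
        targets.append(window)
    return targets
```

With these hypothetical vocabularies, the one-hot task vector needs 3 × 2 = 6 slots while the composed instruction needs 3 + 2 = 5, and a new object/target pairing reuses sub-vectors that were already seen in other combinations.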
Pages: 1221-1231
Page count: 11
Journal
International Journal of Computational Intelligence Systems, 2019, vol. 12