An Object Attribute Guided Framework for Robot Learning Manipulations from Human Demonstration Videos

被引:0
|
作者
Zhang, Qixiang [1 ]
Chen, Junhong [1 ]
Liang, Dayong [1 ]
Liu, Huaping [2 ]
Zhou, Xiaojing [1 ]
Ye, Zihan [1 ]
Liu, Wenyin [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Cobot Vis Lab, Guangzhou 510006, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
关键词
D O I
10.1109/iros40897.2019.8967621
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning manipulations from videos is an inspiriting way for robots to acquire new skills. In this paper, we propose a framework that can generate robotic manipulation plans by observing human demonstration videos without special marks or unnatural demonstrated behaviors. More specifically, the framework contains a video parsing module and a robot execution module. The first module recognizes the demonstrator's actions using two-stream convolution neural networks, and classifies the operated objects by adopting a Mask R-CNN. After that, two XGBoost classifiers are applied to further classify the objects into subject object and patient object respectively, according to the demonstrator's actions. In the second module, a grammar-based parser is used to summarize the videos and generate the common instructions for robot execution. Extensive experiments are conducted on a publicly available video datasets consisting of 273 videos and manifest that our approach is able to learn manipulation plans from demonstration videos with high accuracy (73.36%). Furthermore, we integrate our framework with a humanoid robot Baxter to perform the manipulation learning from demonstration videos, which effectively verifies the performance of our framework.
引用
收藏
页码:6113 / 6119
页数:7
相关论文
共 50 条
  • [41] Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback
    Cui, Yuchen
    Zhang, Qiping
    Jain, Sahil
    Allievi, Alessandro
    Stone, Peter
    Niekum, Scott
    Knox, W. Bradley
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 16017 - 16019
  • [42] Learning to Imitate Object Interactions from Internet Videos
    Patel, Austin
    Wang, Andrew
    Radosavovic, Ilija
    Malik, Jitendra
    arXiv, 2022,
  • [43] Unsupervised Learning of Object Structure and Dynamics from Videos
    Minderer, Matthias
    Sun, Chen
    Villegas, Ruben
    Cole, Forrester
    Murphy, Kevin
    Lee, Honglak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [44] Human–robot skill transmission for mobile robot via learning by demonstration
    Jiehao Li
    Junzheng Wang
    Shoukun Wang
    Chenguang Yang
    Neural Computing and Applications, 2023, 35 : 23441 - 23451
  • [45] Fuzzy dynamical system for robot learning motion skills from human demonstration
    Teng, Tao
    Gatti, Matteo
    Poni, Stefano
    Caldwell, Darwin
    Chen, Fei
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 164
  • [46] Exploring Implicit Human Responses to Robot Mistakes in a Learning from Demonstration Task
    Hayes, Cory J.
    Moosaei, Maryam
    Riek, Laurel D.
    2016 25TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2016, : 246 - 252
  • [47] An Interactive Framework for Learning User-Object Associations Through Human-Robot Interaction
    Papakostas, Michalis
    Tsiakas, Konstantinos
    Parde, Natalie
    Karkaletsis, Vangelis
    Makedon, Fillia
    8TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2015), 2015,
  • [48] ROBOT LEARNING FROM HUMAN DEMONSTRATION OF ACTIVITIES OF DAILY LIVING (ADL) TASKS
    Trivedi, Urvish
    Alqasemi, Redwan
    Dubey, Rajiv
    PROCEEDINGS OF ASME 2021 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION (IMECE2021), VOL 6, 2021,
  • [49] Learning from Demonstration Facilitates Human-Robot Collaborative Task Execution
    Koskinopoulou, Maria
    Piperakis, Stylimos
    Frahanias, Panos
    ELEVENTH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN ROBOT INTERACTION (HRI'16), 2016, : 59 - 66
  • [50] Communication and knowledge sharing in human-robot interaction and learning from demonstration
    Koenig, Nathan
    Takayama, Leila
    Mataric, Maja
    NEURAL NETWORKS, 2010, 23 (8-9) : 1104 - 1112