An Object Attribute Guided Framework for Robot Learning Manipulations from Human Demonstration Videos

被引:0
|
作者
Zhang, Qixiang [1 ]
Chen, Junhong [1 ]
Liang, Dayong [1 ]
Liu, Huaping [2 ]
Zhou, Xiaojing [1 ]
Ye, Zihan [1 ]
Liu, Wenyin [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Cobot Vis Lab, Guangzhou 510006, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
关键词
D O I
10.1109/iros40897.2019.8967621
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning manipulations from videos is an inspiriting way for robots to acquire new skills. In this paper, we propose a framework that can generate robotic manipulation plans by observing human demonstration videos without special marks or unnatural demonstrated behaviors. More specifically, the framework contains a video parsing module and a robot execution module. The first module recognizes the demonstrator's actions using two-stream convolution neural networks, and classifies the operated objects by adopting a Mask R-CNN. After that, two XGBoost classifiers are applied to further classify the objects into subject object and patient object respectively, according to the demonstrator's actions. In the second module, a grammar-based parser is used to summarize the videos and generate the common instructions for robot execution. Extensive experiments are conducted on a publicly available video datasets consisting of 273 videos and manifest that our approach is able to learn manipulation plans from demonstration videos with high accuracy (73.36%). Furthermore, we integrate our framework with a humanoid robot Baxter to perform the manipulation learning from demonstration videos, which effectively verifies the performance of our framework.
引用
收藏
页码:6113 / 6119
页数:7
相关论文
共 50 条
  • [1] Robot Learning of Everyday Object Manipulations via Human Demonstration
    Dang, Hao
    Allen, Peter K.
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 1284 - 1289
  • [2] Two-stream 2D/3D Residual Networks for Learning Robot Manipulations from Human Demonstration Videos
    Xu, Xin
    Qian, Kun
    Zhou, Bo
    Chen, Shenghao
    Li, Yitong
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 3353 - 3358
  • [3] A Human-Robot Collaboration Framework Based on Human Collaboration Demonstration and Robot Learning
    Peng, Xiang
    Jiang, Jingang
    Xia, Zeyang
    Xiong, Jing
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT VII, 2025, 15207 : 286 - 299
  • [4] Semantic learning from keyframe demonstration using object attribute constraints
    Sen, Busra
    Elfring, Jos
    Torta, Elena
    van de Molengraft, Rene
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [5] An Intuitive Robot Learning from Human Demonstration
    Ogenyi, Uchenna Emeoha
    Zhang, Gongyue
    Yang, Chenguang
    Ju, Zhaojie
    Liu, Honghai
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2018), PT I, 2018, 10984 : 176 - 185
  • [6] Learning Under-specified Object Manipulations From Human Demonstrations
    Qian, Kun
    Xu, Jun
    Gao, Ge
    Fang, Fang
    Ma, Xudong
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 1936 - 1941
  • [7] A Human-Robot Collaboration Method Using a Pose Estimation Network for Robot Learning of Assembly Manipulation Trajectories From Demonstration Videos
    Deng, Xinjian
    Liu, Jianhua
    Gong, Honghui
    Gong, Hao
    Huang, Jiayu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 7160 - 7168
  • [8] An Ergo-Interactive Framework for Human-Robot Collaboration Via Learning From Demonstration
    Liao, Zhiwei
    Lorenzini, Marta
    Leonori, Mattia
    Zhao, Fei
    Jiang, Gedong
    Ajoudani, Arash
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 359 - 366
  • [9] Learning cooperative dynamic manipulation skills from human demonstration videos
    Iodice, Francesco
    Wu, Yuqiang
    Kim, Wansoo
    Zhao, Fei
    De Momi, Elena
    Ajoudani, Arash
    MECHATRONICS, 2022, 85
  • [10] A Joint Learning Framework for Attribute Models and Object Descriptions
    Mahajan, Dhruv
    Sellamanickam, Sundararajan
    Nair, Vinod
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 1227 - 1234