Detecting Target Objects by Natural Language Instructions Using an RGB-D Camera

被引:2
|
作者
Bao, Jiatong [1 ]
Jia, Yunyi [2 ]
Cheng, Yu [3 ]
Tang, Hongru [1 ]
Xi, Ning [3 ]
机构
[1] Yangzhou Univ, Dept Hydraul Energy & Power Engn, Yangzhou 225127, Jiangsu, Peoples R China
[2] Clemson Univ, Dept Automot Engn, Greenville, SC 29607 USA
[3] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
来源
SENSORS | 2016年 / 16卷 / 12期
关键词
object grounding; target object detection; object recognition; natural language processing; natural language control; robotic manipulation system; FRAMEWORK;
D O I
10.3390/s16122117
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Controlling robots by natural language (NL) is increasingly attracting attention for its versatility, convenience and no need of extensive training for users. Grounding is a crucial challenge of this problem to enable robots to understand NL instructions from humans. This paper mainly explores the object grounding problem and concretely studies how to detect target objects by the NL instructions using an RGB-D camera in robotic manipulation applications. In particular, a simple yet robust vision algorithm is applied to segment objects of interest. With the metric information of all segmented objects, the object attributes and relations between objects are further extracted. The NL instructions that incorporate multiple cues for object specifications are parsed into domain-specific annotations. The annotations from NL and extracted information from the RGB-D camera are matched in a computational state estimation framework to search all possible object grounding states. The final grounding is accomplished by selecting the states which have the maximum probabilities. An RGB-D scene dataset associated with different groups of NL instructions based on different cognition levels of the robot are collected. Quantitative evaluations on the dataset illustrate the advantages of the proposed method. The experiments of NL controlled object manipulation and NL-based task programming using a mobile manipulator show its effectiveness and practicability in robotic applications.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Pallet recognition and localization using an RGB-D camera
    Xiao, Junhao
    Lu, Huimin
    Zhang, Lilian
    Zhang, Jianhua
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2017, 14 (06):
  • [22] Recognition of Overlapped Objects using RGB-D Sensor
    Yukitoh, Mitsuhiro
    Oka, Takaaki
    Morimoto, Masakazu
    2017 6TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION & 2017 7TH INTERNATIONAL SYMPOSIUM IN COMPUTATIONAL MEDICAL AND HEALTH TECHNOLOGY (ICIEV-ISCMHT), 2017,
  • [23] Visualization of Temperature Change Using RGB-D Camera and Thermal Camera
    Nakagawa, Wataru
    Matsumoto, Kazuki
    de Sorbier, Francois
    Sugimoto, Maki
    Saito, Hideo
    Senda, Shuji
    Shibata, Takashi
    Iketani, Akihiko
    COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 386 - 400
  • [24] DETECTING WALKABLE PLANE AREAS BY USING RGB-D CAMERA AND ACCELEROMETER FOR VISUALLY IMPAIRED PEOPLE
    Imai, Kenta
    Kitahara, Itaru
    Kameda, Yoshinari
    2017 3DTV CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON), 2017,
  • [25] Real-Time Moving Objects Segmentation based on RGB-D camera
    Zhu, Rui
    Zhao, Yongjia
    2018 IEEE CSAA GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2018,
  • [26] Wound measurement by RGB-D camera
    Filko, Damir
    Cupec, Robert
    Nyarko, Emmanuel Karlo
    MACHINE VISION AND APPLICATIONS, 2018, 29 (04) : 633 - 654
  • [27] Wound measurement by RGB-D camera
    Damir Filko
    Robert Cupec
    Emmanuel Karlo Nyarko
    Machine Vision and Applications, 2018, 29 : 633 - 654
  • [28] Asymmetric Deeply Fused Network for Detecting Salient Objects in RGB-D Images
    Liu, Chang
    Zhou, Wujie
    Chen, Yuzhen
    Lei, Jingsheng
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 1620 - 1624
  • [29] 3D Visual SLAM using RGB-D Camera
    Krerngkamjornkit, Rapee
    Simic, Milan
    SMART DIGITAL FUTURES 2014, 2014, 262 : 533 - 544
  • [30] 3-D Mapping and Localization using RGB-D Camera
    Pudchuen, Noppadol
    Jitviriya, Wisanu
    2019 FIRST INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION, CONTROL, ARTIFICIAL INTELLIGENCE, AND ROBOTICS (ICA-SYMP 2019), 2019, : 139 - 142