The RGB-D Triathlon: Towards Agile Visual Toolboxes for Robots

被引:0
|
作者
Cermelli, Fabio [1 ]
Mancini, Massimiliano [2 ,3 ,5 ]
Ricci, Elisa [3 ,4 ]
Caputo, Barbara [1 ,5 ]
机构
[1] Politecn Torino, Turin, Italy
[2] Sapienza Univ Rome, Rome, Italy
[3] Fdn Bruno Kessler, Trento, Italy
[4] Univ Trento, Trento, Italy
[5] Italian Inst Technol, Milan, Italy
关键词
D O I
10.1109/iros40897.2019.8968562
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep networks have brought significant advances in robot perception, enabling to improve the capabilities of robots in several visual tasks, ranging from object detection and recognition to pose estimation, semantic scene segmentation and many others. Still, most approaches typically address visual tasks in isolation, resulting in overspecialized models which achieve strong performances in specific applications but work poorly in other (often related) tasks. This is clearly sub-optimal for a robot which is often required to perform simultaneously multiple visual recognition tasks in order to properly act and interact with the environment. This problem is exacerbated by the limited computational and memory resources typically available onboard to a robotic platform. The problem of learning flexible models which can handle multiple tasks in a lightweight manner has recently gained attention in the computer vision community and benchmarks supporting this research have been proposed. In this work we study this problem in the robot vision context, proposing a new benchmark, the RGB-D Triathlon, and evaluating state of the art algorithms in this novel challenging scenario. We also define a new evaluation protocol, better suited to the robot vision setting. Results shed light on the strengths and weaknesses of existing approaches and on open issues, suggesting directions for future research.
引用
收藏
页码:6097 / 6104
页数:8
相关论文
共 50 条
  • [1] ADAPTIVE RGB-D VISUAL ODOMETRY FOR MOBILE ROBOTS: AN EXPERIMENTAL STUDY
    Anderson, J. Wesley
    Fabian, Joshua R.
    Clayton, Garrett M.
    PROCEEDINGS OF THE ASME 8TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2015, VOL 3, 2016,
  • [2] Hybrid Uncalibrated Visual Servoing Control of Harvesting Robots With RGB-D Cameras
    Li, Tao
    Yu, Jinpeng
    Qiu, Quan
    Zhao, Chunjiang
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (03) : 2729 - 2738
  • [3] Visual SLAM with RGB-D Cameras
    Jin, Qiongyao
    Liu, Yungang
    Man, Yongchao
    Li, Fengzhong
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 4072 - 4077
  • [4] Dynamic RGB-D Visual Odometry
    Yang, Dongsheng
    Bi, Shusheng
    Cai, Yueri
    Zheng, Jingxiang
    Yuan, Chang
    2017 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE ROBIO 2017), 2017, : 941 - 946
  • [5] Toward a Unified Framework for RGB and RGB-D Visual Navigation
    Du, Heming
    Huang, Zi
    Chapman, Scott
    Yu, Xin
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT II, 2024, 14472 : 363 - 375
  • [6] Fast RGB-D people tracking for service robots
    Matteo Munaro
    Emanuele Menegatti
    Autonomous Robots, 2014, 37 : 227 - 242
  • [7] Person Following with a RGB-D Camera for Mobile Robots
    Yoon, Youngwoo
    Yoon, Hosub
    Kim, Jaehong
    2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAL), 2012, : 191 - 191
  • [8] EXTRINSIC RGB-D CAMERA CALIBRATION FOR LEGGED ROBOTS
    Hoepflinger, Mark A.
    Remy, C. David
    Hutter, Marco
    Siegwart, Roland
    FIELD ROBOTICS, 2012, : 94 - 101
  • [9] Fast RGB-D people tracking for service robots
    Munaro, Matteo
    Menegatti, Emanuele
    AUTONOMOUS ROBOTS, 2014, 37 (03) : 227 - 242
  • [10] roboSLAM: Dense RGB-D SLAM for Humanoid Robots
    Hourdakis, Emmanouil
    Piperakis, Stylianos
    Trahanias, Panos
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2224 - 2231