RGB-D object detection and semantic segmentation for autonomous manipulation in clutter

被引:123
|
作者
Schwarz, Max [1 ]
Milan, Anton [2 ]
Periyasamy, Arul Selvam [1 ]
Behnke, Sven [1 ]
机构
[1] Univ Bonn, Bonn, Germany
[2] Univ Adelaide, Adelaide, SA, Australia
来源
基金
欧盟地平线“2020”;
关键词
Deep learning; object perception; RGB-D camera; transfer learning; object detection; semantic segmentation;
D O I
10.1177/0278364917713117
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Autonomous robotic manipulation in clutter is challenging. A large variety of objects must be perceived in complex scenes, where they are partially occluded and embedded among many distractors, often in restricted spaces. To tackle these challenges, we developed a deep-learning approach that combines object detection and semantic segmentation. The manipulation scenes are captured with RGB-D cameras, for which we developed a depth fusion method. Employing pretrained features makes learning from small annotated robotic datasets possible. We evaluate our approach on two challenging datasets: one captured for the Amazon Picking Challenge 2016, where our team NimbRo came in second in the Stowing and third in the Picking task; and one captured in disaster-response scenarios. The experiments show that object detection and semantic segmentation complement each other and can be combined to yield reliable object perception.
引用
收藏
页码:437 / 451
页数:15
相关论文
共 50 条
  • [31] Non-Local Aggregation for RGB-D Semantic Segmentation
    Zhang, Guodong
    Xue, Jing-Hao
    Xie, Pengwei
    Yang, Sifan
    Wang, Guijin
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 658 - 662
  • [32] Learning Strengths and Weaknesses of Classifiers for RGB-D Semantic Segmentation
    Fooladgar, Fahimeh
    Kasaei, Shohreh
    2015 9TH IRANIAN CONFERENCE ON MACHINE VISION AND IMAGE PROCESSING (MVIP), 2015, : 176 - 179
  • [33] Cascaded Feature Network for Semantic Segmentation of RGB-D Images
    Lin, Di
    Chen, Guangyong
    Cohen-Or, Daniel
    Heng, Pheng-Ann
    Huang, Hui
    Proceedings of the IEEE International Conference on Computer Vision, 2017, 2017-October : 1320 - 1328
  • [34] Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
    Yang, Jun
    Bai, Lizhi
    Sun, Yaoru
    Tian, Chunqi
    Mao, Maoyu
    Wang, Guorun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1481 - 1492
  • [35] Semantic segmentation with Recurrent Neural Networks on RGB-D videos
    Gao, Chuan
    Wang, Weihong
    Chen, Mingxi
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1203 - 1207
  • [36] Evaluation of Multimodal Semantic Segmentation using RGB-D Data
    Hu, Jiesi
    Zhao, Ganning
    You, Suya
    Kuo, C. C. Jay
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS III, 2021, 11746
  • [37] Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
    Seichter, Daniel
    Koehler, Mona
    Lewandowski, Benjamin
    Wengefeld, Tim
    Gross, Horst-Michael
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13525 - 13531
  • [38] A Fusion Network for Semantic Segmentation Using RGB-D Data
    Yuan, Jiahui
    Zhang, Kun
    Xia, Yifan
    Qi, Lin
    Dong, Junyu
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [39] MLBSNet: Mutual Learning and Boosting Segmentation Network for RGB-D Salient Object Detection
    Xia, Chenxing
    Wang, Jingjing
    Ge, Bing
    ELECTRONICS, 2024, 13 (14)
  • [40] ObjectFusion: An object detection and segmentation framework with RGB-D SLAM and convolutional neural networks
    Tian, Guanzhong
    Liu, Liang
    Ri, JongHyok
    Liu, Yong
    Sun, Yiran
    NEUROCOMPUTING, 2019, 345 : 3 - 14