Dense real-time mapping of object-class semantics from RGB-D video

被引:42
|
作者
Stueckler, Joerg [1 ]
Waldvogel, Benedikt [1 ]
Schulz, Hannes [1 ]
Behnke, Sven [1 ]
机构
[1] Univ Bonn, Comp Sci Insitute 6, Autonomous Intelligent Syst, Bonn, Germany
关键词
Semantic maps; Simultaneous localization and semantic mapping; Object-class segmentation; Random decision forests; MAPS;
D O I
10.1007/s11554-013-0379-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a real-time approach to learn semantic maps from moving RGB-D cameras. Our method models geometry, appearance, and semantic labeling of surfaces. We recover camera pose using simultaneous localization and mapping while concurrently recognizing and segmenting object classes in the images. Our object-class segmentation approach is based on random decision forests and yields a dense probabilistic labeling of each image. We implemented it on GPU to achieve a high frame rate. The probabilistic segmentation is fused in octree-based 3D maps within a Bayesian framework. In this way, image segmentations from various view points are integrated within a 3D map which improves segmentation quality. We evaluate our system on a large benchmark dataset and demonstrate state-of-the-art recognition performance of our object-class segmentation and semantic mapping approaches.
引用
收藏
页码:599 / 609
页数:11
相关论文
共 50 条
  • [31] Dense RGB-D SLAM with Planes Detection and Mapping
    Pan, Liang
    Wang, Pengfei
    Cao, Jiawei
    Chew, Chee-Meng
    45TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2019), 2019, : 5192 - 5197
  • [32] Real-time reconstruction of pipes using RGB-D cameras
    Kim, Dong-Min
    Ahn, JeongHyeon
    Kim, Seung-wook
    Lee, Jongmin
    Kim, Myungho
    Han, JungHyun
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2024, 35 (01)
  • [33] Real-time SLAM algorithm based on RGB-D data
    Fu, Mengyin
    Lü, Xianwei
    Liu, Tong
    Yang, Yi
    Li, Xinghe
    Li, Yu
    Jiqiren/Robot, 2015, 37 (06): : 683 - 692
  • [34] Real-Time RGB-D Activity Prediction by Soft Regression
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Ma, Lianyang
    Wang, Gang
    Lai, Jianhuang
    COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 280 - 296
  • [35] Real-time depth enhancement by fusion for RGB-D cameras
    Garcia, Frederic
    Aouada, Djamila
    Solignac, Thomas
    Mirbach, Bruno
    Ottersten, Bjoern
    IET COMPUTER VISION, 2013, 7 (05) : 335 - 345
  • [36] A Real-time Virtual Dressing System with RGB-D Camera
    Chen, Mingliang
    Lin, Weiyao
    Zhou, Bing
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1041 - 1044
  • [37] A Real-Time Pedestrian Counting System Based on RGB-D
    Yao, Yang
    Zhang, Xu
    Liang, Yu
    Zhang, Xin
    Shen, Furao
    Zhao, Jian
    2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 110 - 117
  • [38] DVSOD: RGB-D Video Salient Object Detection
    Li, Jingjing
    Ji, Wei
    Wang, Size
    Li, Wenbo
    Cheng, Li
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [39] Real-Time Pixel-Wise Grasp Detection Based on RGB-D Feature Dense Fusion
    Wu, Yongxiang
    Fu, Yili
    Wang, Shuguo
    2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2021), 2021, : 970 - 975
  • [40] Object class segmentation of RGB-D video using recurrent convolutional neural networks
    Pavel, Mircea Serban
    Schulz, Hannes
    Behnke, Sven
    NEURAL NETWORKS, 2017, 88 : 105 - 113