FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion

被引:128
|
作者
Sun, Yuxiang [1 ]
Zuo, Weixun [1 ]
Yun, Peng [2 ]
Wang, Hengli [1 ]
Liu, Ming [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Image segmentation; Cameras; Lighting; Laser radar; Data integration; Autonomous driving; information fusion; semantic segmentation; thermal images; urban scenes; DYNAMIC ENVIRONMENTS; MOTION REMOVAL; D SLAM; POINT; NETWORK;
D O I
10.1109/TASE.2020.2993143
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation of urban scenes is an essential component in various applications of autonomous driving. It makes great progress with the rise of deep learning technologies. Most of the current semantic segmentation networks use single-modal sensory data, which are usually the RGB images produced by visible cameras. However, the segmentation performance of these networks is prone to be degraded when lighting conditions are not satisfied, such as dim light or darkness. We find that thermal images produced by thermal imaging cameras are robust to challenging lighting conditions. Therefore, in this article, we propose a novel RGB and thermal data fusion network named FuseSeg to achieve superior performance of semantic segmentation in urban scenes. The experimental results demonstrate that our network outperforms the state-of-the-art networks. Note to Practitioners-This article investigates the problem of semantic segmentation of urban scenes when lighting conditions are not satisfied. We provide a solution to this problem via information fusion with RGB and thermal data. We build an end-to-end deep neural network, which takes as input a pair of RGB and thermal images and outputs pixel-wise semantic labels. Our network could be used for urban scene understanding, which serves as a fundamental component of many autonomous driving tasks, such as environment modeling, obstacle avoidance, motion prediction, and planning. Moreover, the simple design of our network allows it to be easily implemented using various deep learning frameworks, which facilitates the applications on different hardware or software platforms.
引用
收藏
页码:1000 / 1011
页数:12
相关论文
共 50 条
  • [21] Unsupervised Domain Adaptation for Semantic Segmentation of Urban Scenes
    Biasetton, Matteo
    Michieli, Umberto
    Agresti, Gianluca
    Zanuttigh, Pietro
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1211 - 1220
  • [22] Improving semantic segmentation in urban scenes with a cartographic information
    Loukkal, Abdelhak
    Fremont, Vincent
    Grandvalet, Yves
    Li, You
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 400 - 406
  • [23] Semantic Segmentation of Urban Scenes Using Spatial Contexts
    Wang, Jeonghyeon
    Kim, Jinwhan
    IEEE ACCESS, 2020, 8 : 55254 - 55268
  • [24] DHFNet: dual-decoding hierarchical fusion network for RGB-thermal semantic segmentation
    Cai, Yuqi
    Zhou, Wujie
    Zhang, Liting
    Yu, Lu
    Luo, Ting
    VISUAL COMPUTER, 2024, 40 (01): : 169 - 179
  • [25] UTFNet: Uncertainty-Guided Trustworthy Fusion Network for RGB-Thermal Semantic Segmentation
    Wang, Qingwang
    Yin, Cheng
    Song, Haochen
    Shen, Tao
    Gu, Yanfeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [26] HAFFseg: RGB-Thermal semantic segmentation network with hybrid adaptive feature fusion strategy
    Yi, Shi
    Chen, Mengting
    Liu, Xi
    Li, Junjie
    Chen, Ling
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 117
  • [27] DHFNet: dual-decoding hierarchical fusion network for RGB-thermal semantic segmentation
    Yuqi Cai
    Wujie Zhou
    Liting Zhang
    Lu Yu
    Ting Luo
    The Visual Computer, 2024, 40 : 169 - 179
  • [28] CNN based Semantic Segmentation for Urban Traffic Scenes using Fisheye Camera
    Deng, Liuyuan
    Yang, Ming
    Qian, Yeqiang
    Wang, Chunxiang
    Wang, Bing
    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 231 - 236
  • [29] Transformer fusion for indoor RGB-D semantic segmentation
    Wu, Zongwei
    Zhou, Zhuyun
    Allibert, Guillaume
    Stolz, Christophe
    Demonceaux, Cedric
    Ma, Chao
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [30] An RGB-D Fusion Based Semantic Segmentation Algorithm Based on Neighborhood Metric Relations
    Zhang J.
    Chen Y.
    Zhu S.
    Li Y.
    Jiqiren/Robot, 2023, 45 (02): : 156 - 165