FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion

Cited by: 128

Authors:

Sun, Yuxiang [1]

Zuo, Weixun [1]

Yun, Peng [2]

Wang, Hengli [1]

Liu, Ming [1]

Affiliations:

[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China

[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2021 / Vol. 18 / No. 03

Funding:

National Natural Science Foundation of China;

Keywords:

Semantics; Image segmentation; Cameras; Lighting; Laser radar; Data integration; Autonomous driving; information fusion; semantic segmentation; thermal images; urban scenes; DYNAMIC ENVIRONMENTS; MOTION REMOVAL; D SLAM; POINT; NETWORK;

DOI:

10.1109/TASE.2020.2993143

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Semantic segmentation of urban scenes is an essential component in various applications of autonomous driving. It makes great progress with the rise of deep learning technologies. Most of the current semantic segmentation networks use single-modal sensory data, which are usually the RGB images produced by visible cameras. However, the segmentation performance of these networks is prone to be degraded when lighting conditions are not satisfied, such as dim light or darkness. We find that thermal images produced by thermal imaging cameras are robust to challenging lighting conditions. Therefore, in this article, we propose a novel RGB and thermal data fusion network named FuseSeg to achieve superior performance of semantic segmentation in urban scenes. The experimental results demonstrate that our network outperforms the state-of-the-art networks.

Note to Practitioners: This article investigates the problem of semantic segmentation of urban scenes when lighting conditions are not satisfied. We provide a solution to this problem via information fusion with RGB and thermal data. We build an end-to-end deep neural network, which takes as input a pair of RGB and thermal images and outputs pixel-wise semantic labels. Our network could be used for urban scene understanding, which serves as a fundamental component of many autonomous driving tasks, such as environment modeling, obstacle avoidance, motion prediction, and planning. Moreover, the simple design of our network allows it to be easily implemented using various deep learning frameworks, which facilitates the applications on different hardware or software platforms.

Pages: 1000-1011

Page count: 12

50 items in total

[41] Semantic Segmentation of Urban Scenes Using Dense Depth Maps
Zhang, Chenxi
Wang, Liang
Yang, Ruigang
COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 : 708 - 721
[42] Semantic segmentation of urban scenes by learning local class interactions
Volpi, Michele
Ferrari, Vittorio
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
[43] A Curriculum Domain Adaptation Approach to the Semantic Segmentation of Urban Scenes
Zhang, Yang
David, Philip
Foroosh, Hassan
Gong, Boqing
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (08) : 1823 - 1841
[44] Real-Time Flame Segmentation based on RGB-Thermal Fusion
Guo, Shuaihao
Hu, Biao
Huang, Ran
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 1435 - 1440
[45] Open-Vocabulary RGB-Thermal Semantic Segmentation
Zhao, Guoqiang
Huang, Junjie
Yan, Xiaoyun
Wang, Zhaojing
Tang, Junwei
Ou, Yangjun
Hu, Xinrong
Peng, Tao
COMPUTER VISION - ECCV 2024, PT LXXIV, 2025, 15132 : 304 - 320
[46] A Lightweight RGB-T Fusion Network for Practical Semantic Segmentation
Zhang, Haoyuan
Li, Zifeng
Wu, Zhenyu
Wang, Danwei
2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4233 - 4238
[47] Semantic segmentation of manipulator grasping scene with fusion of RGB and depth information
Ding, Jiyuan
Mo, Yanlang
Xiong, Xiaogang
INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2020, 2020, 11574
[48] Based on cross-scale fusion attention mechanism network for semantic segmentation for street scenes
Ye, Xin
Gao, Lang
Chen, Jichen
Lei, Mingyue
FRONTIERS IN NEUROROBOTICS, 2023, 17
[49] CGFNet: cross-guided fusion network for RGB-thermal semantic segmentation
Fu, Yanping
Chen, Qiaoqiao
Zhao, Haifeng
VISUAL COMPUTER, 2022, 38 (9-10) : 3243 - 3252
[50] SFAF-MA: Spatial Feature Aggregation and Fusion With Modality Adaptation for RGB-Thermal Semantic Segmentation
He, Xunjie
Wang, Meiling
Liu, Tong
Zhao, Lin
Yue, Yufeng
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
