Multi-modal unsupervised domain adaptation for semantic image segmentation

Cited by: 15

Authors:

Hu, Sijie [1]

Bonardi, Fabien [1]

Bouchafa, Samia [1]

Sidibe, Desire [1]

Affiliations:

[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France

Source:

PATTERN RECOGNITION | 2023 / Vol. 137

Keywords:

Unsupervised domain adaptation; Multi-modal learning; Self-supervised learning; Knowledge transfer; Semantic segmentation

DOI:

10.1016/j.patcog.2022.109299

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We propose a novel multi-modal Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevant property for providing geometric cues that enhance the RGB representation. However, existing UDA methods either process RGB images alone or additionally cultivate depth-awareness through an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task using color (RGB) information alone. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Adversarial Training (DAT) to bridge the depth discrepancy between the source and target domains. In addition, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature spaces of RGB and depth and to shape the sensitivity of MMADT to depth information. We experimentally observed significant performance improvements on multiple synthetic-to-real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Additionally, our multi-modal UDA scheme is easy to port to other UDA methods, with a consistent performance boost. (c) 2023 Elsevier Ltd. All rights reserved.

Pages: 12

50 entries in total

[31] Semantic Consistent Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
Zeng, Guodong
Lerch, Till D.
Schmaranzer, Florian
Zheng, Guoyan
Burger, Juergen
Gerber, Kate
Tannast, Moritz
Siebenrock, Klaus
Gerber, Nicolas
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 201 - 210
[32] Differentiated Learning for Multi-Modal Domain Adaptation
Lv, Jianming
Liu, Kaijie
He, Shengfeng
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1322 - 1330
[33] Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
Kim, Kyungmin
SENSORS, 2024, 24 (23)
[34] MULTI-MODAL SEMANTIC MESH SEGMENTATION IN URBAN SCENES
Laupheimer, Dominik
Haala, Norbert
XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 267 - 274
[35] A Multi-task Unsupervised Domain Adaptation Network for Medical Image Segmentation
Shi, Yuejing
Zhu, Fan
Peng, Yan
Ye, Zhen
Zhou, Chaozheng
INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND INTELLIGENT CONTROL (IPIC 2021), 2021, 11928
[36] Multi-modal brain tumor segmentation via conditional synthesis with Fourier domain adaptation
Al Khalil, Yasmina
Ayaz, Aymen
Lorenz, Cristian
Weese, Juergen
Pluim, Josien
Breeuwer, Marcel
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 112
[37] Unpaired multi-modal tumor segmentation with structure adaptation
Zhou, Pei
Chen, Houjin
Li, Yanfeng
Peng, Yahui
APPLIED INTELLIGENCE, 2023, 53 (04) : 3639 - 3651
[38] Unsupervised Multi-modal Style Transfer for Cardiac MR Segmentation
Chen, Chen
Ouyang, Cheng
Tarroni, Giacomo
Schlemper, Jo
Qiu, Huaqi
Bai, Wenjia
Rueckert, Daniel
STATISTICAL ATLASES AND COMPUTATIONAL MODELS OF THE HEART: MULTI-SEQUENCE CMR SEGMENTATION, CRT-EPIGGY AND LV FULL QUANTIFICATION CHALLENGES, 2020, 12009 : 209 - 219
[39] Unsupervised Trajectory Segmentation and Promoting of Multi-Modal Surgical Demonstrations
Shao, Zhenzhou
Zhao, Hongfa
Xie, Jiexin
Qu, Ying
Guan, Yong
Tan, Jindong
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 777 - 782
