Multi-modal unsupervised domain adaptation for semantic image segmentation

被引：15

作者：

Hu, Sijie ^{[1
]}

Bonardi, Fabien ^{[1
]}

Bouchafa, Samia ^{[1
]}

Sidibe, Desire ^{[1
]}

机构：

[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France

来源：

PATTERN RECOGNITION | 2023年 / 137卷

关键词：

Unsupervised domain adaptation; Multi -modal learning; Self -supervised learning; Knowledge transfer; Semantic segmentation;

D O I：

10.1016/j.patcog.2022.109299

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevent property for providing geometric cues to en-hance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Ad-versarial Training (DAT) to bridge the depth discrepancy between the source and target domain. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth infor-mation. We experimentally observed significant performance improvement in multiple synthetic to real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Addi-tionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent perfor-mance boost. (c) 2023 Elsevier Ltd. All rights reserved.

引用

页数：12

共 50 条

[41] Multi-Modal Continual Test-Time Adaptation for 3D Semantic Segmentation
Cao, Haozhi
Xu, Yuecong
Yang, Jianfei
Yin, Pengyu
Yuan, Shenghai
Xie, Lihua
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18763 - 18773
[42] Unsupervised Domain Adaptation with Implicit Pseudo Supervision for Semantic Segmentation
Xu, Wanyu
Wang, Zengmao
Bian, Wei
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[43] Bilateral Knowledge Distillation for Unsupervised Domain Adaptation of Semantic Segmentation
Wang, Yunnan
Li, Jianxun
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10177 - 10184
[44] Target-targeted Domain Adaptation for Unsupervised Semantic Segmentation
Zhang, Xiaohong
Zhang, Haofeng
Lu, Jianfeng
Shao, Ling
Yang, Jingyu
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13560 - 13566
[45] VARIATIONAL AUTOENCODER BASED UNSUPERVISED DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION
Li, Zongyao
Togo, Ren
Ogawa, Takahiro
Haseyama, Miki
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2426 - 2430
[46] Unsupervised Domain Adaptation for Semantic Segmentation with Global and Local Consistency
Shan, Xiangxuan
Yin, Zijin
Gao, Jiayi
Liang, Kongming
Ma, Zhanyu
Guo, Jun
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 154 - 165
[47] Latent Space Regularization for Unsupervised Domain Adaptation in Semantic Segmentation
Barbato, Francesco
Toldo, Marco
Michieli, Umberto
Zanuttigh, Pietro
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2829 - 2839
[48] Unsupervised Domain Adaptation for Semantic Segmentation using Depth Distribution
Wu, Quanliang
Liu, Huajun
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[49] Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation
Chen, Tao
Wang, Shui-Hua
Wang, Qiong
Zhang, Zheng
Xie, Guo-Sen
Tang, Zhenmin
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1042 - 1054
[50] Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
Li, Weitao
Gao, Hui
Su, Yi
Momanyi, Biffon Manyura
REMOTE SENSING, 2022, 14 (19)

← 1 2 3 4 5 →