Multi-modal unsupervised domain adaptation for semantic image segmentation

被引：15

作者：

Hu, Sijie ^{[1
]}

Bonardi, Fabien ^{[1
]}

Bouchafa, Samia ^{[1
]}

Sidibe, Desire ^{[1
]}

机构：

[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France

来源：

PATTERN RECOGNITION | 2023年 / 137卷

关键词：

Unsupervised domain adaptation; Multi -modal learning; Self -supervised learning; Knowledge transfer; Semantic segmentation;

D O I：

10.1016/j.patcog.2022.109299

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevent property for providing geometric cues to en-hance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Ad-versarial Training (DAT) to bridge the depth discrepancy between the source and target domain. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth infor-mation. We experimentally observed significant performance improvement in multiple synthetic to real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Addi-tionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent perfor-mance boost. (c) 2023 Elsevier Ltd. All rights reserved.

引用

页数：12

共 50 条

[1] Adversarial unsupervised domain adaptation for 3D semantic segmentation with multi-modal learning
Liu, Wei
Luo, Zhiming
Cai, Yuanzheng
Yu, Ying
Ke, Yang
Marcato Junior, Jose
Goncalves, Wesley Nunes
Li, Jonathan
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 : 211 - 221
[2] Unsupervised domain adaptation multi-level adversarial network for semantic segmentation based on multi-modal features
Wang Z.
Bu S.
Huang W.
Zheng Y.
Wu Q.
Chang H.
Zhang X.
Tongxin Xuebao/Journal on Communications, 2022, 43 (12): : 157 - 171
[3] Multi-modal semantic image segmentation
Pemasiri, Akila
Kien Nguyen
Sridharan, Sridha
Fookes, Clinton
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 202
[4] Boosting Multi-Modal Unsupervised Domain Adaptation for LiDAR Semantic Segmentation by Self-Supervised Depth Completion
Cardace, Adriano
Conti, Andrea
Ramirez, Pierluigi Zama
Spezialetti, Riccardo
Salti, Samuele
Stefano, Luigi Di
IEEE ACCESS, 2023, 11 : 85155 - 85164
[5] An Unsupervised Domain Adaptation Method for Multi-Modal Remote Sensing Image Classification
Liu, Wei
Qin, Rongjun
Su, Fulin
Hu, Kun
2018 26TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS 2018), 2018,
[6] Adversarial Cross-modal Domain Adaptation for Multi-modal Semantic Segmentation in Autonomous Driving
Shi, Mengqi
Cao, Haozhi
Xie, Lihua
Yang, Jianfei
2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 850 - 855
[7] MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic Segmentation
Cao, Haozhi
Xu, Yuecong
Yang, Jianfei
Yin, Pengyu
Yuan, Shenghai
Xie, Lihua
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 9463 - 9470
[8] A multi-grained unsupervised domain adaptation approach for semantic segmentation
Li, Luyang
Ma, Tai
Lu, Yue
Li, Qingli
He, Lianghua
Wen, Ying
PATTERN RECOGNITION, 2023, 144
[9] Style adaptation for avoiding semantic inconsistency in Unsupervised Domain Adaptation medical image segmentation
Liu, Ziqiang
Chen, Zhao-Min
Chen, Huiling
Teng, Shu
Chen, Lei
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
[10] Unsupervised Domain Adaptation in Semantic Segmentation: A Review
Toldo, Marco
Maracani, Andrea
Michieli, Umberto
Zanuttigh, Pietro
TECHNOLOGIES, 2020, 8 (02)

← 1 2 3 4 5 →