Multi-modal unsupervised domain adaptation for semantic image segmentation

被引:15
|
作者
Hu, Sijie [1 ]
Bonardi, Fabien [1 ]
Bouchafa, Samia [1 ]
Sidibe, Desire [1 ]
机构
[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France
关键词
Unsupervised domain adaptation; Multi -modal learning; Self -supervised learning; Knowledge transfer; Semantic segmentation;
D O I
10.1016/j.patcog.2022.109299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevent property for providing geometric cues to en-hance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Ad-versarial Training (DAT) to bridge the depth discrepancy between the source and target domain. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth infor-mation. We experimentally observed significant performance improvement in multiple synthetic to real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Addi-tionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent perfor-mance boost. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Semantic Consistent Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
    Zeng, Guodong
    Lerch, Till D.
    Schmaranzer, Florian
    Zheng, Guoyan
    Burger, Juergen
    Gerber, Kate
    Tannast, Moritz
    Siebenrock, Klaus
    Gerber, Nicolas
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 201 - 210
  • [32] Differentiated Learning for Multi-Modal Domain Adaptation
    Lv, Jianming
    Liu, Kaijie
    He, Shengfeng
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1322 - 1330
  • [33] Pseudo Multi-Modal Approach to LiDAR Semantic Segmentation
    Kim, Kyungmin
    SENSORS, 2024, 24 (23)
  • [34] MULTI-MODAL SEMANTIC MESH SEGMENTATION IN URBAN SCENES
    Laupheimer, Dominik
    Haala, Norbert
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 267 - 274
  • [35] A Multi-task Unsupervised Domain Adaptation Network for Medical Image Segmentation
    Shi, Yuejing
    Zhu, Fan
    Peng, Yan
    Ye, Zhen
    Zhou, Chaozheng
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND INTELLIGENT CONTROL (IPIC 2021), 2021, 11928
  • [36] Multi-modal brain tumor segmentation via conditional synthesis with Fourier domain adaptation
    Al Khalil, Yasmina
    Ayaz, Aymen
    Lorenz, Cristian
    Weese, Juergen
    Pluim, Josien
    Breeuwer, Marcel
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 112
  • [37] Unpaired multi-modal tumor segmentation with structure adaptation
    Zhou, Pei
    Chen, Houjin
    Li, Yanfeng
    Peng, Yahui
    APPLIED INTELLIGENCE, 2023, 53 (04) : 3639 - 3651
  • [38] Unsupervised Multi-modal Style Transfer for Cardiac MR Segmentation
    Chen, Chen
    Ouyang, Cheng
    Tarroni, Giacomo
    Schlemper, Jo
    Qiu, Huaqi
    Bai, Wenjia
    Rueckert, Daniel
    STATISTICAL ATLASES AND COMPUTATIONAL MODELS OF THE HEART: MULTI-SEQUENCE CMR SEGMENTATION, CRT-EPIGGY AND LV FULL QUANTIFICATION CHALLENGES, 2020, 12009 : 209 - 219
  • [39] Unsupervised Trajectory Segmentation and Promoting of Multi-Modal Surgical Demonstrations
    Shao, Zhenzhou
    Zhao, Hongfa
    Xie, Jiexin
    Qu, Ying
    Guan, Yong
    Tan, Jindong
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 777 - 782
  • [40] Unpaired multi-modal tumor segmentation with structure adaptation
    Pei Zhou
    Houjin Chen
    Yanfeng Li
    Yahui Peng
    Applied Intelligence, 2023, 53 : 3639 - 3651