Multi-modal unsupervised domain adaptation for semantic image segmentation

被引:15
|
作者
Hu, Sijie [1 ]
Bonardi, Fabien [1 ]
Bouchafa, Samia [1 ]
Sidibe, Desire [1 ]
机构
[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France
关键词
Unsupervised domain adaptation; Multi -modal learning; Self -supervised learning; Knowledge transfer; Semantic segmentation;
D O I
10.1016/j.patcog.2022.109299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevent property for providing geometric cues to en-hance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Ad-versarial Training (DAT) to bridge the depth discrepancy between the source and target domain. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth infor-mation. We experimentally observed significant performance improvement in multiple synthetic to real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Addi-tionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent perfor-mance boost. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Multichannel Semantic Segmentation with Unsupervised Domain Adaptation
    Watanabe, Kohei
    Saito, Kuniaki
    Ushiku, Yoshitaka
    Harada, Tatsuya
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT V, 2019, 11133 : 600 - 616
  • [12] Geometric Unsupervised Domain Adaptation for Semantic Segmentation
    Guizilini, Vitor
    Li, Jie
    Ambrus, Rares
    Gaidon, Adrien
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8517 - 8527
  • [13] Rethinking unsupervised domain adaptation for semantic segmentation
    Wang, Zhijie
    Suganuma, Masanori
    Okatani, Takayuki
    PATTERN RECOGNITION LETTERS, 2024, 186 : 119 - 125
  • [14] Unsupervised Domain Adaptation for Referring Semantic Segmentation
    Shi, Haonan
    Pan, Wenwen
    Zhao, Zhou
    Zhang, Mingmin
    Wu, Fei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5807 - 5818
  • [15] Adapt Everywhere: Unsupervised Adaptation of Point-Clouds and Entropy Minimization for Multi-Modal Cardiac Image Segmentation
    Vesal, Sulaiman
    Gu, Mingxuan
    Kosti, Ronak
    Maier, Andreas
    Ravikumar, Nishant
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (07) : 1838 - 1851
  • [16] MS-UDA: Multi-Spectral Unsupervised Domain Adaptation for Thermal Image Semantic Segmentation
    Kim, Yeong-Hyeon
    Shin, Ukcheol
    Park, Jinsun
    Kweon, In So
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6497 - 6504
  • [17] Multi-Head Distillation for Continual Unsupervised Domain Adaptation in Semantic Segmentation
    Saporta, Antoine
    Douillard, Arthur
    Vu, Tuan-Hung
    Perez, Patrick
    Cord, Matthieu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 3750 - 3759
  • [18] A framework for unsupervised segmentation of multi-modal medical images
    El-Baz, Ayman
    Farag, Aly
    Ali, Asem
    Gimel'farb, Georgy
    Casanova, Manuel
    COMPUTER VISION APPROACHES TO MEDICAL IMAGE ANALYSIS, 2006, 4241 : 120 - 131
  • [19] Towards Unsupervised Online Domain Adaptation for Semantic Segmentation
    Kuznietsov, Yevhen
    Proesmans, Marc
    Van Gool, Luc
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 261 - 271
  • [20] Unsupervised Adversarial Domain Adaptation Network for Semantic Segmentation
    Liu, Wei
    Su, Fulin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (11) : 1978 - 1982