Multi-modal unsupervised domain adaptation for semantic image segmentation

被引:15
|
作者
Hu, Sijie [1 ]
Bonardi, Fabien [1 ]
Bouchafa, Samia [1 ]
Sidibe, Desire [1 ]
机构
[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France
关键词
Unsupervised domain adaptation; Multi -modal learning; Self -supervised learning; Knowledge transfer; Semantic segmentation;
D O I
10.1016/j.patcog.2022.109299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevent property for providing geometric cues to en-hance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Ad-versarial Training (DAT) to bridge the depth discrepancy between the source and target domain. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth infor-mation. We experimentally observed significant performance improvement in multiple synthetic to real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Addi-tionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent perfor-mance boost. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Multi-Modal Continual Test-Time Adaptation for 3D Semantic Segmentation
    Cao, Haozhi
    Xu, Yuecong
    Yang, Jianfei
    Yin, Pengyu
    Yuan, Shenghai
    Xie, Lihua
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18763 - 18773
  • [42] Unsupervised Domain Adaptation with Implicit Pseudo Supervision for Semantic Segmentation
    Xu, Wanyu
    Wang, Zengmao
    Bian, Wei
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [43] Bilateral Knowledge Distillation for Unsupervised Domain Adaptation of Semantic Segmentation
    Wang, Yunnan
    Li, Jianxun
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10177 - 10184
  • [44] Target-targeted Domain Adaptation for Unsupervised Semantic Segmentation
    Zhang, Xiaohong
    Zhang, Haofeng
    Lu, Jianfeng
    Shao, Ling
    Yang, Jingyu
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13560 - 13566
  • [45] VARIATIONAL AUTOENCODER BASED UNSUPERVISED DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2426 - 2430
  • [46] Unsupervised Domain Adaptation for Semantic Segmentation with Global and Local Consistency
    Shan, Xiangxuan
    Yin, Zijin
    Gao, Jiayi
    Liang, Kongming
    Ma, Zhanyu
    Guo, Jun
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 154 - 165
  • [47] Latent Space Regularization for Unsupervised Domain Adaptation in Semantic Segmentation
    Barbato, Francesco
    Toldo, Marco
    Michieli, Umberto
    Zanuttigh, Pietro
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2829 - 2839
  • [48] Unsupervised Domain Adaptation for Semantic Segmentation using Depth Distribution
    Wu, Quanliang
    Liu, Huajun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [49] Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation
    Chen, Tao
    Wang, Shui-Hua
    Wang, Qiong
    Zhang, Zheng
    Xie, Guo-Sen
    Tang, Zhenmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1042 - 1054
  • [50] Unsupervised Domain Adaptation for Remote Sensing Semantic Segmentation with Transformer
    Li, Weitao
    Gao, Hui
    Su, Yi
    Momanyi, Biffon Manyura
    REMOTE SENSING, 2022, 14 (19)