Multi-modal unsupervised domain adaptation for semantic image segmentation

被引:15
|
作者
Hu, Sijie [1 ]
Bonardi, Fabien [1 ]
Bouchafa, Samia [1 ]
Sidibe, Desire [1 ]
机构
[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France
关键词
Unsupervised domain adaptation; Multi -modal learning; Self -supervised learning; Knowledge transfer; Semantic segmentation;
D O I
10.1016/j.patcog.2022.109299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevent property for providing geometric cues to en-hance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Ad-versarial Training (DAT) to bridge the depth discrepancy between the source and target domain. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth infor-mation. We experimentally observed significant performance improvement in multiple synthetic to real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Addi-tionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent perfor-mance boost. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Adversarial unsupervised domain adaptation for 3D semantic segmentation with multi-modal learning
    Liu, Wei
    Luo, Zhiming
    Cai, Yuanzheng
    Yu, Ying
    Ke, Yang
    Marcato Junior, Jose
    Goncalves, Wesley Nunes
    Li, Jonathan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 : 211 - 221
  • [2] Unsupervised domain adaptation multi-level adversarial network for semantic segmentation based on multi-modal features
    Wang Z.
    Bu S.
    Huang W.
    Zheng Y.
    Wu Q.
    Chang H.
    Zhang X.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (12): : 157 - 171
  • [3] Multi-modal semantic image segmentation
    Pemasiri, Akila
    Kien Nguyen
    Sridharan, Sridha
    Fookes, Clinton
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 202
  • [4] Boosting Multi-Modal Unsupervised Domain Adaptation for LiDAR Semantic Segmentation by Self-Supervised Depth Completion
    Cardace, Adriano
    Conti, Andrea
    Ramirez, Pierluigi Zama
    Spezialetti, Riccardo
    Salti, Samuele
    Stefano, Luigi Di
    IEEE ACCESS, 2023, 11 : 85155 - 85164
  • [5] An Unsupervised Domain Adaptation Method for Multi-Modal Remote Sensing Image Classification
    Liu, Wei
    Qin, Rongjun
    Su, Fulin
    Hu, Kun
    2018 26TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS 2018), 2018,
  • [6] Adversarial Cross-modal Domain Adaptation for Multi-modal Semantic Segmentation in Autonomous Driving
    Shi, Mengqi
    Cao, Haozhi
    Xie, Lihua
    Yang, Jianfei
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 850 - 855
  • [7] MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic Segmentation
    Cao, Haozhi
    Xu, Yuecong
    Yang, Jianfei
    Yin, Pengyu
    Yuan, Shenghai
    Xie, Lihua
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 9463 - 9470
  • [8] A multi-grained unsupervised domain adaptation approach for semantic segmentation
    Li, Luyang
    Ma, Tai
    Lu, Yue
    Li, Qingli
    He, Lianghua
    Wen, Ying
    PATTERN RECOGNITION, 2023, 144
  • [9] Style adaptation for avoiding semantic inconsistency in Unsupervised Domain Adaptation medical image segmentation
    Liu, Ziqiang
    Chen, Zhao-Min
    Chen, Huiling
    Teng, Shu
    Chen, Lei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
  • [10] Unsupervised Domain Adaptation in Semantic Segmentation: A Review
    Toldo, Marco
    Maracani, Andrea
    Michieli, Umberto
    Zanuttigh, Pietro
    TECHNOLOGIES, 2020, 8 (02)