CrossMatch: Source-Free Domain Adaptive Semantic Segmentation via Cross-Modal Consistency Training

Cited by: 1
Authors
Yin, Yifang [1]
Hu, Wenmiao [2, 4]
Liu, Zhenguang [3]
Wang, Guanfeng [4]
Xiang, Shili [1]
Zimmermann, Roger [2]
Affiliations
[1] ASTAR, Inst Infocomm Res, Singapore, Singapore
[2] Natl Univ Singapore, Singapore, Singapore
[3] Zhejiang Gongshang Univ, Hangzhou, Peoples R China
[4] Grabtaxi Holdings Pte Ltd, Singapore, Singapore
DOI
10.1109/ICCV51070.2023.01991
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Source-free domain adaptive semantic segmentation has gained increasing attention recently. It eases the requirement of full access to the source domain by transferring knowledge only from a well-trained source model. However, reducing the uncertainty of the target pseudo labels becomes inevitably more challenging without the supervision of the labeled source data. In this work, we propose a novel asymmetric two-stream architecture that learns more robustly from noisy pseudo labels. Our approach simultaneously conducts dual-head pseudo label denoising and cross-modal consistency regularization. Towards the former, we introduce a multimodal auxiliary network during training (and discard it during inference), which effectively enhances the pseudo labels' correctness by leveraging the guidance from the depth information. Towards the latter, we enforce a new cross-modal pixel-wise consistency between the predictions of the two streams, encouraging our model to behave smoothly for both modality variance and image perturbations. It serves as an effective regularization to further reduce the impact of the inaccurate pseudo labels in source-free unsupervised domain adaptation. Experiments on GTA5 → Cityscapes and SYNTHIA → Cityscapes benchmarks demonstrate the superiority of our proposed method, obtaining the new state-of-the-art mIoU of 57.7% and 57.5%, respectively.
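The abstract names two ingredients, pseudo label denoising and pixel-wise consistency between two streams, without giving their formulas. The following is a minimal illustrative sketch only, not the authors' method: it assumes an agreement-plus-confidence rule for filtering pseudo labels and a mean-squared-difference consistency term. The function names, the `IGNORE` value, and the `threshold` parameter are all hypothetical.

```python
import math

IGNORE = 255  # hypothetical "ignore" label for pixels whose pseudo label is dropped


def softmax(logits):
    """Numerically stable softmax over one pixel's class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def denoise_pseudo_labels(probs_a, probs_b, threshold=0.9):
    """Keep a pixel's pseudo label only if both heads agree and are confident.

    Assumed rule for illustration; probs_a and probs_b hold one
    class-probability list per pixel, one list per prediction head.
    """
    labels = []
    for pa, pb in zip(probs_a, probs_b):
        ca = max(range(len(pa)), key=pa.__getitem__)  # argmax of head A
        cb = max(range(len(pb)), key=pb.__getitem__)  # argmax of head B
        confident = min(pa[ca], pb[cb]) >= threshold
        labels.append(ca if (ca == cb and confident) else IGNORE)
    return labels


def consistency_loss(probs_rgb, probs_aux):
    """Mean squared difference between the two streams' per-pixel predictions.

    Assumed form of the consistency term; zero when both streams agree exactly.
    """
    total = 0.0
    count = 0
    for p, q in zip(probs_rgb, probs_aux):
        total += sum((pi - qi) ** 2 for pi, qi in zip(p, q))
        count += len(p)
    return total / count


# Two pixels, three classes: the streams agree confidently on the first
# pixel and disagree on the second.
rgb_probs = [softmax([4.0, 0.0, 0.0]), softmax([0.2, 0.1, 0.0])]
aux_probs = [softmax([5.0, 0.0, 0.0]), softmax([0.0, 0.3, 0.1])]
pseudo = denoise_pseudo_labels(rgb_probs, aux_probs)
loss = consistency_loss(rgb_probs, aux_probs)
```

In this toy input the first pixel keeps its label and the second is marked `IGNORE`, while the consistency term stays small but nonzero because the two streams' probabilities differ slightly.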
Pages: 21729-21739
Page count: 11
Related papers
50 in total
  • [21] Fast discrete cross-modal hashing with semantic consistency
    Yao, Tao
    Yan, Lianshan
    Ma, Yilan
    Yu, Hong
    Su, Qingtang
    Wang, Gang
    Tian, Qi
    NEURAL NETWORKS, 2020, 125 (125) : 142 - 152
  • [22] Discriminative semantic transitive consistency for cross-modal learning
    Parida, Kranti Kumar
    Sharma, Gaurav
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 219
  • [23] Source-free domain adaptation for image segmentation
    Bateson, Mathilde
    Kervadec, Hoel
    Dolz, Jose
    Lombaert, Herve
    Ben Ayed, Ismail
    MEDICAL IMAGE ANALYSIS, 2022, 82
  • [24] Semantic Guidance Fusion Network for Cross-Modal Semantic Segmentation
    Zhang, Pan
    Chen, Ming
    Gao, Meng
    SENSORS, 2024, 24 (08)
  • [25] Cross-modal semantic transfer for point cloud semantic segmentation
    Cao, Zhen
    Mi, Xiaoxin
    Qiu, Bo
    Cao, Zhipeng
    Long, Chen
    Yan, Xinrui
    Zheng, Chao
    Dong, Zhen
    Yang, Bisheng
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 221 : 265 - 279
  • [26] Consistency Regularization for Generalizable Source-free Domain Adaptation
    Tang, Longxiang
    Li, Kai
    He, Chunming
    Zhang, Yulun
    Li, Xiu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4325 - 4335
  • [27] Domain Adaptive Cross-Modal Image Retrieval via Modality and Domain Translations
    Yanagi, Rintaro
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2021, E104A (06) : 866 - 875
  • [28] ADAPTIVE PSEUDO LABELING FOR SOURCE-FREE DOMAIN ADAPTATION IN MEDICAL IMAGE SEGMENTATION
    Li, Chen
    Chen, Wei
    Luo, Xin
    He, Yulin
    Tan, Yusong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1091 - 1095
  • [29] Cross-Modal Consistency for Single-Modal MR Image Segmentation
    Xu, Wenxuan
    Li, Cangxin
    Bian, Yun
    Meng, Qingquan
    Zhu, Weifang
    Shi, Fei
    Chen, Xinjian
    Shao, Chengwei
    Xiang, Dehui
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2024, 71 (09) : 2557 - 2567
  • [30] Source-Free Domain Adaptation for RGB-D Semantic Segmentation with Vision Transformers
    Rizzoli, Giulia
    Shenaj, Donald
    Zanuttigh, Pietro
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 607 - 616