CrossMatch: Source-Free Domain Adaptive Semantic Segmentation via Cross-Modal Consistency Training

被引：1

作者：

Yin, Yifang ^{[1
]}

Hu, Wenmiao ^{[2
,4
]}

Liu, Zhenguang ^{[3
]}

Wang, Guanfeng ^{[4
]}

Xiang, Shili ^{[1
]}

Zimmermann, Roger ^{[2
]}

机构：

[1] ASTAR, Inst Infocomm Res, Singapore, Singapore

[2] Natl Univ Singapore, Singapore, Singapore

[3] Zhejiang Gongshang Univ, Hangzhou, Peoples R China

[4] Grabtaxi Holdings Pte Ltd, Singapore, Singapore

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

关键词：

D O I：

10.1109/ICCV51070.2023.01991

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Source-free domain adaptive semantic segmentation has gained increasing attention recently. It eases the requirement of full access to the source domain by transferring knowledge only from a well-trained source model. However, reducing the uncertainty of the target pseudo labels becomes inevitably more challenging without the supervision of the labeled source data. In this work, we propose a novel asymmetric two-stream architecture that learns more robustly from noisy pseudo labels. Our approach simultaneously conducts dual-head pseudo label denoising and cross-modal consistency regularization. Towards the former, we introduce a multimodal auxiliary network during training (and discard it during inference), which effectively enhances the pseudo labels' correctness by leveraging the guidance from the depth information. Towards the latter, we enforce a new cross-modal pixel-wise consistency between the predictions of the two streams, encouraging our model to behave smoothly for both modality variance and image perturbations. It serves as an effective regularization to further reduce the impact of the inaccurate pseudo labels in source-free unsupervised domain adaptation. Experiments on GTA5. Cityscapes and SYNTHIA. Cityscapes benchmarks demonstrate the superiority of our proposed method, obtaining the new state-of-the-art mIoU of 57.7% and 57.5%, respectively.

引用

页码：21729 / 21739

页数：11

共 50 条

[21] Fast discrete cross-modal hashing with semantic consistency
Yao, Tao
Yan, Lianshan
Ma, Yilan
Yu, Hong
Su, Qingtang
Wang, Gang
Tian, Qi
NEURAL NETWORKS, 2020, 125 (125) : 142 - 152
[22] Discriminative semantic transitive consistency for cross-modal learning
Parida, Kranti Kumar
Sharma, Gaurav
COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 219
[23] Source-free domain adaptation for image segmentation
Bateson, Mathilde
Kervadec, Hoel
Dolz, Jose
Lombaert, Herve
Ben Ayed, Ismail
MEDICAL IMAGE ANALYSIS, 2022, 82
[24] Semantic Guidance Fusion Network for Cross-Modal Semantic Segmentation
Zhang, Pan
Chen, Ming
Gao, Meng
SENSORS, 2024, 24 (08)
[25] Cross-modal semantic transfer for point cloud semantic segmentation
Cao, Zhen
Mi, Xiaoxin
Qiu, Bo
Cao, Zhipeng
Long, Chen
Yan, Xinrui
Zheng, Chao
Dong, Zhen
Yang, Bisheng
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 221 : 265 - 279
[26] Consistency Regularization for Generalizable Source-free Domain Adaptation
Tang, Longxiang
Li, Kai
He, Chunming
Zhang, Yulun
Li, Xiu
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4325 - 4335
[27] Domain Adaptive Cross-Modal Image Retrieval via Modality and Domain Translations
Yanagi, Rintaro
Togo, Ren
Ogawa, Takahiro
Haseyama, Miki
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2021, E104A (06) : 866 - 875
[28] ADAPTIVE PSEUDO LABELING FOR SOURCE-FREE DOMAIN ADAPTATION IN MEDICAL IMAGE SEGMENTATION
Li, Chen
Chen, Wei
Luo, Xin
He, Yulin
Tan, Yusong
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1091 - 1095
[29] Cross-Modal Consistency for Single-Modal MR Image Segmentation
Xu, Wenxuan
Li, Cangxin
Bian, Yun
Meng, Qingquan
Zhu, Weifang
Shi, Fei
Chen, Xinjian
Shao, Chengwei
Xiang, Dehui
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2024, 71 (09) : 2557 - 2567
[30] Source-Free Domain Adaptation for RGB-D Semantic Segmentation with Vision Transformers
Rizzoli, Giulia
Shenaj, Donald
Zanuttigh, Pietro
2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 607 - 616

← 1 2 3 4 5 →