Cross-Domain Transformer with Adaptive Thresholding for Domain Adaptive Semantic Segmentation

被引:0
|
作者
Liu, Quansheng [1 ]
Wang, Lei [1 ]
Jun, Yu [1 ]
Gao, Fang [2 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Peoples R China
[2] Guangxi Univ, Sch Elect Engn, Nanning, Peoples R China
关键词
Domain Adaptation; Semantic Segmentation; Transformer; Attention mechanism;
D O I
10.1007/978-3-031-44198-1_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of unsupervised domain adaptive semantic segmentation (UDA-SS) is to learn a model using annotated data from the source domain and generate accurate dense predictions for the unlabeled target domain. UDA methods based on Transformer utilize self-attention mechanism to learn features within source and target domains. However, in the presence of significant distribution shift between the two domains, the noisy pseudo-labels could hinder the model's adaptation to the target domain. In this work, we proposed to incorporate self-attention and cross-domain attention to learn domain-invariant features. Specifically, we design a weight-sharing multi-branch cross-domain Transformer, where the cross-domain branch is used to align domains at the feature level with the aid of cross-domain attention. Moreover, we introduce an adaptive thresholding strategy for pseudo-label selection, which dynamically adjusts the proportion of pseudo-labels that are used in training based on the model's adaptation status. Our approach guarantees the reliability of the pseudo labels while allowing more target domain samples to contribute to model training. Extensive experiments show that our proposed method consistently outperforms the baseline and achieves competitive results on GTA5 -> Cityscapes, Synthia -> Cityscapes, and Cityscapes -> ACDC benchmark.
引用
收藏
页码:147 / 159
页数:13
相关论文
共 50 条
  • [31] Cross-Domain Semantic Segmentation on Inconsistent Taxonomy Using VLMs
    Lim, Jeongkee
    Kim, Yusung
    COMPUTER VISION - ECCV 2024, PT LXV, 2025, 15123 : 18 - 35
  • [32] Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation
    Jiang, Zhengkai
    Li, Yuxi
    Yang, Ceyuan
    Gao, Peng
    Wang, Yabiao
    Tai, Ying
    Wang, Chengjie
    COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 36 - 54
  • [33] Unsupervised Domain Adaptive Point Cloud Semantic Segmentation
    Bian, Yikai
    Xie, Jin
    Qian, Jianjun
    PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 285 - 298
  • [34] DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation
    Lai, Xin
    Tian, Zhuotao
    Xu, Xiaogang
    Chen, Yingcong
    Liu, Shu
    Zhao, Hengshuang
    Wang, Liwei
    Jia, Jiaya
    COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 : 369 - 387
  • [35] Prototypical Contrastive Learning for Domain Adaptive Semantic Segmentation
    Liu, Quansheng
    Pu, Chengdao
    Gao, Fang
    Yu, Jun
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [36] Alignment and fusion for adaptive domain nighttime semantic segmentation
    Zhang, Bao
    Yao, Nianmin
    Zhao, Jian
    Zhang, Yanan
    IMAGE AND VISION COMPUTING, 2024, 146
  • [37] CLASSIFICATION CONSTRAINED DISCRIMINATOR FOR DOMAIN ADAPTIVE SEMANTIC SEGMENTATION
    Chen, Tao
    Zhang, Jian
    Xie, Guo-Sen
    Yao, Yazhou
    Huang, Xiaoshui
    Tang, Zhenmin
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [38] Explore Epistemic Uncertainty in Domain Adaptive Semantic Segmentation
    Yao, Kai
    Su, Zixian
    Yang, Xi
    Sun, Jie
    Huang, Kaizhu
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2990 - 2998
  • [39] Domain Adaptive Semantic Segmentation without Source Data
    You, Fuming
    Li, Jingjing
    Zhu, Lei
    Chen, Zhi
    Huang, Zi
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3293 - 3302
  • [40] Domain Adaptive Semantic Segmentation Through Structure Enhancement
    Lv, Fengmao
    Lian, Qing
    Yang, Guowu
    Lin, Guosheng
    Pan, Sinno Jialin
    Duan, Lixin
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 172 - 179