Cross-Domain Transformer with Adaptive Thresholding for Domain Adaptive Semantic Segmentation

Cited by: 0
Authors
Liu, Quansheng [1]
Wang, Lei [1]
Jun, Yu [1]
Gao, Fang [2]
Affiliations
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Peoples R China
[2] Guangxi Univ, Sch Elect Engn, Nanning, Peoples R China
Keywords
Domain Adaptation; Semantic Segmentation; Transformer; Attention Mechanism
DOI
10.1007/978-3-031-44198-1_13
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The goal of unsupervised domain adaptive semantic segmentation (UDA-SS) is to learn a model from annotated source-domain data and generate accurate dense predictions for the unlabeled target domain. Transformer-based UDA methods use the self-attention mechanism to learn features within the source and target domains. However, when there is a significant distribution shift between the two domains, noisy pseudo-labels can hinder the model's adaptation to the target domain. In this work, we propose to combine self-attention and cross-domain attention to learn domain-invariant features. Specifically, we design a weight-sharing multi-branch cross-domain Transformer, in which the cross-domain branch aligns the domains at the feature level with the aid of cross-domain attention. Moreover, we introduce an adaptive thresholding strategy for pseudo-label selection, which dynamically adjusts the proportion of pseudo-labels used in training based on the model's adaptation status. Our approach ensures the reliability of the pseudo-labels while allowing more target-domain samples to contribute to model training. Extensive experiments show that the proposed method consistently outperforms the baseline and achieves competitive results on the GTA5 → Cityscapes, Synthia → Cityscapes, and Cityscapes → ACDC benchmarks.
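The abstract does not give the exact thresholding rule, so the following is only a minimal sketch of the general idea of proportion-based pseudo-label selection, not the authors' method. It assumes, purely for illustration, that "adaptation status" is approximated by an exponential moving average of the mean target-domain confidence, and that this value is used directly as the fraction of pixels to keep. The names `update_status`, `select_pseudo_labels`, `IGNORE_INDEX`, and the `momentum` parameter are hypothetical.

```python
# Illustrative sketch only: the adaptation-status proxy and the selection
# rule below are assumptions, not taken from the paper.

IGNORE_INDEX = 255  # assumed "ignore" label for pixels excluded from the loss


def update_status(status, confidences, momentum=0.9):
    """Exponential moving average of the mean confidence on a target batch.

    Used here as a hypothetical proxy for how well the model has adapted.
    """
    batch_mean = sum(confidences) / len(confidences)
    return momentum * status + (1.0 - momentum) * batch_mean


def select_pseudo_labels(probs, keep_ratio):
    """Keep the most confident `keep_ratio` fraction of pixels.

    `probs` is a list of per-pixel class-probability lists. Pixels that are
    not kept receive IGNORE_INDEX instead of a pseudo-label.
    """
    confidences = [max(p) for p in probs]
    labels = [p.index(max(p)) for p in probs]
    n_keep = int(round(keep_ratio * len(probs)))
    # Rank pixel indices from most to least confident.
    ranked = sorted(range(len(probs)), key=lambda i: confidences[i], reverse=True)
    keep = set(ranked[:n_keep])
    return [labels[i] if i in keep else IGNORE_INDEX for i in range(len(probs))]


# Four "pixels", two classes.
probs = [[0.9, 0.1], [0.6, 0.4], [0.2, 0.8], [0.55, 0.45]]
status = update_status(0.5, [max(p) for p in probs])
pseudo = select_pseudo_labels(probs, keep_ratio=0.5)
```

Under these assumptions, the kept fraction grows as the model becomes more confident on the target domain, so more target pixels contribute to training later on while low-confidence pixels are still masked out.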
Pages: 147-159
Number of pages: 13