Generalizing to unseen domains via PatchMix

Cited by: 1
Authors
Yang, Juncheng [1, 3]
Li, Zuchao [2]
Li, Chao [4]
Xie, Shuai [5]
Yu, Wei [1]
Li, Shijun [1]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
[3] Henan Polytech Inst, Sch Elect & Informat Engn, Nanyang 473000, Henan, Peoples R China
[4] JD Hlth Int Inc, Beijing, Peoples R China
[5] JD Explore Acad, Beijing, Peoples R China
Keywords
Domain generalization; PatchMix; Domain discriminator; Vision transformer; Data augmentation;
DOI
10.1007/s00530-023-01213-8
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Domain generalization (DG) aims to transfer knowledge learned from multiple source domains to unseen domains. One of the primary challenges hindering DG is the insufficient diversity of the source domains, which hampers the model's ability to learn to generalize. Traditional data augmentation methods, which fuse content, style, labels, etc., are unable to effectively learn the global features of the source domains. In this paper, we present a domain generalization technique called PatchMix, which stitches patches from different source domains together to build domain-mixup samples. This approach helps the model learn the features common to the different source domains. Meanwhile, a domain discriminator is introduced to preserve the model's ability to distinguish the source domains, which proves helpful for generalizing to unseen domains. To the best of our knowledge, we are the first to present an equation that describes the correlation between the number of patches and the number of source domains. Our method, PatchMix, outperforms the current state-of-the-art (SOTA) on four benchmark datasets.
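The patch-stitching idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `patchmix`, the uniform random choice of a source domain for each patch, the fixed square patch size, and the returned `domain_map` are all assumptions made for illustration. The paper's equation relating the number of patches to the number of source domains is not reproduced here.

```python
import numpy as np

def patchmix(images, patch_size, rng=None):
    """Stitch patches from several same-sized images into one mixed image.

    images: array of shape (D, H, W, C), one image per source domain (assumed).
    Returns (mixed, domain_map). mixed has shape (H, W, C). domain_map has
    shape (H // patch_size, W // patch_size) and records the source domain
    of each patch.
    """
    images = np.asarray(images)
    d, h, w, _ = images.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image height and width must be multiples of patch_size")
    rng = np.random.default_rng() if rng is None else rng
    rows, cols = h // patch_size, w // patch_size
    # Assumption: each patch position draws its source domain uniformly at random.
    domain_map = rng.integers(0, d, size=(rows, cols))
    # Expand the patch-level map to pixel resolution.
    pixel_map = np.repeat(np.repeat(domain_map, patch_size, axis=0), patch_size, axis=1)
    # For every pixel, take the value from the image of the chosen domain.
    mixed = np.take_along_axis(images, pixel_map[None, :, :, None], axis=0)[0]
    return mixed, domain_map
```

In a setup like the one the abstract describes, a per-patch map such as `domain_map` could plausibly serve as the target for a domain discriminator, though how the paper actually defines that target is not stated in this record.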
Pages: 16
Related papers
50 in total
  • [1] Generalizing to unseen domains via PatchMix
    Juncheng Yang
    Zuchao Li
    Chao Li
    Shuai Xie
    Wei Yu
    Shijun Li
    Multimedia Systems, 2024, 30
  • [2] Generalizing to Unseen Domains via Adversarial Data Augmentation
    Volpi, Riccardo
    Namkoong, Hongseok
    Sener, Ozan
    Duchi, John
    Murino, Vittorio
    Savarese, Silvio
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] GradCa: Generalizing to unseen domains via gradient calibration
    Song, Yiguo
    Liu, Zhenyu
    Tang, Ruining
    Duan, Guifang
    Tan, Jianrong
    NEUROCOMPUTING, 2023, 529 : 1 - 10
  • [4] Generalizing to Unseen Domains: A Survey on Domain Generalization
    Wang, Jindong
    Lan, Cuiling
    Liu, Chang
    Ouyang, Yidong
    Qin, Tao
    Lu, Wang
    Chen, Yiqiang
    Zeng, Wenjun
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8052 - 8072
  • [5] Towards Generalizing to Unseen Domains with Few Labels
    Galappaththige, Chamuditha Jayanga
    Baliah, Sanoojan
    Gunawardhana, Malitha
    Khan, Muhammad Haris
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 23691 - 23700
  • [6] Generalizing to Unseen Domains: A Survey on Domain Generalization
    Wang, Jindong
    Lan, Cuiling
    Liu, Chang
    Ouyang, Yidong
    Qin, Tao
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4627 - 4635
  • [7] DomainFusion: Generalizing to Unseen Domains with Latent Diffusion Models
    Huang, Yuyang
    Chen, Yabo
    Liu, Yuchen
    Zhang, Xiaopeng
    Dai, Wenrui
    Xiong, Hongkai
    Tian, Qi
    COMPUTER VISION - ECCV 2024, PT XLI, 2025, 15099 : 480 - 498
  • [8] Consistent Augmentation Learning for Generalizing CLIP to Unseen Domains
    Xuan, Qinan
    Yu, Tianyuan
    Bai, Liang
    Ruan, Yirun
    IEEE ACCESS, 2024, 12 : 167834 - 167844
  • [9] Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations
    Xia, Peng
    Hu, Ming
    Tang, Feilong
    Li, Wenxue
    Zheng, Wenhao
    Ju, Lie
    Duan, Peibo
    Yao, Huaxiu
    Ge, Zongyuan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X, 2024, 15010 : 427 - 437
  • [10] CODA: Generalizing to Open and Unseen Domains with Compaction and Disambiguation
    Chen, Chaoqi
    Tang, Luyao
    Huang, Yue
    Han, Xiaoguang
    Yu, Yizhou
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,