Generalizing to unseen domains via PatchMix

被引:1
|
作者
Yang, Juncheng [1 ,3 ]
Li, Zuchao [2 ]
Li, Chao [4 ]
Xie, Shuai [5 ]
Yu, Wei [1 ]
Li, Shijun [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
[3] Henan Polytech Inst, Sch Elect & Informat Engn, Nanyang 473000, Henan, Peoples R China
[4] JD Hlth Int Inc, Beijing, Peoples R China
[5] JD Explore Acad, Beijing, Peoples R China
关键词
Domain generalization; PatchMix; Domain discriminator; Vision transformer; Data augmentation;
D O I
10.1007/s00530-023-01213-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Domain generalization (DG) aims to transfer knowledge learned from multiple source domains to unseen domains. One of the primary challenges hinders DG is the insufficient diversity of source domains, which hampers the model's ability to learn to generalize. Traditional data augmentation methods, which fuse content, style, labels, etc., unable to effectively learn the global features from the source domains. In this paper, we present an innovative approach to domain generalization learning technique, called PatchMix, by stitching the patches of different source domains together to build domain-mixup samples. This approach helps the model to learn the common features of different source domains. Meanwhile, a domain discriminator is introduced to preserve the model's ability to distinguish the source domains, which is proved to be helpful for the model to generalize to unseen domains. To our best knowledge, we are the first to unveil the equation that elucidates the correlation between the number of patches and the number of source domains. Our method, PatchMix, outperforms the current state-of-the-art (SOTA) on four benchmark datasets.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Generalizing across Temporal Domains with Koopman Operators
    Zeng, Qiuhao
    Wang, Wei
    Zhou, Fan
    Xu, Gezheng
    Pu, Ruizhi
    Shui, Changjian
    Gagne, Christian
    Yang, Shichun
    Ling, Charles X.
    Wang, Boyu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16651 - 16659
  • [32] Generalization on Unseen Domains via Inference-time Label-Preserving Target Projections
    Pandey, Prashant
    Raman, Mrigank
    Varambally, Sumanth
    Prathosh, A. P.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12919 - 12928
  • [33] Generalizing state-of-the-art object detectors for autonomous vehicles in unseen environments
    Khosravian, Amir
    Amirkhani, Abdollah
    Kashiani, Hossein
    Masih-Tehrani, Masoud
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [34] Generalizing Neural Human Fitting to Unseen Poses With Articulated SE(3) Equivariance
    Feng, Haiwen
    Kulits, Peter
    Liu, Shichen
    Black, Michael J.
    Abrevaya, Victoria
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 7943 - 7954
  • [35] Learning Meta Face Recognition in Unseen Domains
    Guo, Jianzhu
    Zhu, Xiangyu
    Zhao, Chenxu
    Cao, Dong
    Lei, Zhen
    Li, Stan Z.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6162 - 6171
  • [36] Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration
    Huang, De-An
    Nair, Suraj
    Xu, Danfei
    Zhu, Yuke
    Garg, Animesh
    Li Fei-Fei
    Savarese, Silvio
    Niebles, Juan Carlos
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8557 - 8566
  • [37] Generalizing to Unseen Entities and Entity Pairs with Row-less Universal Schema
    Verga, Patrick
    Neelakantan, Arvind
    McCallum, Andrew
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 613 - 622
  • [38] Generalizing programs via subsumption
    Gutiérrez-Naranjo, MA
    Alonso-Jiménez, JA
    Borrego-Díaz, J
    COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2003, 2003, 2809 : 115 - 126
  • [39] Taxonomy Construction of Unseen Domains via Graph-based Cross-Domain Knowledge Transfer
    Shang, Chao
    Dash, Sarthak
    Chowdhury, Faisal Mahbub
    Mihindukulasooriya, Nandana
    Gliozzo, Alfio
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2198 - 2208
  • [40] VideoDG: Generalizing Temporal Relations in Videos to Novel Domains
    Yao, Zhiyu
    Wang, Yunbo
    Wang, Jianmin
    Yu, Philip S.
    Long, Mingsheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 7989 - 8004