Decompose, Adjust, Compose: Effective Normalization by Playing with Frequency for Domain Generalization

被引:16
|
作者
Lee, Sangrok [1 ]
Bae, Jongseong [2 ]
Kim, Ha Young [1 ]
机构
[1] Yonsei Univ, Grad Sch Informat, Seoul, South Korea
[2] Yonsei Univ, Dept Artificial Intelligence, Seoul, South Korea
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
基金
新加坡国家研究基金会;
关键词
PHASE;
D O I
10.1109/CVPR52729.2023.01133
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain generalization (DG) is a principal task to evaluate the robustness of computer vision models. Many previous studies have used normalization for DG. In normalization, statistics and normalized features are regarded as style and content, respectively. However, it has a content variation problem when removing style because the boundary between content and style is unclear. This study addresses this problem from the frequency domain perspective, where amplitude and phase are considered as style and content, respectively. First, we verify the quantitative phase variation of normalization through the mathematical derivation of the Fourier transform formula. Then, based on this, we propose a novel normalization method, PC Norm, which eliminates style only as the preserving content through spectral decomposition. Furthermore, we propose advanced PC Norm variants, CC Norm and SC Norm, which adjust the degrees of variations in content and style, respectively. Thus, they can learn domain-agnostic representations for DG. With the normalization methods, we propose ResNet-variant models, DAC-P and DAC-SC, which are robust to the domain gap. The proposed models outperform other recent DG methods. The DAC-SC achieves an average state-of-the-art performance of 65.6% on five datasets: PACS, VLCS, Office-Home, DomainNet, and TerraIncognita.
引用
收藏
页码:11776 / 11785
页数:10
相关论文
共 50 条
  • [11] Improving Generalization of Drowsiness State Classification by Domain-Specific Normalization
    Kim, Dong-Young
    Han, Dong-Kyun
    Park, Seo-Hyeon
    Jang, Geun-Deok
    2024 12TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI 2024, 2024,
  • [12] Collaborative learning with normalization augmentation for domain generalization in time series classification
    He, Qi-Qiao
    Gong, Xueyuan
    Si, Yain-Whar
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [13] EXAMINATION OF NORMALIZATION METHODS FOR DATA ANALYSIS IN FREQUENCY DOMAIN
    Hales, M.
    Wang, Y. T.
    Johnson, B. F.
    MEDICINE AND SCIENCE IN SPORTS AND EXERCISE, 2001, 33 (05): : S84 - S84
  • [14] Generalization of the Fourier domain watermarking to the space/spatial-frequency domain
    Djurovic, I
    Stankovic, S
    Pitas, I
    Stankovic, L
    Tilp, J
    IWISPA 2000: PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2000, : 47 - 51
  • [15] Domain-generalization human activity recognition model based on CSI instance normalization
    Wang Y.
    Xu J.
    Wang A.
    Xia H.
    Zhao C.
    Ji Y.
    Tongxin Xuebao/Journal on Communications, 2024, 45 (06): : 196 - 209
  • [16] Face recognition based on illumination normalization in frequency-domain
    School of Computer Science, Sichuan University, Chengdu 610065, China
    Dianzi Keji Diaxue Xuebao, 2009, 6 (1021-1025):
  • [17] Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction
    Wang, Jingye
    Du, Ruoyi
    Chang, Dongliang
    Liang, Kongming
    Ma, Zhanyu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4821 - 4829
  • [18] FedSR: A Simple and Effective Domain Generalization Method for Federated Learning
    Nguyen, A. Tuan
    Torr, Philip H. S.
    Lim, Ser-Nam
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [19] Frequency domain blind source separation of a reduced amount of data using frequency normalization
    Robledo-Arnuncio, Enrique
    Sawada, Hiroshi
    Makino, Shoji
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5695 - 5698
  • [20] FREQUENCY-DOMAIN CHARACTERISTICS OF LINEAR OPERATOR TO DECOMPOSE A TIME-SERIES INTO THE MULTI-COMPONENTS
    HIGUCHI, T
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1991, 43 (03) : 469 - 492