Decompose, Adjust, Compose: Effective Normalization by Playing with Frequency for Domain Generalization

被引:16
|
作者
Lee, Sangrok [1 ]
Bae, Jongseong [2 ]
Kim, Ha Young [1 ]
机构
[1] Yonsei Univ, Grad Sch Informat, Seoul, South Korea
[2] Yonsei Univ, Dept Artificial Intelligence, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
PHASE;
D O I
10.1109/CVPR52729.2023.01133
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain generalization (DG) is a principal task to evaluate the robustness of computer vision models. Many previous studies have used normalization for DG. In normalization, statistics and normalized features are regarded as style and content, respectively. However, it has a content variation problem when removing style because the boundary between content and style is unclear. This study addresses this problem from the frequency domain perspective, where amplitude and phase are considered as style and content, respectively. First, we verify the quantitative phase variation of normalization through the mathematical derivation of the Fourier transform formula. Then, based on this, we propose a novel normalization method, PC Norm, which eliminates style only as the preserving content through spectral decomposition. Furthermore, we propose advanced PC Norm variants, CC Norm and SC Norm, which adjust the degrees of variations in content and style, respectively. Thus, they can learn domain-agnostic representations for DG. With the normalization methods, we propose ResNet-variant models, DAC-P and DAC-SC, which are robust to the domain gap. The proposed models outperform other recent DG methods. The DAC-SC achieves an average state-of-the-art performance of 65.6% on five datasets: PACS, VLCS, Office-Home, DomainNet, and TerraIncognita.
引用
收藏
页码:11776 / 11785
页数:10
相关论文
共 50 条
  • [31] A generalization of frequency-domain stability criteria to a wider class of systems.
    Altshuller, DA
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 2657 - 2662
  • [32] MFAE: Masked Frequency Autoencoders for Domain Generalization Face Anti-Spoofing
    Zheng, Tianyi
    Li, Bo
    Wu, Shuang
    Wan, Ben
    Mu, Guodong
    Liu, Shice
    Ding, Shouhong
    Wang, Jia
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 (4058-4069) : 4058 - 4069
  • [33] Generalization of a total least squares problem in frequency-domain system identification
    Balogh, L
    Kollár, I
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2002, 51 (06) : 1353 - 1357
  • [34] Improving Domain Generalization for Sound Classification with Sparse Frequency-Regularized Transformer
    Mu, Honglin
    Xia, Wentian
    Che, Wanxiang
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1104 - 1108
  • [35] A THREE DIMENSIONAL MODEL WATERMARKING ALGORITHM IN FREQUENCY DOMAIN BASED ON THE NORMALIZATION OF HOST MODELS
    Liu, Chen-Chung
    Chen, Jiun-You
    Chung, Pei-Chung
    Yu, Shyr-Shen
    Tsui, Ta-Shan
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (5A): : 3299 - 3314
  • [36] Normalization of frequency-domain parameters of heart rate variability in patients with myotonic dystrophy
    Kurtoglu, Ertugrul
    Guven, Taner
    ANNALS OF NONINVASIVE ELECTROCARDIOLOGY, 2019, 24 (03)
  • [37] Normalization Method of Gradient Vector in Frequency Domain Steepest Descent Type Adaptive Algorithm
    Kuwahara, Yusuke
    Iwamatsu, Yusuke
    Fujii, Kensaku
    Muneyasu, Mitsuji
    Morimoto, Masakazu
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (11) : 2041 - 2045
  • [38] SS-Norm: Spectral-spatial normalization for single-domain generalization with application to retinal vessel segmentation
    Liu, Yi-Peng
    Zeng, Dongxu
    Li, Zhanqing
    Chen, Peng
    Liang, Ronghua
    IET IMAGE PROCESSING, 2023, 17 (07) : 2168 - 2181
  • [39] Coherent optical neural networks that have optical-frequency-controlled behavior and generalization ability in the frequency domain
    Hirose, A
    Eckmiller, R
    APPLIED OPTICS, 1996, 35 (05): : 836 - 843
  • [40] EFFECTIVE METRIC-BASED SPEAKER SEGMENTATION IN THE FREQUENCY DOMAIN
    Boehm, Christoph
    Pernkopf, Franz
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4081 - 4084