Focal Frequency Loss for Image Reconstruction and Synthesis

被引:158
|
作者
Jiang, Liming [1 ]
Dai, Bo [1 ]
Wu, Wayne [2 ]
Loy, Chen Change [1 ]
机构
[1] Nanyang Technol Univ, S Lab, Singapore, Singapore
[2] SenseTime Res, Hong Kong, Peoples R China
关键词
D O I
10.1109/ICCV48922.2021.01366
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image reconstruction and synthesis have witnessed remarkable progress thanks to the development of generative models. Nonetheless, gaps could still exist between the real and generated images, especially in the frequency domain. In this study, we show that narrowing gaps in the frequency domain can ameliorate image reconstruction and synthesis quality further. We propose a novel focal frequency loss, which allows a model to adaptively focus on frequency components that are hard to synthesize by down-weighting the easy ones. This objective function is complementary to existing spatial losses, offering great impedance against the loss of important frequency information due to the inherent bias of neural networks. We demonstrate the versatility and effectiveness of focal frequency loss to improve popular models, such as VAE, pix2pix, and SPADE, in both perceptual quality and quantitative performance. We further show its potential on StyleGAN2.1,
引用
收藏
页码:13899 / 13909
页数:11
相关论文
共 50 条
  • [31] PANORAMIC IMAGE INPAINTING WITH GATED CONVOLUTION AND CONTEXTUAL RECONSTRUCTION LOSS
    Yu, Li
    Gao, Yanjun
    Pakdaman, Farhad
    Gabbouj, Moncef
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4255 - 4259
  • [32] Photovoltaic Output Loss Forecasting Based on Image Correction and Reconstruction
    Wu, Yunyi
    Wang, Sen
    Sun, Yonghui
    Zhang, Wenjie
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2024, 48 (20): : 130 - 139
  • [33] Unsupervised image-to-image translation using intra-domain reconstruction loss
    Yuan Fan
    Mingwen Shao
    Wangmeng Zuo
    Qingyun Li
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 2077 - 2088
  • [34] Unsupervised image-to-image translation using intra-domain reconstruction loss
    Fan, Yuan
    Shao, Mingwen
    Zuo, Wangmeng
    Li, Qingyun
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2077 - 2088
  • [35] Topology-Aware Focal Loss for 3D Image Segmentation
    Demir, Andac
    Massaad, Elie
    Kiziltan, Bulent
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 580 - 589
  • [36] Multiple frequency image reconstruction using diffusing sources.
    Ntziachristos, V
    Schnall, M
    Chance, B
    PROCEEDINGS OF THE IEEE 24TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 1998, : 38 - 40
  • [37] Image reconstruction method based on CCD calibration in frequency domain
    Xiong, Sheng-Jun
    Bin Xiangli
    He, Yang
    Zhang, Ze
    APPLIED OPTICS, 2015, 54 (14) : 4561 - 4565
  • [38] SPATIAL-FREQUENCY FILTERING IN HOLOGRAPHIC IMAGE-RECONSTRUCTION
    BOLOGNINI, N
    ARIZMENDI, L
    SOLYMAR, L
    APPLIED OPTICS, 1995, 34 (02): : 243 - 248
  • [39] Terahertz Super-Resolution Image Reconstruction by Frequency Mapping
    Zhu, Ting
    Fang, Guangyou
    Pickwell-MacPherson, Emma
    Chen, Xuequan
    2024 49TH INTERNATIONAL CONFERENCE ON INFRARED, MILLIMETER, AND TERAHERTZ WAVES, IRMMW-THZ 2024, 2024,
  • [40] Consideration on image reconstruction of backprojection filtering in the spatial frequency domain
    Suzuki, S
    Wakabayashi, M
    MEDICAL PHYSICS, 1996, 23 (09) : 1643 - 1645