Misalignment-Robust Frequency Distribution Loss for Image Transformation

被引:0
|
作者
Ni, Zhangkai [1 ]
Wu, Juncheng [1 ]
Wang, Zian [1 ]
Yang, Wenhan [2 ]
Wang, Hanli [1 ]
Ma, Lin [3 ]
机构
[1] Tongji Univ, Shanghai, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Meituan, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52733.2024.00281
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper aims to address a common challenge in deep learning-based image transformation methods, such as image enhancement and super-resolution, which heavily rely on precisely aligned paired datasets with pixel-level alignments. However, creating precisely aligned paired images presents significant challenges and hinders the advancement of methods trained on such data. To overcome this challenge, this paper introduces a novel and simple Frequency Distribution Loss (FDL) for computing distribution distance within the frequency domain. Specifically, we transform image features into the frequency domain using Discrete Fourier Transformation (DFT). Subsequently, frequency components (amplitude and phase) are processed separately to form the FDL loss function. Our method is empirically proven effective as a training constraint due to the thoughtful utilization of global information in the frequency domain. Extensive experimental evaluations, focusing on image enhancement and super-resolution tasks, demonstrate that FDL outperforms existing misalignment-robust loss functions. Furthermore, we explore the potential of our FDL for image style transfer that relies solely on completely misaligned data. Our code is available at: https://github.com/eezkni/FDL
引用
收藏
页码:2910 / 2919
页数:10
相关论文
共 50 条
  • [1] Misalignment-Robust Face Recognition
    Yan, Shuicheng
    Wang, Huan
    Liu, Jianzhuang
    Tang, Xiaoou
    Huang, Thomas S.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (04) : 1087 - 1096
  • [2] Misalignment-robust face recognition
    Wang, Huan
    Yan, Shuicheng
    Huang, Thomas
    Liu, Jianzhuang
    Tang, Xiaoou
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 3591 - +
  • [3] Misalignment-robust, edge-based image fusion method
    Xi, Cai
    Wei, Zhao
    OPTICAL ENGINEERING, 2012, 51 (07)
  • [4] Misalignment-Robust Joint Filter for Cross-Modal Image Pairs
    Shibata, Takashi
    Tanaka, Masayuki
    Okutomi, Masatoshi
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3315 - 3324
  • [5] Misalignment-Robust Pedestrian Detection Framework for Visible and Far-Infrared Image Pairs
    Shibata, Takashi
    Sawada, Azusa
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2019,
  • [6] Biased subspace learning for misalignment-robust facial expression recognition
    Yan, Haibin
    NEUROCOMPUTING, 2016, 208 : 202 - 209
  • [7] Supervised sparse patch coding towards misalignment-robust face recognition
    Lang, Congyan
    Feng, Songhe
    Chen, Bin
    Yuan, Xiaotong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (02) : 103 - 110
  • [8] Efficient misalignment-robust multi-focus microscopical images fusion
    Liang, Yixiong
    Mao, Yuan
    Tang, Zhihong
    Yan, Meng
    Zhao, Yuqian
    Liu, Jianfeng
    SIGNAL PROCESSING, 2019, 161 : 111 - 123
  • [9] Efficient and Misalignment-Robust PT-Symmetric Wireless Power Transfer
    Sakhdari, Maryam
    Chen, Pai-Yen
    2019 IEEE INTERNATIONAL SYMPOSIUM ON ANTENNAS AND PROPAGATION AND USNC-URSI RADIO SCIENCE MEETING, 2019, : 1459 - 1460
  • [10] Cross-modal misalignment-robust feature fusion for crowd counting
    Kong, Weihang
    Yu, Zepeng
    Li, He
    Zhang, Junge
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136