Frequency-Oriented Efficient Transformer for All-in-One Weather-Degraded Image Restoration

被引:19
|
作者
Gao, Tao [1 ]
Wen, Yuanbo [1 ]
Zhang, Kaihao [2 ]
Zhang, Jing [2 ]
Chen, Ting [1 ]
Liu, Lidong [1 ]
Luo, Wenhan [3 ]
机构
[1] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
[2] Australian Natl Univ, Sch Comp, Canberra, ACT 2601, Australia
[3] Sun Yat sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Guangdong, Peoples R China
关键词
Computer vision; image restoration; adverse weather removal; frequency-oriented transformer; RAINDROP REMOVAL; NETWORK; MODEL;
D O I
10.1109/TCSVT.2023.3299324
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Adverse weather conditions, such as rain, raindrop, snow and haze, consistently degrade images in an unpredictable manner, thereby rendering existing task-specific and task-aligned methods inadequate in addressing this formidable problem. To this end, we investigate the application of Transformer in image restoration and introduce an efficient frequency-oriented method called AIRFormer, which is designed to restore weather-degraded images comprehensively and holistically. Specifically, we identify that the initial self-attention mechanism exhibits distinctive properties akin to a low-pass filter. Therefore, we construct a frequency-guided Transformer encoder by incorporating wavelet-based prior information to guide the extraction of image features. Additionally, considering the non-specific frequency characteristics of self-attention in the later stages, we develop a frequency-refined Transformer decoder that incorporates learnable task-specific queries across spatial dimensions, channel dimensions, and wavelet domains. To facilitate the training of our proposed method, we curate a comprehensive benchmark dataset named AIR40K that, encompasses a wide range of challenging scenarios. Extensive experimental evaluations demonstrate the superiority of our AIRFormer over both task-aligned and all-in-one methods across 15 publicly available datasets. Notably, AIRFormer achieves the best trade-off between the inference time and quality of reconstructed image, comparing with existing methods such as TransWeather and Restormer. The source code, dataset and pre-trained models will be available at https://github.com/chdwyb/AIRFormer.
引用
收藏
页码:1886 / 1899
页数:14
相关论文
共 34 条
  • [31] RDM-IR: Task-adaptive deep unfolding network for All-In-One image restoration
    Cheng, Yuanshuo
    Shao, Mingwen
    Wan, Yecong
    Wang, Chao
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [32] Multi-task Learning-based All-in-one Collaboration Framework for Degraded Image Super-resolution
    Jin, Xin
    Xu, Jianfeng
    Tasaka, Kazuyuki
    Chen, Zhibo
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [33] MACGAN: An All-in-One Image Restoration Under Adverse Conditions Using Multidomain Attention-Based Conditional GAN
    Siddiqua, Maria
    Brahim Belhaouari, Samir
    Akhter, Naeem
    Zameer, Aneela
    Khurshid, Javaid
    IEEE ACCESS, 2023, 11 : 70482 - 70502
  • [34] Ada4DIR: An adaptive model-driven all-in-one image restoration network for remote sensing images
    Lihe, Ziyang
    Yuan, Qiangqiang
    He, Jiang
    Jin, Xianyu
    Xiao, Yi
    Chen, Yuzeng
    Shen, Huanfeng
    Zhang, Liangpei
    INFORMATION FUSION, 2025, 118