Frequency-Oriented Efficient Transformer for All-in-One Weather-Degraded Image Restoration

被引：19

作者：

Gao, Tao ^{[1
]}

Wen, Yuanbo ^{[1
]}

Zhang, Kaihao ^{[2
]}

Zhang, Jing ^{[2
]}

Chen, Ting ^{[1
]}

Liu, Lidong ^{[1
]}

Luo, Wenhan ^{[3
]}

机构：

[1] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China

[2] Australian Natl Univ, Sch Comp, Canberra, ACT 2601, Australia

[3] Sun Yat sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Guangdong, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 03期

关键词：

Computer vision; image restoration; adverse weather removal; frequency-oriented transformer; RAINDROP REMOVAL; NETWORK; MODEL;

D O I：

10.1109/TCSVT.2023.3299324

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Adverse weather conditions, such as rain, raindrop, snow and haze, consistently degrade images in an unpredictable manner, thereby rendering existing task-specific and task-aligned methods inadequate in addressing this formidable problem. To this end, we investigate the application of Transformer in image restoration and introduce an efficient frequency-oriented method called AIRFormer, which is designed to restore weather-degraded images comprehensively and holistically. Specifically, we identify that the initial self-attention mechanism exhibits distinctive properties akin to a low-pass filter. Therefore, we construct a frequency-guided Transformer encoder by incorporating wavelet-based prior information to guide the extraction of image features. Additionally, considering the non-specific frequency characteristics of self-attention in the later stages, we develop a frequency-refined Transformer decoder that incorporates learnable task-specific queries across spatial dimensions, channel dimensions, and wavelet domains. To facilitate the training of our proposed method, we curate a comprehensive benchmark dataset named AIR40K that, encompasses a wide range of challenging scenarios. Extensive experimental evaluations demonstrate the superiority of our AIRFormer over both task-aligned and all-in-one methods across 15 publicly available datasets. Notably, AIRFormer achieves the best trade-off between the inference time and quality of reconstructed image, comparing with existing methods such as TransWeather and Restormer. The source code, dataset and pre-trained models will be available at https://github.com/chdwyb/AIRFormer.

引用

页码：1886 / 1899

页数：14

共 34 条

[31] RDM-IR: Task-adaptive deep unfolding network for All-In-One image restoration
Cheng, Yuanshuo
Shao, Mingwen
Wan, Yecong
Wang, Chao
KNOWLEDGE-BASED SYSTEMS, 2024, 304
[32] Multi-task Learning-based All-in-one Collaboration Framework for Degraded Image Super-resolution
Jin, Xin
Xu, Jianfeng
Tasaka, Kazuyuki
Chen, Zhibo
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
[33] MACGAN: An All-in-One Image Restoration Under Adverse Conditions Using Multidomain Attention-Based Conditional GAN
Siddiqua, Maria
Brahim Belhaouari, Samir
Akhter, Naeem
Zameer, Aneela
Khurshid, Javaid
IEEE ACCESS, 2023, 11 : 70482 - 70502
[34] Ada4DIR: An adaptive model-driven all-in-one image restoration network for remote sensing images
Lihe, Ziyang
Yuan, Qiangqiang
He, Jiang
Jin, Xianyu
Xiao, Yi
Chen, Yuzeng
Shen, Huanfeng
Zhang, Liangpei
INFORMATION FUSION, 2025, 118

← 1 2 3 4 →