共 34 条
Frequency-Oriented Efficient Transformer for All-in-One Weather-Degraded Image Restoration
被引:19
|作者:
Gao, Tao
[1
]
Wen, Yuanbo
[1
]
Zhang, Kaihao
[2
]
Zhang, Jing
[2
]
Chen, Ting
[1
]
Liu, Lidong
[1
]
Luo, Wenhan
[3
]
机构:
[1] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
[2] Australian Natl Univ, Sch Comp, Canberra, ACT 2601, Australia
[3] Sun Yat sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Guangdong, Peoples R China
关键词:
Computer vision;
image restoration;
adverse weather removal;
frequency-oriented transformer;
RAINDROP REMOVAL;
NETWORK;
MODEL;
D O I:
10.1109/TCSVT.2023.3299324
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
Adverse weather conditions, such as rain, raindrop, snow and haze, consistently degrade images in an unpredictable manner, thereby rendering existing task-specific and task-aligned methods inadequate in addressing this formidable problem. To this end, we investigate the application of Transformer in image restoration and introduce an efficient frequency-oriented method called AIRFormer, which is designed to restore weather-degraded images comprehensively and holistically. Specifically, we identify that the initial self-attention mechanism exhibits distinctive properties akin to a low-pass filter. Therefore, we construct a frequency-guided Transformer encoder by incorporating wavelet-based prior information to guide the extraction of image features. Additionally, considering the non-specific frequency characteristics of self-attention in the later stages, we develop a frequency-refined Transformer decoder that incorporates learnable task-specific queries across spatial dimensions, channel dimensions, and wavelet domains. To facilitate the training of our proposed method, we curate a comprehensive benchmark dataset named AIR40K that, encompasses a wide range of challenging scenarios. Extensive experimental evaluations demonstrate the superiority of our AIRFormer over both task-aligned and all-in-one methods across 15 publicly available datasets. Notably, AIRFormer achieves the best trade-off between the inference time and quality of reconstructed image, comparing with existing methods such as TransWeather and Restormer. The source code, dataset and pre-trained models will be available at https://github.com/chdwyb/AIRFormer.
引用
收藏
页码:1886 / 1899
页数:14
相关论文