An efficient multi-scale transformer for satellite image dehazing

Cited by: 2
Authors
Yang, Lei [1 ,2 ,6 ]
Cao, Jianzhong [1 ,2 ]
Chen, Weining [1 ]
Wang, Hao [1 ]
He, Lang [3 ,4 ,5 ]
Affiliations
[1] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, 618 Changan West St, Xian 710121, Peoples R China
[4] Xian Univ Posts & Telecommun, Shaanxi Key Lab Network Data Anal & Intelligent Pr, Xian, Peoples R China
[5] Xian Univ Posts & Telecommun, Xian Key Lab Big Data & Intelligent Comp, Xian, Peoples R China
[6] Xian Inst Opt & Precis Mech, Sch Comp Sci & Technol, 17 Xinxi Rd,New Ind Pk, Xian 710119, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
remote sensing image; satellite image dehazing; transformer;
DOI
10.1111/exsy.13575
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional neural networks (CNNs) learn image priors effectively from large datasets and have therefore been widely used for image restoration. Recently, there has been significant progress in another class of neural architectures: Transformers. These models have demonstrated remarkable performance in natural language tasks and high-level vision applications. Although they address some of CNNs' limitations, such as restricted receptive fields and limited adaptability to input content, Transformer models often struggle with high-resolution images, because their computational complexity grows rapidly with the image's spatial resolution. As a result, applying them to most high-resolution image restoration tasks is impractical. In this work, we introduce a novel Transformer model, named DehFormer, that modifies the design of the fundamental Transformer components, namely the multi-head attention and the feed-forward network. Specifically, the proposed architecture consists of three modules: (a) the multi-scale feature aggregation network (MSFAN), (b) the gated-Dconv feed-forward network (GFFN), and (c) the multi-Dconv head transposed attention (MDHTA). In the MDHTA module, we revisit scaled dot-product attention using element-wise product operations, which avoids matrix multiplications and operates directly in the frequency domain for greater efficiency. The GFFN module allows only relevant and valuable information to advance through the network hierarchy, which improves the efficiency of information flow within the model. Extensive experiments on the SateHaze1k, RS-Haze, and RSID datasets show that DehFormer significantly outperforms existing methods.
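The two mechanisms named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it works on short 1-D lists rather than image feature maps, and every function name below is hypothetical. It assumes only the general ideas stated in the abstract, namely that an element-wise product in the frequency domain can stand in for a spatial-domain correlation (the convolution theorem), and that a gate is an element-wise product between an activated branch and a linear branch.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform of a 1-D sequence."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def circular_correlation_direct(q, k):
    """Spatial-domain reference: c[s] = sum_t q[(t + s) % n] * k[t]."""
    n = len(q)
    return [sum(q[(t + s) % n] * k[t] for t in range(n)) for s in range(n)]


def circular_correlation_freq(q, k):
    """Same quantity (for real-valued k) via one element-wise product
    in the frequency domain: IDFT(DFT(q) * conj(DFT(k)))."""
    q_hat, k_hat = dft(q), dft(k)
    prod = [a * b.conjugate() for a, b in zip(q_hat, k_hat)]
    return [v.real for v in dft(prod, inverse=True)]


def gelu(v):
    """Exact GELU activation, written with the error function."""
    return 0.5 * v * (1.0 + math.erf(v / math.sqrt(2.0)))


def gated_product(a, b):
    """Gating (assumed form): the activated branch `a` scales branch `b`
    element by element, so values with a near-zero gate are suppressed."""
    return [gelu(x) * y for x, y in zip(a, b)]


if __name__ == "__main__":
    q = [1.0, 2.0, 3.0, 4.0]
    k = [0.5, -1.0, 0.0, 2.0]
    print(circular_correlation_direct(q, k))
    print(circular_correlation_freq(q, k))
    print(gated_product([0.0, 3.0], [5.0, 5.0]))
```

The naive transform is used only to keep the sketch dependency-free. A practical model would use a fast Fourier transform, which is where the efficiency gain over explicit matrix multiplication comes from.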
Pages: 13