Cross-Scale KNN Image Transformer for Image Restoration

被引:3
|
作者
Lee, Hunsang [1 ]
Choi, Hyesong [2 ]
Sohn, Kwanghoon [1 ]
Min, Dongbo [2 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
[2] Ewha Womans Univ, Dept Comp Sci & Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Image restoration; Transformers; Noise reduction; Complexity theory; Computer vision; Convolutional neural networks; Feature extraction; denoising; deblurring; deraining; transformer; self-attention; k-nn search; low-level vision; ALGORITHMS; NETWORK;
D O I
10.1109/ACCESS.2023.3242556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous image restoration approaches have been proposed based on attention mechanism, achieving superior performance to convolutional neural networks (CNNs) based counterparts. However, they do not leverage the attention model in a form fully suited to the image restoration tasks. In this paper, we propose an image restoration network with a novel attention mechanism, called cross-scale $k$ -NN image Transformer (CS-KiT), that effectively considers several factors such as locality, non-locality, and cross-scale aggregation, which are essential to image restoration. To achieve locality and non-locality, the CS-KiT builds $k$ -nearest neighbor relation of local patches and aggregates similar patches through local attention. To induce cross-scale aggregation, we ensure that each local patch embraces different scale information with scale-aware patch embedding (SPE) which predicts an input patch scale through a combination of multi-scale convolution branches. We show the effectiveness of the CS-KiT with experimental results, outperforming state-of-the-art restoration approaches on image denoising, deblurring, and deraining benchmarks.
引用
收藏
页码:13013 / 13027
页数:15
相关论文
共 50 条
  • [41] Image debanding using cross-scale invertible networks with banded deformable convolutions
    Quan, Yuhui
    He, Xuyi
    Xu, Ruotao
    Xu, Yong
    Ji, Hui
    NEURAL NETWORKS, 2025, 187
  • [42] Cross-scale interdependencies require attention in forest restoration
    Wiegant, Daniel
    Guariguata, Manuel R.
    RESTORATION ECOLOGY, 2023, 31 (08)
  • [43] Image registration combining cross-scale point matching and multi-scale feature fusion
    Ou, Zhuolin
    Lu, Xiaoqi
    Gu, Yu
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (08) : 1090 - 1102
  • [44] A cross Transformer for image denoising
    Tian, Chunwei
    Zheng, Menghua
    Zuo, Wangmeng
    Zhang, Shichao
    Zhang, Yanning
    Lin, Chia-Wen
    INFORMATION FUSION, 2024, 102
  • [45] Hyperspectral Image Super-Resolution Network Based on Cross-Scale Nonlocal Attention
    Li, Shuangliang
    Tian, Yugang
    Wang, Cheng
    Wu, Hongxian
    Zheng, Shaolan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [46] Multifocal Attention-Based Cross-Scale Network for Image De-raining
    Zhang, Zheyu
    Zhu, Yurui
    Fu, Xueyang
    Xiong, Zhiwei
    Zha, Zheng-Jun
    Wu, Feng
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3673 - 3681
  • [47] A dual encoder LDCT image denoising model based on cross-scale skip connections☆
    Wang, Lifang
    Wang, Yali
    Ren, Wenjing
    Yu, Jing
    Chang, Xiaoyan
    Guo, Xiaodong
    Hu, Lihua
    NEUROCOMPUTING, 2025, 613
  • [48] VCAFusion: An infrared and visible image fusion network with visual perception and cross-scale attention
    Zhang, Xiaodong
    Wang, Xinrui
    Gao, Shaoshu
    Zhu, Linghan
    Wang, Shuo
    DIGITAL SIGNAL PROCESSING, 2024, 151
  • [49] Image Copy-Move Forgery Detection via Deep Cross-Scale PatchMatch
    He, Yingjie
    Li, Yuanman
    Chen, Changsheng
    Li, Xia
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2327 - 2332
  • [50] Cross-scale cascade transformer for multimodal human action recognition
    Liu, Zhen
    Cheng, Qin
    Song, Chengqun
    Cheng, Jun
    PATTERN RECOGNITION LETTERS, 2023, 168 : 17 - 23