Improving attention mechanisms in transformer architecture in image restoration

被引:0
|
作者
Berezhnov, N. I. [1 ]
Sirota, A. A. [1 ]
机构
[1] Voronezh State Univ, Comp Sci Fac, Informat Secur & Proc Technol Dept, Univ Skaya Sq 1, Voronezh 394018, Russia
关键词
image quality improvement; neural networks; transformer models; attention mechanism;
D O I
10.18287/2412-6179-CO-1393
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We discuss a problem of improving the quality of images obtained under the influence of various kinds of noise and distortion. In this work we solve this problem using transformer neural network models, because they have recently shown high efficiency in computer vision tasks. An attention mechanism of transformer models is investigated and problems associated with the implementation of the existing approaches based on this mechanism are identified. We propose a novel modification of the attention mechanism with the aim of reducing the number of neural network parameters, conducting a comparison of the proposed transformer model with the known ones. Several datasets with natural and generated distortions are considered. For training neural networks, the Edge Loss function is used to preserve the sharpness of images in the process of noise elimination. The influence of the degree of compression of channel information in the proposed attention mechanism on the image restoration quality is investigated. PSNR, SSIM, and FID metrics are used to assess the quality of the restored images and for a comparison with the existing neural network architectures for each of the datasets. It is confirmed that the architecture proposed by the present authors is, at least, not inferior to the known approaches in improving the image quality, while requiring less computing resources. The quality of the improved images is shown to slightly decrease for the naked human eye with an increase in the channel information compression ratio within reasonable limits.
引用
收藏
页码:726 / 733
页数:9
相关论文
共 50 条
  • [21] A Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image Restoration
    Zhang, Ronghui
    Yu, Jiongze
    Chen, Junzhou
    Li, Guofa
    Lin, Liang
    Wang, Danwei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7043 - 7057
  • [22] Pyramid Attention Network for Image Restoration
    Yiqun Mei
    Yuchen Fan
    Yulun Zhang
    Jiahui Yu
    Yuqian Zhou
    Ding Liu
    Yun Fu
    Thomas S. Huang
    Humphrey Shi
    International Journal of Computer Vision, 2023, 131 : 3207 - 3225
  • [23] RT-CBAM: Refined Transformer Combined with Convolutional Block Attention Module for Underwater Image Restoration
    Ye, Renchuan
    Qian, Yuqiang
    Huang, Xinming
    SENSORS, 2024, 24 (18)
  • [24] SYNERGIC FEATURE ATTENTION FOR IMAGE RESTORATION
    Mou, Chong
    Zhang, Jian
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1850 - 1854
  • [25] Pyramid Attention Network for Image Restoration
    Mei, Yiqun
    Fan, Yuchen
    Zhang, Yulun
    Yu, Jiahui
    Zhou, Yuqian
    Liu, Ding
    Fu, Yun
    Huang, Thomas S.
    Shi, Humphrey
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (12) : 3207 - 3225
  • [26] KNN Local Attention for Image Restoration
    Lee, Hunsang
    Choi, Hyesong
    Sohn, Kwanghoon
    Min, Dongbo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2129 - 2139
  • [27] Attention Cube Network for Image Restoration
    Hang, Yucheng
    Liao, Qingmin
    Yang, Wenming
    Chen, Yupeng
    Zhou, Jie
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2562 - 2570
  • [28] SwinIR: Image Restoration Using Swin Transformer
    Liang, Jingyun
    Cao, Jiezhang
    Sun, Guolei
    Zhang, Kai
    Van Gool, Luc
    Timofte, Radu
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1833 - 1844
  • [29] Burstormer: Burst Image Restoration and Enhancement Transformer
    Dudhane, Akshay
    Zamir, Syed Waqas
    Khan, Salman
    Khan, Fahad Shahbaz
    Yang, Ming-Hsuan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5703 - 5712
  • [30] Comprehensive and Delicate: An Efficient Transformer for Image Restoration
    Zhao, Haiyu
    Gou, Yuanbiao
    Li, Boyun
    Peng, Dezhong
    Lv, Jiancheng
    Peng, Xi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14122 - 14132