Understanding How Image Quality Affects Transformer Neural Networks

Author
Varga, Domonkos [1]
Affiliation
[1] Nokia Bell Labs, H-1083 Budapest, Hungary
Source
SIGNALS | 2024 / Volume 5 / Issue 03
Keywords
transformer models; image classification; noise sensitivity; computer vision; RESOLUTION FACE RECOGNITION;
DOI
10.3390/signals5030031
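Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification code
0808; 0809
Abstract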
Deep learning models, particularly transformer architectures, have revolutionized various computer vision tasks, including image classification. However, their performance under different types and levels of noise remains a crucial area of investigation. In this study, we explore the noise sensitivity of prominent transformer models trained on the ImageNet dataset. We systematically evaluate 22 transformer variants, ranging from state-of-the-art large-scale models to compact versions tailored for mobile applications, under five common types of image distortions. Our findings reveal diverse sensitivities across different transformer architectures, with notable variations in performance observed under additive Gaussian noise, multiplicative Gaussian noise, Gaussian blur, salt-and-pepper noise, and JPEG compression. Interestingly, we observe a consistent robustness of transformer models to JPEG compression, with top-5 accuracies exhibiting higher resilience to noise compared to top-1 accuracies. Furthermore, our analysis highlights the vulnerability of mobile-oriented transformer variants to various noise types, underscoring the importance of noise robustness considerations in model design and deployment for real-world applications. These insights contribute to a deeper understanding of transformer model behavior under noisy conditions and have implications for improving the robustness and reliability of deep learning systems in practical scenarios.
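As a rough illustration of the distortions and metrics named in the abstract, the sketch below applies four of the five distortion types to a float image in [0, 1] and computes top-k accuracy from a score matrix. It is not the paper's code: the function names, the noise parameters (`sigma`, `density`), and the random test image are all assumptions made for illustration, and the noise levels used in the study are not given in this record. JPEG compression is left out because it needs an image codec rather than plain array arithmetic.

```python
import numpy as np


def additive_gaussian(img, sigma, rng):
    """Add zero-mean Gaussian noise to an image with values in [0, 1]."""
    return np.clip(img + rng.normal(0.0, sigma, img.shape), 0.0, 1.0)


def multiplicative_gaussian(img, sigma, rng):
    """Scale each pixel by (1 + n), with n drawn from N(0, sigma^2)."""
    return np.clip(img * (1.0 + rng.normal(0.0, sigma, img.shape)), 0.0, 1.0)


def salt_and_pepper(img, density, rng):
    """Set roughly a `density` fraction of pixels to 0 or 1, half each."""
    out = img.copy()
    u = rng.random(img.shape[:2])  # one draw per pixel, shared by channels
    out[u < density / 2] = 0.0
    out[(u >= density / 2) & (u < density)] = 1.0
    return out


def gaussian_blur(img, sigma):
    """Separable Gaussian blur with edge padding; img has shape (H, W, C)."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    kernel /= kernel.sum()
    padded = np.pad(img, ((radius, radius), (radius, radius), (0, 0)), mode="edge")

    def conv(v):
        return np.convolve(v, kernel, mode="valid")

    # Filter along the height axis, then along the width axis.
    tmp = np.apply_along_axis(conv, 0, padded)
    return np.apply_along_axis(conv, 1, tmp)


def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest scores."""
    top_k = np.argsort(scores, axis=1)[:, -k:]
    return float(np.mean(np.any(top_k == labels[:, None], axis=1)))


rng = np.random.default_rng(0)
image = rng.random((32, 32, 3))  # hypothetical stand-in for an ImageNet image

distorted = {
    "additive": additive_gaussian(image, 0.1, rng),
    "multiplicative": multiplicative_gaussian(image, 0.1, rng),
    "salt_and_pepper": salt_and_pepper(image, 0.05, rng),
    "blur": gaussian_blur(image, 1.5),
}

# Toy scores for two samples and three classes (hypothetical values).
scores = np.array([[0.1, 0.7, 0.2], [0.5, 0.3, 0.2]])
labels = np.array([2, 0])
top1 = top_k_accuracy(scores, labels, 1)
top2 = top_k_accuracy(scores, labels, 2)
```

In an evaluation of this kind, each distorted image would be passed through a pretrained classifier and the resulting scores fed to `top_k_accuracy` with k = 1 and k = 5. Since a correct top-1 prediction is always also a correct top-k prediction, top-k accuracy can never fall below top-1 accuracy.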
Pages: 562-579
Page count: 18