Understanding How Image Quality Affects Transformer Neural Networks

Cited by: 0

Authors:

Varga, Domonkos [1]

Affiliations:

[1] Nokia Bell Labs, H-1083 Budapest, Hungary

Source:

SIGNALS | 2024 / Volume 5 / Issue 3

Keywords:

transformer models; image classification; noise sensitivity; computer vision; RESOLUTION FACE RECOGNITION

DOI:

10.3390/signals5030031

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Deep learning models, particularly transformer architectures, have revolutionized various computer vision tasks, including image classification. However, their performance under different types and levels of noise remains a crucial area of investigation. In this study, we explore the noise sensitivity of prominent transformer models trained on the ImageNet dataset. We systematically evaluate 22 transformer variants, ranging from state-of-the-art large-scale models to compact versions tailored for mobile applications, under five common types of image distortions. Our findings reveal diverse sensitivities across different transformer architectures, with notable variations in performance observed under additive Gaussian noise, multiplicative Gaussian noise, Gaussian blur, salt-and-pepper noise, and JPEG compression. Interestingly, we observe a consistent robustness of transformer models to JPEG compression, with top-5 accuracies exhibiting higher resilience to noise compared to top-1 accuracies. Furthermore, our analysis highlights the vulnerability of mobile-oriented transformer variants to various noise types, underscoring the importance of noise robustness considerations in model design and deployment for real-world applications. These insights contribute to a deeper understanding of transformer model behavior under noisy conditions and have implications for improving the robustness and reliability of deep learning systems in practical scenarios.
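As an illustration of the distortions and metrics named in the abstract, the following is a minimal NumPy sketch of three of the five noise types (additive Gaussian, multiplicative Gaussian, salt-and-pepper) and of top-k accuracy. It is not the author's implementation: the function names, the [0, 1] float image convention, and the parameters `sigma` and `density` are assumptions made for this example, and the abstract does not state the noise levels used in the study. Gaussian blur and JPEG compression are omitted because they would need an image-processing library.

```python
import numpy as np


def additive_gaussian(img, sigma, rng):
    """Add zero-mean Gaussian noise; img is a float array in [0, 1] (assumed)."""
    noisy = img + rng.normal(0.0, sigma, size=img.shape)
    return np.clip(noisy, 0.0, 1.0)


def multiplicative_gaussian(img, sigma, rng):
    """Scale each pixel by (1 + n) with n ~ N(0, sigma^2)."""
    noisy = img * (1.0 + rng.normal(0.0, sigma, size=img.shape))
    return np.clip(noisy, 0.0, 1.0)


def salt_and_pepper(img, density, rng):
    """Set roughly `density` of the pixel locations to 0 or 1, half each."""
    noisy = img.copy()
    u = rng.random(img.shape[:2])                     # one draw per pixel location
    noisy[u < density / 2] = 0.0                      # pepper
    noisy[(u >= density / 2) & (u < density)] = 1.0   # salt
    return noisy


def top_k_accuracy(logits, labels, k):
    """Fraction of samples whose true label is among the k highest scores."""
    top_k = np.argsort(logits, axis=1)[:, -k:]
    hits = (top_k == labels[:, None]).any(axis=1)
    return float(hits.mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((32, 32, 3))                     # stand-in for a real image
    for name, out in [
        ("additive", additive_gaussian(img, 0.1, rng)),
        ("multiplicative", multiplicative_gaussian(img, 0.1, rng)),
        ("salt-and-pepper", salt_and_pepper(img, 0.05, rng)),
    ]:
        print(name, float(np.abs(out - img).mean()))
```

Because the top-k set always contains the top-1 prediction, top-5 accuracy can never fall below top-1 accuracy on the same predictions, which is consistent with the abstract's observation that top-5 accuracy degrades more slowly under noise.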

Pages: 562-579

Page count: 18


