Performance Comparison of Vision Transformer-Based Models in Medical Image Classification

Cited by: 1
Authors
Kanca, Elif [1 ]
Ayas, Selen [2 ]
Kablan, Elif Baykal [1 ]
Ekinci, Murat [2 ]
Affiliations
[1] Karadeniz Tech Univ, Dept of Software Engineering, Trabzon, Turkey
[2] Karadeniz Tech Univ, Dept of Computer Engineering, Trabzon, Turkey
Keywords
Vision transformer-based models; transformers; medical image classification;
DOI
10.1109/SIU59756.2023.10223892
CLC Classification Number
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, convolutional neural networks have shown significant success and are frequently used in medical image analysis applications. However, the convolution operation restricts the learning of long-range pixel dependencies to the local receptive field. Inspired by the success of transformer architectures in encoding long-range dependencies and learning more efficient feature representations in natural language processing, this study classifies publicly available color fundus retina, skin lesion, chest X-ray, and breast histology images using the Vision Transformer (ViT), Data-Efficient Image Transformer (DeiT), Swin Transformer, and Pyramid Vision Transformer v2 (PVTv2) models and compares their classification performance. The results show that the highest accuracy values are obtained with the DeiT model at 96.5% on the chest X-ray dataset, the PVTv2 model at 91.6% on the breast histology dataset, the PVTv2 model at 91.3% on the retina fundus dataset, and the Swin model at 91.0% on the skin lesion dataset.
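The ViT-style pipeline the abstract refers to — splitting an image into patches, encoding the patch sequence with a transformer, and classifying from a learnable [CLS] token — can be outlined in PyTorch. This is a minimal illustrative sketch, not the authors' implementation; the model name, dimensions, and four-class head are assumptions for demonstration only:

```python
import torch
import torch.nn as nn

class MiniViT(nn.Module):
    """Minimal ViT-style classifier (illustrative, not the paper's model):
    patch embedding -> transformer encoder -> linear head on the [CLS] token."""
    def __init__(self, img_size=224, patch=16, dim=128, depth=2, heads=4, num_classes=4):
        super().__init__()
        # A strided convolution turns the image into non-overlapping patch embeddings.
        self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch, stride=patch)
        n_patches = (img_size // patch) ** 2
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, n_patches + 1, dim))
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x):
        x = self.patch_embed(x).flatten(2).transpose(1, 2)   # (B, N, dim)
        cls = self.cls_token.expand(x.size(0), -1, -1)       # prepend [CLS]
        x = torch.cat([cls, x], dim=1) + self.pos_embed      # add positions
        x = self.encoder(x)
        return self.head(x[:, 0])                            # classify from [CLS]

# Two random RGB "images" stand in for a medical-image batch.
logits = MiniViT()(torch.randn(2, 3, 224, 224))
print(logits.shape)  # torch.Size([2, 4])
```

In practice one would fine-tune pretrained weights (e.g. via the `timm` library) on each medical dataset rather than train such a model from scratch, which is what makes data-efficient variants like DeiT attractive for the small datasets common in medical imaging.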
Pages: 4
Related Papers
50 items in total
  • [21] StainSWIN: Vision transformer-based stain normalization for histopathology image analysis
    Kablan, Elif Baykal
    Ayas, Selen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [22] Transformer-Based Innovations in Medical Image Segmentation: A Mini Review
    Shah, Ovais Iqbal
    Rizvi, Danish Raza
    Mir, Aqib Nazir
    SN COMPUTER SCIENCE, 6 (4)
  • [23] Classification and recognition of gesture EEG signals with Transformer-Based models
    Qu, Yan
    Li, Congsheng
    Jiang, Haoyu
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 415 - 418
  • [24] Transformer-Based Composite Language Models for Text Evaluation and Classification
    Skoric, Mihailo
    Utvic, Milos
    Stankovic, Ranka
    MATHEMATICS, 2023, 11 (22)
  • [25] Transformer-based Image Compression
    Lu, Ming
    Guo, Peiyao
    Shi, Huiqing
    Cao, Chuntong
    Ma, Zhan
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 469 - 469
  • [26] Vision Transformer (ViT)-based Applications in Image Classification
    Huo, Yingzi
    Jin, Kai
    Cai, Jiahong
    Xiong, Huixuan
    Pang, Jiacheng
    2023 IEEE 9TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD, BIGDATASECURITY, IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC AND IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS, 2023, : 135 - 140
  • [27] Explaining transformer-based image captioning models: An empirical analysis
    Cornia, Marcella
    Baraldi, Lorenzo
    Cucchiara, Rita
    AI COMMUNICATIONS, 2022, 35 (02) : 111 - 129
  • [28] Incorporating Medical Knowledge to Transformer-based Language Models for Medical Dialogue Generation
    Naseem, Usman
    Bandi, Ajay
    Raza, Shaina
    Rashid, Junaid
    Chakravarthi, Bharathi Raja
    PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 110 - 115
  • [29] Transformer-Based Masked Autoencoder With Contrastive Loss for Hyperspectral Image Classification
    Cao, Xianghai
    Lin, Haifeng
    Guo, Shuaixu
    Xiong, Tao
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [30] Vision Transformer-based Classification for Lung and Colon Cancer using Histopathology Images
    Hasan, Munjur
    Rahman, Md Saifur
    Islam, Sabrina
    Ahmed, Tanvir
    Rifat, Nafiz
    Ahsan, Mostofa
    Gomes, Rahul
    Chowdhury, Md.
    PROCEEDINGS - 22ND IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA 2023, 2023, : 1300 - 1304