共 50 条
Classification of Brain Tumor from Magnetic Resonance Imaging Using Vision Transformers Ensembling
被引:85
|作者:
Tummala, Sudhakar
[1
]
Kadry, Seifedine
[2
,3
,4
]
Bukhari, Syed Ahmad Chan
[5
]
Rauf, Hafiz Tayyab
[6
]
机构:
[1] SRM Univ AP, Sch Engn & Sci, Dept Elect & Commun Engn, Amaravati 522503, India
[2] Noroff Univ Coll, Dept Appl Data Sci, N-4612 Kristiansand, Norway
[3] Lebanese Amer Univ, Dept Elect & Comp Engn, POB 36, Byblos, Lebanon
[4] Ajman Univ, Coll Engn & Informat Technol, Artificial Intelligence Res Ctr AIRC, Ajman 346, U Arab Emirates
[5] St Johns Univ, Div Comp Sci Math & Sci, Collins Coll Profess Studies, New York, NY 11439 USA
[6] Staffordshire Univ, Ctr Smart Syst AI & Cybersecur, Stoke On Trent ST4 2DE, Staffs, England
关键词:
brain tumor;
MRI;
diagnosis;
vision transformer;
D O I:
10.3390/curroncol29100590
中图分类号:
R73 [肿瘤学];
学科分类号:
100214 ;
摘要:
The automated classification of brain tumors plays an important role in supporting radiologists in decision making. Recently, vision transformer (ViT)-based deep neural network architectures have gained attention in the computer vision research domain owing to the tremendous success of transformer models in natural language processing. Hence, in this study, the ability of an ensemble of standard ViT models for the diagnosis of brain tumors from T1-weighted (T1w) magnetic resonance imaging (MRI) is investigated. Pretrained and finetuned ViT models (B/16, B/32, L/16, and L/32) on ImageNet were adopted for the classification task. A brain tumor dataset from figshare, consisting of 3064 T1w contrast-enhanced (CE) MRI slices with meningiomas, gliomas, and pituitary tumors, was used for the cross-validation and testing of the ensemble ViT model's ability to perform a three-class classification task. The best individual model was L/32, with an overall test accuracy of 98.2% at 384 x 384 resolution. The ensemble of all four ViT models demonstrated an overall testing accuracy of 98.7% at the same resolution, outperforming individual model's ability at both resolutions and their ensembling at 224 x 224 resolution. In conclusion, an ensemble of ViT models could be deployed for the computer-aided diagnosis of brain tumors based on T1w CE MRI, leading to radiologist relief.
引用
收藏
页码:7498 / 7511
页数:14
相关论文