Exploring vision transformers and XGBoost as deep learning ensembles for transforming carcinoma recognition

Cited by: 0
Authors
Raju, Akella Subrahmanya Narasimha [1]
Venkatesh, K. [2]
Padmaja, B. [3]
Kumar, CH. N. Santhosh [4]
Patnala, Pattabhi Rama Mohan [5]
Lasisi, Ayodele [6]
Islam, Saiful [7]
Razak, Abdul [8]
Khan, Wahaj Ahmad [9]
Affiliations
[1] Inst Aeronaut Engn, Dept Comp Sci & Engn Data Sci, Hyderabad 500043, Telangana, India
[2] SRM Inst Sci & Technol, Sch Comp, Dept Networking & Commun, Chennai 603203, Tamilnadu, India
[3] Inst Aeronaut Engn, Dept Comp Sci & Engn, AI&ML, Hyderabad 500043, India
[4] Anurag Engn Coll, Dept Comp Sci & Engn, Kodada 508206, Telangana, India
[5] Aditya Univ, Dept Comp Applicat, Surampalem 533437, Andhra Pradesh, India
[6] King Khalid Univ, Coll Comp Sci, Dept Comp Sci, Abha, Saudi Arabia
[7] King Khalid Univ, Coll Engn, Civil Engn Dept, Abha 61421, Saudi Arabia
[8] Visvesvaraya Technol Univ Belagavi, PA Coll Engn, Dept Mech Engn, Mangaluru, India
[9] Dire Dawa Univ, Sch Civil Engn & Architecture, Inst Technol, Dire Dawa 1362, Ethiopia
Source
SCIENTIFIC REPORTS | 2024 / Vol. 14 / Issue 01
Keywords
Colorectal Carcinoma (CRC); Integrated CNNs; Vision Transformers; XGBoost; Ensemble models; CKHK-22 dataset
DOI
10.1038/s41598-024-81456-1
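Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code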
07 ; 0710 ; 09 ;
Abstract
Early detection of colorectal carcinoma (CRC), one of the most prevalent forms of cancer worldwide, significantly improves patient prognosis. This research presents a new method for improving CRC detection using a deep learning ensemble for computer-aided diagnosis (CADx). The method combines pre-trained convolutional neural network (CNN) models, such as ADaRDEV2I-22, DaRD-22, and ADaDR-22, with Vision Transformers (ViT) and XGBoost. The study addresses the challenges associated with imbalanced datasets and the need for sophisticated feature extraction in medical image analysis. Initially, the CKHK-22 dataset comprised 24 classes. However, we refined it to 14 classes, which improved data balance and quality. This improvement enabled more precise feature extraction and better classification results. We created two ensemble models: the first used Vision Transformers to capture long-range spatial relationships in the images, while the second combined CNNs with XGBoost to facilitate structured data classification. We implemented DCGAN-based augmentation to enhance the dataset's diversity. Testing showed substantial performance improvements: the ADaDR-22 + Vision Transformer ensemble achieved the best results, with a testing accuracy of 93.4% and an AUC of 98.8%. By comparison, the ADaDR-22 + XGBoost model reached an accuracy of 92.2% and an AUC of 97.8%. These findings demonstrate the efficacy of the proposed ensemble models in detecting CRC and underscore the importance of using well-balanced, high-quality datasets. The proposed method significantly enhances clinical diagnostic accuracy and the capabilities of medical image analysis for early CRC detection.
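The abstract reports two headline metrics, testing accuracy and AUC, for a 14-class problem, but does not say how AUC was aggregated across classes. As a minimal sketch, assuming macro-averaged one-vs-rest AUC (an assumption, not confirmed by the record), the two metrics could be computed as follows. The function names and the three-class toy data are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random
    negative, counting ties as one half."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def macro_ovr_auc(y_true: Sequence[int], probs: Sequence[Sequence[float]]) -> float:
    """Macro-averaged one-vs-rest AUC (assumed aggregation scheme)."""
    classes = sorted(set(y_true))
    per_class: List[float] = []
    for c in classes:
        labels = [1 if t == c else 0 for t in y_true]
        scores = [row[c] for row in probs]
        per_class.append(binary_auc(labels, scores))
    return sum(per_class) / len(per_class)


# Hypothetical toy data: 6 samples, 3 classes, rows are class probabilities.
y_true = [0, 0, 1, 1, 2, 2]
probs = [
    [0.7, 0.2, 0.1],
    [0.4, 0.5, 0.1],
    [0.2, 0.6, 0.2],
    [0.3, 0.4, 0.3],
    [0.1, 0.2, 0.7],
    [0.2, 0.3, 0.5],
]
# Predicted label = index of the largest probability in each row.
y_pred = [max(range(len(row)), key=row.__getitem__) for row in probs]

toy_accuracy = accuracy(y_true, y_pred)
toy_auc = macro_ovr_auc(y_true, probs)
print(f"accuracy={toy_accuracy:.3f}, macro OvR AUC={toy_auc:.3f}")
```

Accuracy depends only on the top-ranked class per sample, whereas AUC depends on how well the scores rank positives above negatives, which is one reason a model can show a noticeably higher AUC than accuracy, as in the figures quoted above.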
Pages: 35
Related papers
50 in total
  • [1] Ensembles of Deep Learning Models and Transfer Learning for Ear Recognition
    Alshazly, Hammam
    Linse, Christoph
    Barth, Erhardt
    Martinetz, Thomas
    SENSORS, 2019, 19 (19)
  • [2] Exploring Self-Supervised Vision Transformers for Gait Recognition in the Wild
    Cosma, Adrian
    Catruna, Andy
    Radoi, Emilian
    SENSORS, 2023, 23 (05)
  • [3] Deep learning ensembles for melanoma recognition in dermoscopy images
    Codella, N. C. F.
    Nguyen, Q. -B.
    Pankanti, S.
    Gutman, D. A.
    Helba, B.
    Halpern, A. C.
    Smith, J. R.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2017, 61 (4-5)
  • [4] Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy
    Rozendo, Guilherme Botazzo
    Roberto, Guilherme Freire
    Zanchetta do Nascimento, Marcelo
    Neves, Leandro Alves
    Lumini, Alessandra
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 229 - 243
  • [5] Transforming Challenges: Siamese-Based Vision Transformers for Robust Occluded Face Recognition
    Ouannes, Laila
    Ben Khalifa, Anouar
    Ben Amara, Najoua Essoukri
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PT II, 2024, 2166 : 260 - 272
  • [6] Exploring the Potential of Ensembles of Deep Learning Networks for Image Segmentation
    Nanni, Loris
    Lumini, Alessandra
    Fantozzi, Carlo
    INFORMATION, 2023, 14 (12)
  • [7] Enhancing Computer Vision Performance: A Hybrid Deep Learning Approach with CNNs and Vision Transformers
    Sardar, Abha Singh
    Ranjan, Vivek
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 591 - 602
  • [8] Vision Transformers and Transfer Learning Approaches for Arabic Sign Language Recognition
    Alharthi, Nojood M.
    Alzahrani, Salha M.
    APPLIED SCIENCES-BASEL, 2023, 13 (21)
  • [9] Object Detection Using Deep Learning, CNNs and Vision Transformers: A Review
    Amjoud, Ayoub Benali
    Amrouch, Mustapha
    IEEE ACCESS, 2023, 11 : 35479 - 35516
  • [10] Medicinal Plant Leaf Classification using Deep Learning and Vision Transformers
    Hossain, Shahriar
    Hasan, Rizbanul
    Uddin, Jia
    BAGHDAD SCIENCE JOURNAL, 2025, 22 (03)