Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System

被引:12
|
作者
Himel, Galib Muhammad Shahriar [1 ]
Islam, Md. Masudul [1 ]
Al-Aff, Kh. Abdullah [2 ]
Karim, Shams Ibne [3 ]
Sikder, Md. Kabir Uddin [1 ]
机构
[1] Jahangirnagar Univ, Dhaka, Bangladesh
[2] Bangladesh Univ Hlth Sci, Dhaka, Bangladesh
[3] Bangabandhu Sheikh Mujib Med Univ, Dhaka, Bangladesh
关键词
Automatic analysis - Cancer classification - Digital system - Google+ - Health concerns - Images classification - Learning architectures - Learning models - Performance - Skin cancers;
D O I
10.1155/2024/3022192
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Skin cancer is a significant health concern worldwide, and early and accurate diagnosis plays a crucial role in improving patient outcomes. In recent years, deep learning models have shown remarkable success in various computer vision tasks, including image classification. In this research study, we introduce an approach for skin cancer classification using vision transformer, a state-of-the-art deep learning architecture that has demonstrated exceptional performance in diverse image analysis tasks. The study utilizes the HAM10000 dataset; a publicly available dataset comprising 10,015 skin lesion images classified into two categories: benign (6705 images) and malignant (3310 images). This dataset consists of high-resolution images captured using dermatoscopes and carefully annotated by expert dermatologists. Preprocessing techniques, such as normalization and augmentation, are applied to enhance the robustness and generalization of the model. The vision transformer architecture is adapted to the skin cancer classification task. The model leverages the self-attention mechanism to capture intricate spatial dependencies and long-range dependencies within the images, enabling it to effectively learn relevant features for accurate classification. Segment Anything Model (SAM) is employed to segment the cancerous areas from the images; achieving an IOU of 96.01% and Dice coefficient of 98.14% and then various pretrained models are used for classification using vision transformer architecture. Extensive experiments and evaluations are conducted to assess the performance of our approach. The results demonstrate the superiority of the vision transformer model over traditional deep learning architectures in skin cancer classification in general with some exceptions. Upon experimenting on six different models, ViT-Google, ViT-MAE, ViT-ResNet50, ViT-VAN, ViT-BEiT, and ViT-DiT, we found out that the ML approach achieves 96.15% accuracy using Google's ViT patch-32 model with a low false negative ratio on the test dataset, showcasing its potential as an effective tool for aiding dermatologists in the diagnosis of skin cancer.
引用
收藏
页数:18
相关论文
共 39 条
  • [1] Vision transformer and CNN-based skin lesion analysis: classification of monkeypox
    Yolcu Oztel G.
    Multimedia Tools and Applications, 2024, 83 (28) : 71909 - 71923
  • [2] An efficient multi-class classification of skin cancer using optimized vision transformer
    R. P. Desale
    P. S. Patil
    Medical & Biological Engineering & Computing, 2024, 62 : 773 - 789
  • [3] An efficient multi-class classification of skin cancer using optimized vision transformer
    Desale, R. P.
    Patil, P. S.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (03) : 773 - 789
  • [4] A novel Deeplabv3+and vision-based transformer model for segmentation and classification of skin lesions
    Ahmad, Iqra
    Amin, Javaria
    Lali, Muhammad IkramUllah
    Abbas, Farhat
    Sharif, Muhammad Imran
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [5] Automatic cervical cancer classification using adaptive vision transformer encoder with CNN for medical application
    Nirmala, G.
    Nayudu, P. Prathap
    Kumar, A. Ranjith
    Sagar, Renuka
    PATTERN RECOGNITION, 2025, 160
  • [6] Classification of Mobile-Based Oral Cancer Images Using the Vision Transformer and the Swin Transformer
    Song, Bofan
    Raj, Dharma K. C.
    Yang, Rubin Yuchan
    Li, Shaobai
    Zhang, Chicheng
    Liang, Rongguang
    CANCERS, 2024, 16 (05)
  • [7] Automatic Vision Based Classification System Using DNN and SVM Classifiers
    Durairajah, Vickneswari
    Gobee, Suresh
    Muneer, Amgad
    2018 3RD INTERNATIONAL CONFERENCE ON CONTROL, ROBOTICS AND CYBERNETICS (CRC), 2018, : 6 - 14
  • [8] Digital Image Based Segmentation and Classification of Tongue Cancer Using CNN
    Pallavi Pahadiya
    Ritu Vijay
    Kumod Kumar Gupta
    Shivani Saxena
    Tushar Shahapurkar
    Wireless Personal Communications, 2023, 132 : 609 - 627
  • [9] Digital Image Based Segmentation and Classification of Tongue Cancer Using CNN
    Pahadiya, Pallavi
    Vijay, Ritu
    Gupta, Kumod Kumar
    Saxena, Shivani
    Shahapurkar, Tushar
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (01) : 609 - 627
  • [10] Asymmetry analysis using automatic segmentation and classification for breast cancer detection in thermograms
    Qi, HR
    Head, JF
    PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 2866 - 2869