Comparative Analysis of Deep Learning Models for Breast Cancer Classification on Multimodal Data

被引:0
|
作者
Hussain, Sadam [1 ]
Ali, Mansoor [1 ]
Ali Pirzado, Farman [1 ]
Ahmed, Masroor [1 ]
Gerardo Tamez-Pena, Jose [2 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Monterrey, Nuevo Leon, Mexico
[2] Tecnol Monterrey, Sch Med & Hlth Sci, Monterrey, Nuevo Leon, Mexico
关键词
Breast Cancer; Feature Fusion; Multi-modal Classification; Deep Learning; Vision Transformer; COMPUTER-AIDED DETECTION; MAMMOGRAMS; DIAGNOSIS;
D O I
10.1145/3689096.3689462
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rising breast cancer incidence and mortality represent significant global challenges for women. Deep learning has demonstrated superior diagnostic performance in breast cancer classification compared to human experts. However, most deep learning methods have relied on unimodal features, potentially limiting the performance of diagnostic models. Additionally, most studies conducted so far have used a single view of digital mammograms, which significantly reduces model performance due to limited overall perspective and generalizability. To address these limitations, we collected a multiview multimodal dataset, including digital mammograms four views two craniocaudal (CC), two mediolateral oblique (MLO) one for each breast, and textual data extracted from radiological reports. We propose a multimodal deep learning architecture for breast cancer classification, utilizing images (digital mammograms) and textual data (radiological reports) from our new in-house dataset. In addition, various augmentation techniques are applied to both imaging and textual data to enhance the training data size. In our investigation, we explored the performance of six state-of-the-art (SOTA) deep learning architectures: VGG16, VGG19, ResNet34, MobileNetV3, EfficientNetB7, and a vision transformer (ViT) as an imaging feature extractors. For textual feature extraction, we employed an artificial neural network (ANN). Afterwards, features were fused using an early fusion and late fusion strategy. The fused imaging and textual features were then inputted into an ANN classifier for breast cancer classification. We evaluated various feature extractors and an ANN classifier combinations, finding that VGG19 in association with ANN achieved the highest accuracy at 0.951. In terms of precision, again VGG19 and ANN combination surpassed other SOTA CNN and attention-based architectures, achieving a score of 0.95. The best sensitivity score of 0.893 was recorded by VGG16+ANN, followed by VGG19+ANN with 0.884. The highest F1 score of 0.922 was achieved by VGG19+ANN. VGG16+ANN achieved the best area under the curve (AUC) score of 0.929, closely followed by VGG19+ANN with a score of 0.915.
引用
收藏
页码:31 / 39
页数:9
相关论文
共 50 条
  • [41] Comparative analysis of classification algorithms on the breast cancer recurrence using machine learning
    Valentina Mikhailova
    Gholamreza Anbarjafari
    Medical & Biological Engineering & Computing, 2022, 60 : 2589 - 2600
  • [42] Classification of Breast Cancer Histology Using Deep Learning
    Golatkar, Aditya
    Anand, Deepak
    Sethi, Amit
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2018), 2018, 10882 : 837 - 844
  • [43] Enhancing fetal electrocardiogram classification: A hybrid approach incorporating multimodal data fusion and advanced deep learning models
    Ziani S.
    Multimedia Tools and Applications, 2024, 83 (18) : 55011 - 55051
  • [44] Fine tuning deep learning models for breast tumor classification
    Heikal, Abeer
    El-Ghamry, Amir
    Elmougy, Samir
    Rashad, M. Z.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [45] Comparative Analysis of Traditional Machine Learning and Transformer-based Deep Learning Models for Text Classification
    Aydin, Nazif
    Erdem, Osman Ayhan
    Tekerek, Adem
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2024,
  • [46] Skin Cancer Classification using Deep Learning Models
    Kahia, Marwa
    Echtioui, Amira
    Kallel, Fathi
    Ben Hamida, Ahmed
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2022, : 554 - 559
  • [47] Multimodal data fusion for cancer biomarker discovery with deep learning
    Steyaert, Sandra
    Pizurica, Marija
    Nagaraj, Divya
    Khandelwal, Priya
    Hernandez-Boussard, Tina
    Gentles, Andrew J.
    Gevaert, Olivier
    NATURE MACHINE INTELLIGENCE, 2023, 5 (04) : 351 - 362
  • [48] Multimodal data fusion for cancer biomarker discovery with deep learning
    Sandra Steyaert
    Marija Pizurica
    Divya Nagaraj
    Priya Khandelwal
    Tina Hernandez-Boussard
    Andrew J. Gentles
    Olivier Gevaert
    Nature Machine Intelligence, 2023, 5 : 351 - 362
  • [49] Forecasting glaucoma from multimodal data using deep learning models
    Huang, Xiaoqin
    Poursoroush, Asma
    Madadi, Yeganeh
    Raja, Hina
    Delsoz, Mohammad
    Pasquale, Louis R.
    Boland, Michael V.
    Johnson, Chris A.
    Zebardast, Nazlee
    Yousefi, Siamak
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [50] Implementing Cyclical Learning Rates in Deep Learning Models for Data Classification
    Al-Khamees, Hussein A. A.
    Manaa, Mehdi Ebady
    Obaid, Zahraa Hazim
    Mohammedali, Noor Abdalkarem
    FORTHCOMING NETWORKS AND SUSTAINABILITY IN THE AIOT ERA, VOL 1, FONES-AIOT 2024, 2024, 1035 : 205 - 215