ISAFusionNet: Involution and soft attention based deep multi-modal fusion network for multi-label skin lesion classification

被引:0
|
作者
Mohammed, Hussein M. A. [1 ]
Omeroglu, Asli Nur [1 ]
Oral, Emin Argun [1 ,2 ]
Ozbek, I. Yucel [1 ,2 ]
机构
[1] Ataturk Univ, Dept Elect Engn, TR-25240 Erzurum, Turkiye
[2] Ataturk Univ, High Performance Comp Applicat & Res Ctr, TR-25240 Erzurum, Turkiye
关键词
Multi-label skin lesion classification; Multi-modal fusion; Involution; Soft attention; CHECKLIST;
D O I
10.1016/j.compeleceng.2024.109966
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Skin lesions have morphological diversity, and their classification is a challenging task due to the large inter-class similarity and intra-class variation. To address this, an involution and soft attention based multimodal hybrid fusion network, ISAFusionNet, is proposed for automatic multi-label skin lesion classification. The proposed method is composed of two feature extraction branches and a hybrid fusion branch. The feature extraction branches utilize involution modules within multiple residual blocks to improve the visual representation of dermoscopy and clinical image information. The hybrid fusion branch, on the other hand, complementarily fuses the features of two image modalities in a multi-layer sense and combine them with meta-data features. This branch is composed of multiple soft attention modules to focus on the most relevant skin lesion areas. The proposed multi-modal method is evaluated on the seven-point checklist dataset, and an average accuracy of 85.6% is achieved for multi-label classification. Average sensitivity, specificity, precision and AUC results of 74.8%, 89%, 85.2% and 94.3% were obtained, respectively. These results indicate that the proposed ISAFusionNet improves the average accuracy by 3.13% compared to the existing state-of-the-art model. In this sense, involution and soft attention based deep multi-modal hybrid fusion network yields satisfactory performance for multi-label skin lesion classification problem.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Multi-label classification of technical articles based on deep neural network
    Zhao, Qiuhan
    Yang, Wenchuan
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8391 - 8397
  • [32] Joint-individual fusion structure with fusion attention module for multi-modal skin cancer classification
    Tang, Peng
    Yan, Xintong
    Nan, Yang
    Hu, Xiaobin
    Menze, Bjoern H.
    Krammer, Sebastian
    Lasser, Tobias
    PATTERN RECOGNITION, 2024, 154
  • [33] Dual-Attention Deep Fusion Network for Multi-modal Medical Image Segmentation
    Zheng, Shenhai
    Ye, Xin
    Tan, Jiaxin
    Yang, Yifei
    Li, Laquan
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [34] Deep Attention Neural Network for Multi-Label Classification in Unmanned Aerial Vehicle Imagery
    Alshehri, Aaliyah
    Bazi, Yakoub
    Ammour, Nassim
    Almubarak, Haidar
    Alajlan, Naif
    IEEE ACCESS, 2019, 7 : 119873 - 119880
  • [35] Soft multi-modal data fusion
    Coppock, S
    Mazack, L
    PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 636 - 641
  • [36] Multi-Label Text Classification Based on Label-Sentence Bi-Attention Fusion Network with Multi-Level Feature Extraction
    Li, Anqi
    Zhang, Lin
    ELECTRONICS, 2025, 14 (01):
  • [37] Graph Attention Transformer Network for Multi-label Image Classification
    Yuan, Jin
    Chen, Shikai
    Zhang, Yao
    Shi, Zhongchao
    Geng, Xin
    Fan, Jianping
    Rui, Yong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [38] Attention-Based Multi-Modal Fusion Network for Semantic Scene Completion
    Li, Siqi
    Zou, Changqing
    Li, Yipeng
    Zhao, Xibin
    Gao, Yue
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11402 - 11409
  • [39] TFormer: A throughout fusion transformer for multi-modal skin lesion diagnosis
    Zhang, Yilan
    Xie, Fengying
    Chen, Jianqi
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [40] Multi-label remote sensing classification with self-supervised gated multi-modal transformers
    Liu, Na
    Yuan, Ye
    Wu, Guodong
    Zhang, Sai
    Leng, Jie
    Wan, Lihong
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2024, 18