ISAFusionNet: Involution and soft attention based deep multi-modal fusion network for multi-label skin lesion classification

Citations: 0
Authors
Mohammed, Hussein M. A. [1]
Omeroglu, Asli Nur [1]
Oral, Emin Argun [1, 2]
Ozbek, I. Yucel [1, 2]
Affiliations
[1] Ataturk Univ, Dept Elect Engn, TR-25240 Erzurum, Turkiye
[2] Ataturk Univ, High Performance Comp Applicat & Res Ctr, TR-25240 Erzurum, Turkiye
Keywords
Multi-label skin lesion classification; Multi-modal fusion; Involution; Soft attention; CHECKLIST
DOI
10.1016/j.compeleceng.2024.109966
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Skin lesions are morphologically diverse, and their classification is challenging because of large inter-class similarity and intra-class variation. To address this, an involution and soft attention based multi-modal hybrid fusion network, ISAFusionNet, is proposed for automatic multi-label skin lesion classification. The proposed method comprises two feature extraction branches and a hybrid fusion branch. The feature extraction branches use involution modules within multiple residual blocks to improve the visual representation of dermoscopy and clinical image information. The hybrid fusion branch complementarily fuses the features of the two image modalities at multiple layers and combines them with meta-data features. This branch contains multiple soft attention modules that focus on the most relevant skin lesion areas. The proposed multi-modal method is evaluated on the seven-point checklist dataset, where it achieves an average accuracy of 85.6% for multi-label classification. The average sensitivity, specificity, precision and AUC are 74.8%, 89%, 85.2% and 94.3%, respectively. These results indicate that the proposed ISAFusionNet improves the average accuracy by 3.13% compared to the existing state-of-the-art model. In this sense, the involution and soft attention based deep multi-modal hybrid fusion network yields satisfactory performance for the multi-label skin lesion classification problem.
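The abstract reports metrics averaged over labels. As a rough illustration only, the sketch below shows one common way such averages could be computed: per-label accuracy, sensitivity, specificity and precision, followed by an unweighted (macro) mean. It assumes each label has been reduced to a binary decision, which may not match the protocol used in the paper; the function names and the toy data are hypothetical and do not come from the source.

```python
from typing import Dict, List, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity, specificity and precision for one binary label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    def ratio(num: int, den: int) -> float:
        # Assumption: report 0.0 when a metric is undefined (empty denominator).
        return num / den if den else 0.0

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
    }


def macro_average(per_label: List[Dict[str, float]]) -> Dict[str, float]:
    """Unweighted mean of each metric across labels."""
    return {
        key: sum(m[key] for m in per_label) / len(per_label)
        for key in per_label[0]
    }


# Toy data for two hypothetical labels (not taken from the dataset).
label_a = binary_metrics([1, 0, 1, 0], [1, 0, 0, 0])
label_b = binary_metrics([1, 1, 0, 0], [1, 1, 1, 0])
avg = macro_average([label_a, label_b])
print(avg)
```

Macro averaging weights every label equally regardless of how many positive cases it has; whether the reported figures use this or another averaging scheme is not stated in the abstract.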
Pages: 14