ISAFusionNet: Involution and soft attention based deep multi-modal fusion network for multi-label skin lesion classification

Authors
Mohammed, Hussein M. A. [1]
Omeroglu, Asli Nur [1]
Oral, Emin Argun [1, 2]
Ozbek, I. Yucel [1, 2]
Affiliations
[1] Ataturk Univ, Dept Elect Engn, TR-25240 Erzurum, Turkiye
[2] Ataturk Univ, High Performance Comp Applicat & Res Ctr, TR-25240 Erzurum, Turkiye
Keywords
Multi-label skin lesion classification; Multi-modal fusion; Involution; Soft attention; CHECKLIST;
DOI
10.1016/j.compeleceng.2024.109966
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Skin lesions are morphologically diverse, and their classification is challenging because of large inter-class similarity and intra-class variation. To address this, an involution and soft attention based multi-modal hybrid fusion network, ISAFusionNet, is proposed for automatic multi-label skin lesion classification. The proposed method is composed of two feature extraction branches and a hybrid fusion branch. The feature extraction branches use involution modules within multiple residual blocks to improve the visual representation of dermoscopy and clinical image information. The hybrid fusion branch, in turn, complementarily fuses the features of the two image modalities at multiple layers and combines them with meta-data features. This branch is composed of multiple soft attention modules that focus on the most relevant skin lesion areas. The proposed multi-modal method is evaluated on the seven-point checklist dataset, and an average accuracy of 85.6% is achieved for multi-label classification. Average sensitivity, specificity, precision and AUC of 74.8%, 89%, 85.2% and 94.3% are obtained, respectively. These results indicate that the proposed ISAFusionNet improves the average accuracy by 3.13% compared to the existing state-of-the-art model. In this sense, the involution and soft attention based deep multi-modal hybrid fusion network yields satisfactory performance for the multi-label skin lesion classification problem.
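The abstract reports metrics averaged over labels. As a minimal sketch of how such averages can be computed, the Python below treats each label as a binary decision and takes an unweighted (macro) mean across labels. This averaging protocol is an assumption: the abstract does not state how the averages were formed, and the labels in the seven-point checklist dataset are not necessarily binary. The function names and the toy data are hypothetical and made up for illustration; they are not results from the paper.

```python
from typing import Dict, List, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, sensitivity, specificity and precision for one binary label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    def ratio(num: int, den: int) -> float:
        # Return 0.0 when the denominator is empty (a convention, not a rule).
        return num / den if den else 0.0

    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
    }


def evaluate(y_true: List[List[int]], y_pred: List[List[int]]) -> Dict[str, float]:
    """Macro-average the per-label metrics (assumed averaging scheme)."""
    n_labels = len(y_true[0])
    per_label = [
        binary_metrics([row[j] for row in y_true], [row[j] for row in y_pred])
        for j in range(n_labels)
    ]
    return {
        key: sum(m[key] for m in per_label) / n_labels
        for key in per_label[0]
    }


# Toy data (made up): rows are samples, columns are two hypothetical labels.
Y_TRUE = [[1, 0], [1, 1], [0, 0], [0, 1]]
Y_PRED = [[1, 0], [0, 1], [0, 1], [0, 1]]

if __name__ == "__main__":
    for name, value in evaluate(Y_TRUE, Y_PRED).items():
        print(f"{name}: {value:.3f}")
```

With macro averaging every label counts equally regardless of how many positive samples it has, so a rare label with low sensitivity can pull the average well below the accuracy figure.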
Pages: 14
Related papers
50 in total
  • [1] A novel soft attention-based multi-modal deep learning framework for multi-label skin lesion classification
    Omeroglu, Asli Nur
    Mohammed, Hussein M. A.
    Oral, Emin Argun
    Aydin, Serdar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [2] Multi-modal bilinear fusion with hybrid attention mechanism for multi-label skin lesion classification
    Wei, Yun
    Ji, Lin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 65221 - 65247
  • [3] A multi-stage multi-modal learning algorithm with adaptive multimodal fusion for improving multi-label skin lesion classification
    Zuo, Lihan
    Wang, Zizhou
    Wang, Yan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2025, 162
  • [4] A Deep Multi-Modal CNN for Multi-Instance Multi-Label Image Classification
    Song, Lingyun
    Liu, Jun
    Qian, Buyue
    Sun, Mingxuan
    Yang, Kuan
    Sun, Meng
    Abbas, Samar
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (12) : 6025 - 6038
  • [5] Complex Object Classification: A Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport
    Yang, Yang
    Wu, Yi-Feng
    Zhan, De-Chuan
    Liu, Zhi-Bin
    Jiang, Yuan
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2594 - 2603
  • [6] FusionM4Net: A multi-stage multi-modal learning algorithm for multi-label skin lesion classification
    Tang, Peng
    Yan, Xintong
    Nan, Yang
    Xiang, Shao
    Krammer, Sebastian
    Lasser, Tobias
    MEDICAL IMAGE ANALYSIS, 2022, 76
  • [7] Collaboration based multi-modal multi-label learning
    Zhang, Yi
    Zhu, Yinlong
    Zhang, Zhecheng
    Wang, Chongjung
    APPLIED INTELLIGENCE, 2022, 52 (12) : 14204 - 14217
  • [8] Collaboration based multi-modal multi-label learning
    Yi Zhang
    Yinlong Zhu
    Zhecheng Zhang
    Chongjung Wang
    Applied Intelligence, 2022, 52 : 14204 - 14217
  • [9] MSAFusionNet: Multiple Subspace Attention Based Deep Multi-modal Fusion Network
    Zhang, Sen
    Zhang, Changzheng
    Wang, Lanjun
    Li, Cixing
    Tu, Dandan
    Luo, Rui
    Qi, Guojun
    Luo, Jiebo
    MACHINE LEARNING IN MEDICAL IMAGING (MLMI 2019), 2019, 11861 : 54 - 62
  • [10] Cross-modal fusion for multi-label image classification with attention mechanism
    Wang, Yangtao
    Xie, Yanzhao
    Zeng, Jiangfeng
    Wang, Hanpin
    Fan, Lisheng
    Song, Yufan
    Computers and Electrical Engineering, 2022, 101