ISAFusionNet: Involution and soft attention based deep multi-modal fusion network for multi-label skin lesion classification

Authors
Mohammed, Hussein M. A. [1]
Omeroglu, Asli Nur [1]
Oral, Emin Argun [1, 2]
Ozbek, I. Yucel [1, 2]
Affiliations
[1] Ataturk Univ, Dept Elect Engn, TR-25240 Erzurum, Turkiye
[2] Ataturk Univ, High Performance Comp Applicat & Res Ctr, TR-25240 Erzurum, Turkiye
Keywords
Multi-label skin lesion classification; Multi-modal fusion; Involution; Soft attention; CHECKLIST
DOI
10.1016/j.compeleceng.2024.109966
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Skin lesions are morphologically diverse, and classifying them is challenging because of large inter-class similarity and intra-class variation. To address this, an involution and soft attention based multi-modal hybrid fusion network, ISAFusionNet, is proposed for automatic multi-label skin lesion classification. The proposed method is composed of two feature extraction branches and a hybrid fusion branch. The feature extraction branches use involution modules within multiple residual blocks to improve the visual representation of dermoscopy and clinical images. The hybrid fusion branch complementarily fuses the features of the two image modalities at multiple layers and combines them with metadata features. This branch is composed of multiple soft attention modules that focus on the most relevant skin lesion areas. The proposed multi-modal method is evaluated on the seven-point checklist dataset, and an average accuracy of 85.6% is achieved for multi-label classification. Average sensitivity, specificity, precision and AUC of 74.8%, 89%, 85.2% and 94.3%, respectively, are obtained. These results indicate that the proposed ISAFusionNet improves the average accuracy by 3.13% compared to the existing state-of-the-art model. In this sense, the involution and soft attention based deep multi-modal hybrid fusion network yields satisfactory performance for the multi-label skin lesion classification problem.
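The fusion idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`soft_attention_pool`, `fuse`), the linear scoring vector `weights`, and the plain concatenation step are all assumptions made for illustration, and the involution modules and multi-layer fusion are omitted. The sketch only shows soft attention as a softmax-weighted average over spatial locations, followed by concatenating the two image-modality vectors with metadata features.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a flat list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_attention_pool(features, weights):
    """Attention-weighted pooling of per-location feature vectors.

    features: list of N feature vectors (one per spatial location), each of length C.
    weights:  hypothetical scoring vector of length C (learned in a real network).
    Returns (pooled vector of length C, attention map of length N).
    """
    # One scalar relevance score per spatial location.
    scores = [sum(w * f for w, f in zip(weights, vec)) for vec in features]
    # Soft attention: scores become non-negative weights that sum to 1.
    attn = softmax(scores)
    channels = len(features[0])
    pooled = [sum(a * vec[c] for a, vec in zip(attn, features))
              for c in range(channels)]
    return pooled, attn


def fuse(dermoscopy_feats, clinical_feats, metadata, weights):
    """Assumed fusion step: pool each modality, then concatenate with metadata."""
    d_pooled, _ = soft_attention_pool(dermoscopy_feats, weights)
    c_pooled, _ = soft_attention_pool(clinical_feats, weights)
    return d_pooled + c_pooled + list(metadata)


# Toy inputs (made up): 3 dermoscopy locations, 2 clinical locations, 2 channels each.
derm = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
clin = [[0.5, 0.5], [1.0, 1.0]]
meta = [1.0, 0.0, 0.0]
fused = fuse(derm, clin, meta, weights=[1.0, 1.0])
print(len(fused))  # 2 + 2 + 3 = 7 fused features
```

In this toy setup the location with the largest score dominates the pooled vector, which is the sense in which soft attention focuses on the most relevant regions while remaining differentiable.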
Pages: 14