ConvNeXt-ST-AFF: A Novel Skin Disease Classification Model Based on Fusion of ConvNeXt and Swin Transformer

被引:4
|
作者
Hao, Shengnan [1 ]
Zhang, Liguo [1 ]
Jiang, Yanyan [2 ]
Wang, Jingkun [3 ]
Ji, Zhanlin [1 ,4 ]
Zhao, Li [3 ]
Ganchev, Ivan [4 ,5 ,6 ]
机构
[1] North China Univ Sci & Technol, Dept Artificial Intelligence, Tangshan 063009, Peoples R China
[2] Hebei Agr Univ, Coll Urban & Rural Construct, Baoding 071000, Peoples R China
[3] Tsinghua Univ, Inst Precis Med, Beijing Natl Res Ctr Informat Sci & Technol, Beijing 100084, Peoples R China
[4] Univ Limerick, Telecommun Res Ctr TRC, Limerick V94 T9PX, Ireland
[5] Univ Plovdiv Paisii Hilendarski, Dept Comp Syst, Plovdiv 4000, Bulgaria
[6] Bulgarian Acad Sci, Inst Math & Informat, Sofia 1040, Bulgaria
关键词
Skin disease classification; image denoising; model fusion; attention; ConvNeXt; swin transformer; NEURAL-NETWORK; LOCALIZATION; FRAMEWORK; FEATURES; SYSTEM;
D O I
10.1109/ACCESS.2023.3324042
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic classification of dermatological images is an important technology that assists doctors in performing faster and more accurate classification of skin diseases. Recently, convolutional neural networks (CNNs) and Transformer networks have been employed in learning respectively the local and global features of lesion images. However, existing works mainly focus on utilizing a single neural network for feature extraction, which limits the model classification performance. In order to tackle this problem, a novel fusion model, named ConvNeXt-ST-AFF, is proposed in this paper, by combining the strengths of ConvNeXt and Swin Transformer (ConvNeXt-ST in the model's name). In the proposed model, the pretrained ConvNeXt and Swin Transformer networks extract local and global features from images, which are then fused using Attentional Feature Fusion (AFF) submodules (AFF in the model's name). Additionally, in order to enhance the model's attention on the regions of skin lesions during training, an Efficient Channel Attention (ECA) module is incorporated into the ConvNeXt network. Moreover, the proposed model employs a denoising module to reduce the influence of artifacts and improve the image contrast. The results, obtained by experiments conducted on two datasets, demonstrate that the proposed ConvNeXt-ST-AFF model has higher classification ability, according to multiple evaluation metrics, compared to the original ConvNeXt and Swin Transformer, and other state-of-the-art classification models.
引用
收藏
页码:117460 / 117473
页数:14
相关论文
共 50 条
  • [41] CSF-GTNet: A Novel Multi-Dimensional Feature Fusion Network Based on Convnext-GeLU-BiLSTM for EEG-Signals-Enabled Fatigue Driving Detection
    Gao, Dongrui
    Li, Pengrui
    Wang, Manqing
    Liang, Yujie
    Liu, Shihong
    Zhou, Jiliu
    Wang, Lutao
    Zhang, Yongqing
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (05) : 2558 - 2568
  • [42] Swin Transformer based fluid classification using Gram angle field-converted well logging data: A novel approach
    Sun, Youzhuang
    Zhang, Junhua
    Zhang, Yongan
    PHYSICS OF FLUIDS, 2024, 36 (01)
  • [43] EFFNet: A skin cancer classification model based on feature fusion and random forests
    Ma, Xiaopu
    Shan, Jiangdan
    Ning, Fei
    Li, Wentao
    Li, He
    PLOS ONE, 2023, 18 (10):
  • [44] Swin Transformer-Based Segmentation and Multi-Scale Feature Pyramid Fusion Module for Alzheimer's Disease with Machine Learning
    Gharaibeh, Nasr
    Abu-Ein, Ashraf A.
    Al-hazaimeh, Obaida M.
    Nahar, Khalid M. O.
    Abu-Ain, Waleed A.
    Al-Nawashi, Malek M.
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (04) : 22 - 50
  • [45] COVID-19 diagnosis based on swin transformer model with demographic information fusion and enhanced multi-head attention mechanism
    Sun, Yunlong
    Lian, Jingge
    Teng, Ze
    Wei, Ziyi
    Tang, Yi
    Yang, Liu
    Gao, Yajuan
    Wang, Tianfu
    Li, Hongfeng
    Xu, Meng
    Lei, Baiying
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243
  • [46] Multi-label classification of retinal disease via a novel vision transformer model
    Wang, Dong
    Lian, Jian
    Jiao, Wanzhen
    FRONTIERS IN NEUROSCIENCE, 2024, 17
  • [47] COVID-19 Diagnosis Based on Swin Transformer Model with Demographic Information Fusion and Enhanced Multi-head Attention Mechanism
    Sun, Yunlong
    Liu, Yiyao
    Qu, Junlong
    Dong, Xiang
    Song, Xuegang
    Lei, Baiying
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT II, 2024, 14349 : 194 - 204
  • [48] ST-YOLOA: a Swin-transformer-based YOLO model with an attention mechanism for SAR ship detection under complex background
    Zhao, Kai
    Lu, Ruitao
    Wang, Siyu
    Yang, Xiaogang
    Li, Qingge
    Fan, Jiwei
    FRONTIERS IN NEUROROBOTICS, 2023, 17
  • [49] A novel adjunctive diagnostic method for bone cancer: Osteosarcoma cell segmentation based on Twin Swin Transformer with multi-scale feature fusion
    Wen, Tingxi
    Tong, Binbin
    Fu, Yuqing
    Li, Yunfeng
    Ling, Mengde
    Chen, Xinwen
    JOURNAL OF BONE ONCOLOGY, 2024, 49
  • [50] Disease Classification Model Based on Multi-Modal Feature Fusion
    Wan, Zhengyu
    Shao, Xinhui
    IEEE ACCESS, 2023, 11 : 27536 - 27545