Enhancing intra-aural disease classification with attention-based deep learning models

被引:0
|
作者
Furkancan Demircan [1 ]
Murat Ekinci [2 ]
Zafer Cömert [1 ]
机构
[1] Samsun University,Software Engineering, Faculty of Engineering and Natural Sciences
[2] Karadeniz Technical University,Computer Engineering, Faculty of Engineering
[3] Department of Technical Sciences of the Western Caspian University,undefined
关键词
Classification; Ear diseases; Deep learning; Transformers; Machine learning;
D O I
10.1007/s00521-025-10990-4
中图分类号
学科分类号
摘要
Ear diseases are defined as pathological conditions that indicate dysfunction or abnormal function of the ear organ, which is part of the auditory system of living organisms that regulates hearing and balance functions. These diseases usually manifest as conditions that affect the internal components of the ear structure and can manifest themselves with symptoms such as hearing loss, ear pain, balance problems, and fluid accumulation in the ear. The accuracy of the diagnosis depends on expert knowledge and subjective opinion. This method is prone to human error. This study presents a novel computer-aided diagnosis system for otoscope images of ear diseases, utilizing a vision transformer-based feature extractor combined with machine learning classifiers to provide accurate second opinions for ENT specialists. For this purpose, a new model based on state-of-the-art vision transformer feature extractor and machine learning models is proposed. In the experimental study, the dataset, comprising 880 eardrum images categorized into four classes (CSOM, earwax, myringosclerosis, and normal), was split into training (70%), validation (10%), and testing (20%) subsets. Each image was preprocessed to 420 × 380 pixels to fit the input dimensions of the models. The vision transformer architecture was utilized for feature extraction, followed by classification using various machine learning algorithms including kNN, SVM, and random forest. As a result, the model using vision transformer feature extractor and k-nearest neighbors (kNN) algorithm achieved 99.00% accuracy. In this study, a deep learning-based and computer-aided diagnosis system, in other words, a computational model, was developed instead of the current human error-prone disease diagnosis method used by ear nose throat (ENT) specialists. The main purpose of the deep learning-based decision support system is to support the diagnosis process where expert knowledge is difficult to access and to provide an alternative opinion to the expert diagnosis.
引用
收藏
页码:6601 / 6616
页数:15
相关论文
共 50 条
  • [41] Understanding stance classification of BERT models: an attention-based framework
    Carlos Abel Córdova Sáenz
    Karin Becker
    Knowledge and Information Systems, 2024, 66 : 419 - 451
  • [42] On Exploring Attention-based Explanation for Transformer Models in Text Classification
    Liu, Shengzhong
    Le, Franck
    Chakraborty, Supriyo
    Abdelzaher, Tarek
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1193 - 1203
  • [43] Malware Classification Using Attention-Based Transductive Learning Network
    Deng, Liting
    Wen, Hui
    Xin, Mingfeng
    Sun, Yue
    Sun, Limin
    Zhu, Hongsong
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS (SECURECOMM 2020), PT II, 2020, 336 : 403 - 418
  • [44] Graph Attention-Based Curriculum Learning for Mental Healthcare Classification
    Ahmed, Usman
    Lin, Jerry Chun-Wei
    Srivastava, Gautam
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (05) : 2581 - 2591
  • [45] Multimodal Attention-Based Learning for Imbalanced Corporate Documents Classification
    Mahamoud, Ibrahim Souleiman
    Voerman, Joris
    Coustaty, Mickael
    Joseph, Aurelie
    D'Andecy, Vincent Poulain
    Ogier, Jean-Marc
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 223 - 237
  • [46] Attention-Based Hybrid Deep Learning Models for Classifying COVID-19 Genome Sequences
    Mutawa, A. M.
    AI, 2025, 6 (01)
  • [47] Scattering Representation and Attention-Based Residual Learning for Image Classification
    Kaur, Manjeet
    Ahmad, M. Omair
    Swamy, M. N. S.
    2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024, : 724 - 728
  • [48] Attention-based deep learning for accurate cell image analysis
    Gao, Xiangrui
    Zhang, Fan
    Guo, Xueyu
    Yao, Mengcheng
    Wang, Xiaoxiao
    Chen, Dong
    Zhang, Genwei
    Wang, Xiaodong
    Lai, Lipeng
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [49] Generalized attention-based deep multi-instance learning
    Lu Zhao
    Liming Yuan
    Kun Hao
    Xianbin Wen
    Multimedia Systems, 2023, 29 : 275 - 287
  • [50] Generalized attention-based deep multi-instance learning
    Zhao, Lu
    Yuan, Liming
    Hao, Kun
    Wen, Xianbin
    MULTIMEDIA SYSTEMS, 2023, 29 (01) : 275 - 287