Improved Laryngeal Pathology Detection Based on Bottleneck Convolutional Networks and MFCC

被引:0
|
作者
Korba, Mohamed Cherif Amara [1 ]
Doghmane, Hakim [2 ]
Khelil, Khaled [1 ]
Messaoudi, Kamel [1 ]
机构
[1] Mohamed Cher Messaadia Univ Souk Ahras, LEER Lab, Souk Ahras 41000, Algeria
[2] Univ 8 Mai 1945 Guelma, PI MIS Lab, Guelma 24000, Algeria
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Mel frequency cepstral coefficient; Pathology; Databases; Accuracy; Support vector machines; Perturbation methods; Larynx; Laryngeal pathologies detection; convolutional bottleneck network; HUPA database; glottal features; perturbation features; support vector machine; random forest; extreme gradient boosting; FREQUENCY CEPSTRAL COEFFICIENTS; EPOCH EXTRACTION; VOICE; SPEECH; INFORMATION; HEALTHY;
D O I
10.1109/ACCESS.2024.3454825
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic detection of laryngeal disorders via voice analysis allows for early diagnosis. However, the effectiveness of AI-based detection methods is often limited, mainly due to insufficient training data subject to confidentiality constraints, as well as the wide range of pathologies, which hinders accurate detection. To address these issues, an automatic voice disorder detection (AVDD) system is proposed, employing an innovative AI-based feature extraction approach to improve detection performance. The approach, termed MFCC-CBN, employs Mel-frequency cepstral coefficients (MFCC) with a convolutional bottleneck network (CBN). It also integrates a diverse feature set, such as measurements related to the fundamental frequency (F0) perturbation, features specific to the glottal source, and conventional MFCC features. The proposed approach is validated through comprehensive experiments on the public database of the Pr & iacute;ncipe de Asturias University Hospital (HUPA), which contains recordings of sustained vowels. The method is tested using various classifiers, including Support Vector Machine (SVM), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost). The obtained results show that our method provides a high detection rate and maintains stable performance regardless of the classifier used, which reveals its good generalization. A 5-fold cross-validation technique is adopted for the performance evaluation of the AVDD system. The optimal feature configuration surpasses state-of-the-art results, achieving a classification accuracy of 88.79% and an F1-score of 0.88.
引用
收藏
页码:124801 / 124815
页数:15
相关论文
共 50 条
  • [1] Heart sound classification based on improved MFCC features and convolutional recurrent neural networks
    Deng, Muqing
    Meng, Tingting
    Cao, Jiuwen
    Wang, Shimin
    Zhang, Jing
    Fan, Huijie
    NEURAL NETWORKS, 2020, 130 : 22 - 32
  • [2] RETRACTED: An Analytical Study of Speech Pathology Detection Based on MFCC and Deep Neural Networks (Retracted Article)
    Zakariah, Mohammed
    Reshma, B.
    Alotaibi, Yousef Ajmi
    Guo, Yanhui
    Tran-Trung, Kiet
    Elahi, Mohammad Mamun
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [3] An Improved Endpoint Detection Algorithm Based on MFCC Cosine Value
    Cao, Danyang
    Gao, Xue
    Gao, Lei
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 95 (03) : 2073 - 2090
  • [4] LARYNGEAL PATHOLOGY DETECTION
    CHILDERS, DG
    COLEMAN, RF
    CRC CRITICAL REVIEWS IN BIOENGINEERING, 1977, 2 (04): : 375 - 425
  • [5] An Improved Endpoint Detection Algorithm Based on MFCC Cosine Value
    Danyang Cao
    Xue Gao
    Lei Gao
    Wireless Personal Communications, 2017, 95 : 2073 - 2090
  • [6] Information Bottleneck Theory on Convolutional Neural Networks
    Li, Junjie
    Liu, Ding
    NEURAL PROCESSING LETTERS, 2021, 53 (02) : 1385 - 1400
  • [7] Information Bottleneck Theory on Convolutional Neural Networks
    Junjie Li
    Ding Liu
    Neural Processing Letters, 2021, 53 : 1385 - 1400
  • [8] IMPROVED MUSICAL ONSET DETECTION WITH CONVOLUTIONAL NEURAL NETWORKS
    Schlueter, Jan
    Boeck, Sebastian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] Community detection in networks based on information bottleneck clustering
    Liu, Yongli
    Yang, Tengfei
    Fu, Lili
    Liu, Jing
    Journal of Computational Information Systems, 2015, 11 (02): : 693 - 700
  • [10] Blockchain cryptocurrency abnormal behavior detection based on improved graph convolutional neural networks
    Li, Xiaohan
    Yang, Yanbo
    Li, Baoshan
    Li, Minchao
    Zhang, Jiawei
    Li, Teng
    2023 INTERNATIONAL CONFERENCE ON DATA SECURITY AND PRIVACY PROTECTION, DSPP, 2023, : 216 - 222