Improved Laryngeal Pathology Detection Based on Bottleneck Convolutional Networks and MFCC

被引:0
|
作者
Korba, Mohamed Cherif Amara [1 ]
Doghmane, Hakim [2 ]
Khelil, Khaled [1 ]
Messaoudi, Kamel [1 ]
机构
[1] Mohamed Cher Messaadia Univ Souk Ahras, LEER Lab, Souk Ahras 41000, Algeria
[2] Univ 8 Mai 1945 Guelma, PI MIS Lab, Guelma 24000, Algeria
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Mel frequency cepstral coefficient; Pathology; Databases; Accuracy; Support vector machines; Perturbation methods; Larynx; Laryngeal pathologies detection; convolutional bottleneck network; HUPA database; glottal features; perturbation features; support vector machine; random forest; extreme gradient boosting; FREQUENCY CEPSTRAL COEFFICIENTS; EPOCH EXTRACTION; VOICE; SPEECH; INFORMATION; HEALTHY;
D O I
10.1109/ACCESS.2024.3454825
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic detection of laryngeal disorders via voice analysis allows for early diagnosis. However, the effectiveness of AI-based detection methods is often limited, mainly due to insufficient training data subject to confidentiality constraints, as well as the wide range of pathologies, which hinders accurate detection. To address these issues, an automatic voice disorder detection (AVDD) system is proposed, employing an innovative AI-based feature extraction approach to improve detection performance. The approach, termed MFCC-CBN, employs Mel-frequency cepstral coefficients (MFCC) with a convolutional bottleneck network (CBN). It also integrates a diverse feature set, such as measurements related to the fundamental frequency (F0) perturbation, features specific to the glottal source, and conventional MFCC features. The proposed approach is validated through comprehensive experiments on the public database of the Pr & iacute;ncipe de Asturias University Hospital (HUPA), which contains recordings of sustained vowels. The method is tested using various classifiers, including Support Vector Machine (SVM), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost). The obtained results show that our method provides a high detection rate and maintains stable performance regardless of the classifier used, which reveals its good generalization. A 5-fold cross-validation technique is adopted for the performance evaluation of the AVDD system. The optimal feature configuration surpasses state-of-the-art results, achieving a classification accuracy of 88.79% and an F1-score of 0.88.
引用
收藏
页码:124801 / 124815
页数:15
相关论文
共 50 条
  • [21] An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks
    Cao, Danyang
    Chen, Zhixin
    Gao, Lei
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2020, 10 (01)
  • [22] Traffic sign recognition based on improved convolutional networks
    Zhang K.
    Hou J.
    Liu M.
    Liu J.
    Zhang, Ke (zkwy2004@126.com), 1600, Inderscience Publishers (21): : 274 - 284
  • [23] An Improved Touching-Cell Splitting Algorithm Based On Bottleneck Detection
    Wu, Yanhai
    Zhao, Shaohua
    Wu, Nan
    Zhang, Hao
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1159 - +
  • [24] A topic link detection method based on improved information bottleneck theory
    Yang, Yu-Zhen
    Liu, Pei-Yu
    Fei, Shao-Dong
    Zhang, Cheng-Gong
    Zidonghua Xuebao/Acta Automatica Sinica, 2014, 40 (03): : 471 - 479
  • [25] Improved Gender Independent Speaker Recognition Using Convolutional Neural Network Based Bottleneck Features
    Ranjan, Shivesh
    Hansen, John H. L.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1009 - 1013
  • [26] Speech Emotion Recognition Based on Improved MFCC
    Wang, Yan
    Hu, Weiping
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [27] Voice pathology detection using optimized convolutional neural networks and explainable artificial intelligence-based analysis
    Jegan, Roohum
    Jayagowri, R.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2024, 27 (14) : 2041 - 2057
  • [28] Improved Lane Detection With Multilevel Features in Branch Convolutional Neural Networks
    Yang, Wei-Jong
    Cheng, Yoa-Teng
    Chung, Pau-Choo
    IEEE ACCESS, 2019, 7 : 173148 - 173156
  • [29] Improved Object Detection With Iterative Localization Refinement in Convolutional Neural Networks
    Cheng, Kai-Wen
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (09) : 2261 - 2275
  • [30] ITERATIVE LOCALIZATION REFINEMENT IN CONVOLUTIONAL NEURAL NETWORKS FOR IMPROVED OBJECT DETECTION
    Cheng, Kai-Wen
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3643 - 3647