Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features

被引:17
|
作者
Eskidere, Omer [1 ]
Gurhanli, Ahmet [2 ]
机构
[1] Bursa Orhangazi Univ, Dept Elect Elect Engn, TR-16310 Bursa, Turkey
[2] Bursa Orhangazi Univ, Dept Comp Engn, TR-16310 Bursa, Turkey
关键词
PATHOLOGICAL VOICE; ACOUSTIC ANALYSIS; PERTURBATION; MFCC;
D O I
10.1155/2015/956249
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Mel Frequency Cepstral Coefficients (MFCCs) are widely used in order to extract essential information from a voice signal and became a popular feature extractor used in audio processing. However, MFCC features are usually calculated from a single window (taper) characterized by large variance. This study shows investigations on reducing variance for the classification of two different voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance by newly proposed windowing techniques and conventional single-taper technique. The results demonstrate that adapted weighted Thomson multitaper method could distinguish between normal voice and disordered voice better than the results done by the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Palmprint recognition based on Mel frequency Cepstral coefficients feature extraction
    Fahmy, Maged M. M.
    AIN SHAMS ENGINEERING JOURNAL, 2010, 1 (01) : 39 - 47
  • [22] A New Approach for Fingerprint Recognition Based on Mel Frequency Cepstral Coefficients
    Hashad, F. G.
    Halim, T. M.
    Diab, S. M.
    Sallam, B. M.
    2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES 2009), 2009, : 263 - +
  • [23] Voice Control for a Gripper using Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models
    Velasco-Hernandez, Gustavo
    Diaz-Toro, Andres
    2015 20TH SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND COMPUTER VISION (STSIVA), 2015,
  • [24] Comparative Study of Voice Conversion Framework with Line Spectral Frequency and Mel- Frequency Cepstral Coefficients as Features Using Artficial Neural Networks
    Bhuyan, Amit Kumar
    Nirmal, Jagannath H.
    2015 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, AND SYSTEMS (ICCCS), 2015, : 230 - 235
  • [25] Bottleneck Features based on Gammatone Frequency Cepstral Coefficients
    Qi, Jun
    Wang, Dong
    Xu, Ji
    Tejedor, Javier
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1750 - 1754
  • [26] Speaker independent phoneme recognition based on fractal dimension (DF) and the mel-frequency cepstral coefficients features
    Fekkai, S
    Al-Akaidi, M
    Blackledge, JM
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4014 - 4014
  • [27] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Yang, Zhiyou
    Huang, Ying
    EVOLUTIONARY INTELLIGENCE, 2022, 15 (04) : 2485 - 2494
  • [28] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Zhiyou Yang
    Ying Huang
    Evolutionary Intelligence, 2022, 15 : 2485 - 2494
  • [29] Classification of auditory signals from a combine harvester based on Mel-frequency Cepstral coefficients and machine learning
    Thomas, Gabriel
    Simundsson, Avery
    Mann, Danny D.
    Balocco, Simone
    Canadian Biosystems Engineering / Le Genie des biosystems au Canada, 2021, 63 : 213 - 222
  • [30] Voice spoofing detection using a neural networks assembly considering spectrograms and mel frequency cepstral coefficients
    Hernández-Nava C.A.
    Rincón-García E.A.
    Lara-Velázquez P.
    de-los-Cobos-Silva S.G.
    Gutiérrez-Andrade M.A.
    Mora-Gutiérrez R.A.
    PeerJ Computer Science, 2023, 9