Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features

Cited by: 17
Authors
Eskidere, Omer [1]
Gurhanli, Ahmet [2]
Affiliations
[1] Bursa Orhangazi Univ, Dept Elect Elect Engn, TR-16310 Bursa, Turkey
[2] Bursa Orhangazi Univ, Dept Comp Engn, TR-16310 Bursa, Turkey
Keywords
PATHOLOGICAL VOICE; ACOUSTIC ANALYSIS; PERTURBATION; MFCC
DOI
10.1155/2015/956249
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Mel Frequency Cepstral Coefficients (MFCCs) are widely used to extract essential information from a voice signal and have become a popular feature extractor in audio processing. However, MFCC features are usually calculated from a single window (taper), which yields a spectrum estimate with large variance. This study investigates variance reduction for the classification of two voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance with that of two newly proposed windowing techniques and the conventional single-taper technique. The results demonstrate that the adapted weighted Thomson multitaper method distinguishes normal voice from disordered voice better than the conventional single-taper (Hamming window) technique and the two newly proposed windowing methods. Multitaper MFCC features may be helpful in identifying voices at risk of a real pathology, which has to be confirmed later.
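As a rough illustration of the variance-reduction idea in the abstract, the sketch below averages several tapered periodograms of one frame and compares the result with a single Hamming-window periodogram. It is not the authors' implementation: it substitutes sine tapers for the Thomson (DPSS) tapers because they can be generated with NumPy alone, it defaults to uniform weights rather than the adapted weighting the abstract mentions, and the function names and the default of six tapers are hypothetical.

```python
import numpy as np


def sine_tapers(n_samples, n_tapers):
    """Return an (n_tapers, n_samples) array of orthonormal sine tapers.

    Assumption: sine tapers stand in for the Thomson (DPSS) tapers
    named in the abstract.
    """
    n = np.arange(1, n_samples + 1)
    k = np.arange(1, n_tapers + 1)[:, None]
    scale = np.sqrt(2.0 / (n_samples + 1))
    return scale * np.sin(np.pi * k * n / (n_samples + 1))


def multitaper_power_spectrum(frame, n_tapers=6, weights=None):
    """Weighted average of the periodograms obtained with each taper."""
    frame = np.asarray(frame, dtype=float)
    tapers = sine_tapers(len(frame), n_tapers)
    if weights is None:
        # Assumption: uniform weights, not the paper's adapted weights.
        weights = np.full(n_tapers, 1.0 / n_tapers)
    spectra = np.abs(np.fft.rfft(tapers * frame, axis=1)) ** 2
    return np.asarray(weights) @ spectra


def hamming_power_spectrum(frame):
    """Single-taper baseline using a unit-energy Hamming window."""
    frame = np.asarray(frame, dtype=float)
    window = np.hamming(len(frame))
    window = window / np.sqrt(np.sum(window ** 2))
    return np.abs(np.fft.rfft(window * frame)) ** 2
```

In an MFCC pipeline, the averaged spectrum would take the place of the single-window power spectrum before the mel filterbank and DCT stages.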
Pages: 12
Related papers
50 in total
  • [31] Cough Recognition Based on Mel Frequency Cepstral Coefficients and Dynamic Time Warping
    Zhu, Chunmei
    Liu, Baojun
    Li, Ping
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING AND CONTROL SYSTEMS (MECS2015), 2016, : 326 - 329
  • [32] Improved DTW Speech Recognition Algorithm Based on the MEL Frequency Cepstral Coefficients
    Wei Ming-zhe
    Li Xi
    Ren Li-mian
    12TH ANNUAL MEETING OF CHINA ASSOCIATION FOR SCIENCE AND TECHNOLOGY ON INFORMATION AND COMMUNICATION TECHNOLOGY AND SMART GRID, 2010, : 235 - 238
  • [33] PARABOLIC FILTER MEL FREQUENCY CEPSTRAL COEFFICIENT AND FUSION OF FEATURES FOR SPEAKER AGE CLASSIFICATION
    Osman, Mohammed Muntaz
    Buyuk, Osman
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2020, 38 (04): : 2177 - 2191
  • [34] Speech reconstruction from mel frequency cepstral coefficients and pitch frequency
    Chazan, D
    Hoory, R
    Cohen, G
    Zibulski, M
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1299 - 1302
  • [35] Voice spoofing detection using a neural networks assembly considering spectrograms and mel frequency cepstral coefficients
    Hernandez-Nava, Carlos Alberto
    Rincon-Garcia, Eric Alfredo
    Lara-Velazquez, Pedro
    de-los-Cobos-Silva, Sergio Gerardo
    Gutierrez-Andrade, Miguel Angel
    Mora-Gutierrez, Roman Anselmo
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [36] Computing Mel-frequency cepstral coefficients on the power spectrum
    Molau, S
    Pitz, M
    Schlüter, R
    Ney, H
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 73 - 76
  • [37] A Novel Deep Learning Approach for Classification of Bird Sound Using Mel Frequency Cepstral Coefficients
    Saad, Aymen
    Zabidi, Muhammad Mun'im Ahmad
    Kamil, Israa S.
    Sheikh, Usman Ullah
    Iraqi Journal for Computer Science and Mathematics, 2024, 5 (03): : 660 - 670
  • [38] Fingerprint Recognition Using Mel-Frequency Cepstral Coefficients
    Hashad F.G.
    Halim T.M.
    Diab S.M.
    Sallam B.M.
    El-Samie F.E.A.
    Pattern Recognition and Image Analysis, 2010, 20 (03) : 360 - 369
  • [39] Chip design of mel frequency cepstral coefficients for speech recognition
    Wang, JC
    Wang, JF
    Weng, YS
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3658 - 3661
  • [40] Mel-frequency Cepstral Coefficients for Eye Movement Identification
    Nguyen Viet Cuong
    Vu Dinh
    Lam Si Tung Ho
    2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 253 - 260