Improved MFCC-Based Feature for Robust Speaker Identification

被引:11
|
作者
吴尊敬
曹志刚
机构
[1] China
[2] Department of Electronic Engineering
[3] Tsinghua University
[4] State Key Laboratory on Microwave and Digital Communications
[5] Beijing 100084
基金
中国国家自然科学基金;
关键词
Mel-frequency cepstral coefficient (MFCC); robust speaker identification; feature extraction;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.
引用
收藏
页码:158 / 161
页数:4
相关论文
共 50 条
  • [41] ROBUST FEATURE FRONT-END FOR SPEAKER IDENTIFICATION
    Liu, Gang
    Lei, Yun
    Hansen, John H. L.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4233 - 4236
  • [42] Improved MFCC Algorithm in Speaker Recognition System
    Shi, Yibo
    Wang, Li
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [43] Gender Identification of a Speaker Using MFCC and GMM
    Yucesoy, Ergun
    Nabiyev, Vasif V.
    2013 8TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2013, : 626 - 629
  • [44] MFCC and Similarity Measurements for Speaker Identification Systems
    Maazouzi, A.
    Aqili, N.
    Aamoud, A.
    Raji, M.
    Hammouch, A.
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES (ICEIT 2017), 2017,
  • [45] MFCC-based Houston Toad Call Detection using LSTM
    Al Bashit, Abdullah
    Valles, Damian
    2019 IEEE INTERNATIONAL SYMPOSIUM ON MEASUREMENT AND CONTROL IN ROBOTICS (ISMCR): ROBOTICS FOR THE BENEFIT OF HUMANITY, 2019,
  • [46] Feature extraction for poultry vocalization recognition based on improved MFCC
    Key Laboratory of Agricultural Bioenvironmental Engineering, College of Water Conservancy and Civil Engineering, China Agricultural University, Beijing 100083, China
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2008, 24 (11): : 202 - 205
  • [47] DNN-based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification
    Oo, Zeyan
    Kawakami, Yuta
    Wang, Longbiao
    Nakagawa, Seiichi
    Xiao, Xiong
    Iwahashi, Masahiro
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2204 - 2208
  • [48] MFCC-Based Bangla Vowel Phoneme Recognition from Micro Clips
    Paul, Bachchu
    Mukherjee, Himadri
    Phadikar, Santanu
    Roy, Kaushik
    INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019, 2020, 1034 : 511 - 519
  • [49] A computer-aided MFCC-based HMM system for automatic auscultation
    Chauhan, Sunita
    Wang, Ping
    Lim, Chu Sing
    Anantharaman, V.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2008, 38 (02) : 221 - 233
  • [50] Gender Detection with Heart Sound Using MFCC-based Statistical Features
    Dal, Ferda
    Ozbek, I. Yucel
    2017 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2017, : 553 - 556