Improved MFCC-Based Feature for Robust Speaker Identification

被引:11
|
作者
吴尊敬
曹志刚
机构
[1] China
[2] Department of Electronic Engineering
[3] Tsinghua University
[4] State Key Laboratory on Microwave and Digital Communications
[5] Beijing 100084
基金
中国国家自然科学基金;
关键词
Mel-frequency cepstral coefficient (MFCC); robust speaker identification; feature extraction;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.
引用
收藏
页码:158 / 161
页数:4
相关论文
共 50 条
  • [21] Robust speaker identification based on selective use of feature vectors
    Kwon, Soonil
    Narayanan, Shrikanth
    PATTERN RECOGNITION LETTERS, 2007, 28 (01) : 85 - 89
  • [22] Robust feature based on speech harmonic structure for speaker identification
    College of Communication and Information Engineering, Nanjing Univ. of Posts and Telecom., Nanjing 210003, China
    Dianzi Yu Xinxi Xuebao, 2006, 10 (1786-1789):
  • [23] Speaker Identification Using MFCC Feature Extraction ANN Classification Technique
    Singh, Mahesh K.
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (01) : 453 - 467
  • [24] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
    Zhou, Xi
    Fu, Yun
    Liu, Ming
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
  • [25] Robust Automatic Speaker Identification System Using Shuffled MFCC Features
    Barhoush, Mahdi
    Hallawa, Ahmed
    Schmeink, Anke
    2021 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES (ICMLANT II), 2021, : 28 - 33
  • [26] Speaker identification based on combination of MFCC and UMRT based features
    Antony, Anett
    Gopikakumari, R.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 250 - 257
  • [27] MFCC-Based Sound Classification of Honey Bees
    Libal, Urszula
    Biernacki, Pawel
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2024, 70 (04) : 849 - 853
  • [28] MFCc-based feature extraction model for long time period emotion speech using cnn
    Alhlffee M.
    Revue d'Intelligence Artificielle, 2020, 34 (02): : 117 - 123
  • [29] MFCC-based descriptor for bee queen presence detection
    Soares, Bianca Sousa
    Luz, Jederson Sousa
    de Macedo, Valderlandia Francisca
    Veloso e Silva, Romuere Rodrigues
    Duarte de Araujo, Flavio Henrique
    Vieira Magalhaes, Deborah Maria
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201
  • [30] An MFCC-based Secure Framework for Voice Assistant Systems
    Ahmed, Syed Fahad
    Jaffari, Rabeea
    Ahmed, Syed Saad
    Jawaid, Moazzam
    Talpur, Shahnawaz
    2022 INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS), 2022, : 57 - 61