Noise robust estimate of speech dynamics for speaker recognition

Cited by: 0
Authors
Openshaw, JP
Mason, JS
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper investigates the robustness of cepstral-based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for a priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids the negative values normally inherent in such filtering, which present difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise: for example, speaker identification error rates in SNR-mismatched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches, respectively.
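The two ideas in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no parameter values, so the function names, the offset value, the window length, and the hop size below are all hypothetical choices made for illustration. The sketch assumes a non-negative filterbank power spectrogram of shape (frames, bands).

```python
import numpy as np


def masked_log_spectrum(power_spec, offset):
    """Noise masking sketch: add a fixed offset in the linear domain, then take logs.

    `offset` is a hypothetical tuning constant; the abstract does not state a value.
    """
    return np.log(power_spec + offset)


def subband_dynamics(power_spec, win_len=16, hop=8):
    """Sub-band dynamics sketch: Fourier magnitude of each band's time trajectory.

    `win_len` and `hop` (in frames) are assumed values, not taken from the paper.
    Returns an array of shape (n_windows, win_len // 2, n_bands) whose entries are
    magnitudes and therefore never negative, unlike the output of a band-pass
    filter applied directly to the linear trajectories.
    """
    n_frames, _ = power_spec.shape
    window = np.hanning(win_len)[:, None]
    segments = []
    for start in range(0, n_frames - win_len + 1, hop):
        seg = power_spec[start:start + win_len] * window
        mag = np.abs(np.fft.rfft(seg, axis=0))  # (win_len // 2 + 1, n_bands)
        segments.append(mag[1:])  # drop the DC (static) component
    return np.stack(segments)


# Synthetic data for illustration only (not speech, not results from the paper).
rng = np.random.default_rng(0)
clean = rng.uniform(0.05, 2.0, size=(100, 20))
noisy = clean + 0.1  # additive noise in the linear power domain

# Clean/noisy mismatch in the log domain, without and with the fixed offset.
plain_gap = np.mean(np.abs(np.log(noisy) - np.log(clean)))
masked_gap = np.mean(
    np.abs(masked_log_spectrum(noisy, 1.0) - masked_log_spectrum(clean, 1.0))
)

dynamics = subband_dynamics(noisy)
```

In this toy setting `masked_gap` is smaller than `plain_gap`, because the offset flattens the logarithm in low-energy regions where additive noise dominates. The values in `dynamics` are magnitudes, so a logarithm can be applied to them without encountering negative inputs.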
Pages: 925-928
Page count: 4