Invariant-integration method for robust feature extraction in speaker-independent speech recognition

被引:0
|
作者
Mueller, Florian [1 ]
Mertins, Alfred [1 ]
机构
[1] Univ Lubeck, Inst Signal Proc, Lubeck, Germany
关键词
speech recognition; speaker-independency; invariant integration; monomials; HIDDEN MARKOV-MODELS; NORMALIZATION; TRANSFORM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The vocal tract length (VTL) is one of the variabilities that speaker-independent automatic speech recognition (ASR) systems encounter. Standard methods to compensate for the effects of different VTLs within the processing stages of the ASR systems often have a high computational effort. By using an appropriate warping scheme for the frequency centers of the time-frequency analysis, a change in VTL can be approximately described by a translation in the subband-index space. We present a new type of features that is based on the principle of invariant integration, and an according feature selection method is described. ASR experiments show the increased robustness of the proposed features in comparison to standard MFCCs.
引用
收藏
页码:2939 / 2942
页数:4
相关论文
共 50 条
  • [1] Contextual invariant-integration features for improved speaker-independent speech recognition
    Mueller, Florian
    Mertins, Alfred
    SPEECH COMMUNICATION, 2011, 53 (06) : 830 - 841
  • [2] Noise Robust Speaker-Independent Speech Recognition with Invariant-Integration Features Using Power-Bias Subtraction
    Mueller, Florian
    Mertins, Alfred
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1688 - 1691
  • [3] Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition
    Lu, Cheng
    Zong, Yuan
    Zheng, Wenming
    Li, Yang
    Tang, Chuangao
    Schuller, Bjoern W.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2217 - 2230
  • [4] A novel robust feature of speech signal based on the mellin transform for speaker-independent speech recognition
    Chen, JD
    Xu, B
    Huang, TY
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 629 - 632
  • [5] A FEATURE SELECTION AND FEATURE FUSION COMBINATION METHOD FOR SPEAKER-INDEPENDENT SPEECH EMOTION RECOGNITION
    Jin, Yun
    Song, Peng
    Zheng, Wenming
    Zhao, Li
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [6] Robust Features for Speaker-Independent Speech Recognition Based on a Certain Class of Translation-Invariant Transformations
    Mueller, Florian
    Mertins, Alfred
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 111 - 119
  • [7] Biomimetic pattern recognition for speaker-independent speech recognition
    Qin, H
    Wang, SJ
    Sun, H
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1290 - 1294
  • [8] Predictor codebook for speaker-independent speech recognition
    Kawabata, Takeshi
    Systems and Computers in Japan, 1994, 25 (01): : 37 - 46
  • [9] SPEAKER-INDEPENDENT VOWEL RECOGNITION IN PERSIAN SPEECH
    Nazari, Mohammad
    Sayadiyan, Abolghasem
    Valiollahzadeh, Seyyed Majid
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 672 - 676
  • [10] PREDICTOR CODEBOOK FOR SPEAKER-INDEPENDENT SPEECH RECOGNITION
    KAWABATA, T
    SYSTEMS AND COMPUTERS IN JAPAN, 1994, 25 (01) : 37 - 46