Invariant-integration method for robust feature extraction in speaker-independent speech recognition

Authors
Mueller, Florian [1]
Mertins, Alfred [1]
Affiliations
[1] Univ Lubeck, Inst Signal Proc, Lubeck, Germany
Keywords
speech recognition; speaker-independency; invariant integration; monomials; HIDDEN MARKOV-MODELS; NORMALIZATION; TRANSFORM;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The vocal tract length (VTL) is one of the sources of variability that speaker-independent automatic speech recognition (ASR) systems encounter. Standard methods that compensate for the effects of different VTLs within the processing stages of an ASR system often require high computational effort. By using an appropriate warping scheme for the frequency centers of the time-frequency analysis, a change in VTL can be approximately described by a translation in the subband-index space. We present a new type of feature that is based on the principle of invariant integration, and describe a corresponding feature selection method. ASR experiments show the increased robustness of the proposed features in comparison to standard MFCCs.
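The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes that a VTL change acts as an integer shift along the subband index and that a feature is the average of a monomial of subband values over a window of such shifts. The function name, its parameters, and the wrap-around at the band edges are hypothetical choices made only so that the invariance is exact in this toy setting; with real filterbank outputs the edges make it approximate.

```python
import numpy as np

def invariant_integration_feature(frame, indices, exponents, window):
    """Average a monomial of subband values over shifts -window..window.

    frame     : 1-D array of non-negative subband magnitudes for one time frame
    indices   : subband indices that enter the monomial (hypothetical choice)
    exponents : one exponent per index
    window    : half-width of the shift range along the subband axis
    """
    frame = np.asarray(frame, dtype=float)
    indices = np.asarray(indices)
    exponents = np.asarray(exponents, dtype=float)
    shifts = np.arange(-window, window + 1)
    # positions[s, i] = indices[i] + shifts[s]; wrapping around the band edges
    # is a simplifying assumption of this sketch.
    positions = (indices[None, :] + shifts[:, None]) % frame.size
    # One monomial value per shift, then the mean over all shifts.
    monomials = np.prod(frame[positions] ** exponents[None, :], axis=1)
    return monomials.mean()

# Toy check: 9 subbands and window=4 cover every circular shift exactly once,
# so shifting the frame along the subband axis leaves the feature unchanged
# up to rounding.
rng = np.random.default_rng(0)
frame = rng.random(9) + 0.1
original = invariant_integration_feature(frame, [1, 3], [1, 2], window=4)
shifted = invariant_integration_feature(np.roll(frame, 2), [1, 3], [1, 2], window=4)
print(np.isclose(original, shifted))
```

Averaging over the shift range is what removes the dependence on the translation, while the choice of indices and exponents determines which spectral structure the feature still captures; selecting those is the role a feature selection method would play.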
Pages: 2939-2942
Page count: 4