Contextual invariant-integration features for improved speaker-independent speech recognition

Cited by: 18
Authors
Mueller, Florian [1]
Mertins, Alfred [1]
Affiliations
[1] Med Univ Lubeck, Inst Signal Proc, D-23538 Lubeck, Germany
Keywords
Speech recognition; Speaker-independency; Invariant-integration; TRANSFORMATION;
DOI
10.1016/j.specom.2011.02.002
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This work presents a feature-extraction method that is based on the theory of invariant integration. The invariant-integration features (IIFs) are derived from an extended time period, and their computation has a very low complexity. Recognition experiments show that the presented feature type outperforms cepstral coefficients computed with a mel filterbank (MFCCs) or a gammatone filterbank (GTCCs) in matched as well as mismatched training-testing conditions. Even without any speaker adaptation, the presented features yield accuracies that are higher than those of MFCCs combined with vocal tract length normalization (VTLN) in matched training-testing conditions. It is also shown that the IIFs can be successfully combined with additional speaker-adaptation methods to further increase the accuracy. In addition to standard MFCCs, contextual MFCCs are introduced; their performance lies between that of MFCCs and that of IIFs. © 2011 Elsevier B.V. All rights reserved.
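The abstract does not give the feature definition, so the following is only a minimal sketch of the general idea of invariant integration: averaging a monomial of time-frequency values over a group of translations along the frequency axis, so that the result does not change when the input is shifted. The function name `invariant_integration_feature`, its parameters, and the use of cyclic shifts over all channels are assumptions made for illustration; they are not taken from the paper, whose features may use a different window and normalization.

```python
import numpy as np

def invariant_integration_feature(spec, offsets, exponents):
    """Average a monomial of spectral values over all cyclic frequency shifts.

    spec      : array of shape (num_channels, num_frames), e.g. filterbank energies
    offsets   : (channel, frame) index pairs selecting the monomial's factors;
                factors may come from different frames (temporal context)
    exponents : one exponent per factor

    Hypothetical helper for illustration only, not the paper's definition.
    """
    num_channels = spec.shape[0]
    total = 0.0
    for shift in range(num_channels):
        # Translate the whole spectrogram along the frequency axis.
        shifted = np.roll(spec, shift, axis=0)
        term = 1.0
        for (channel, frame), exponent in zip(offsets, exponents):
            term *= shifted[channel, frame] ** exponent
        total += term
    # The group average is unchanged by any cyclic shift of the input.
    return total / num_channels
```

Because the sum runs over every cyclic shift, applying the function to `spec` and to `np.roll(spec, s, axis=0)` gives the same value for any integer `s`. Each feature costs one monomial evaluation per shift, which is consistent with the low computational complexity mentioned in the abstract.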
Pages: 830-841
Number of pages: 12
Related papers
50 in total
  • [31] Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech
    Yilmaz, Emre
    Mitra, Vikramjit
    Sivaraman, Ganesh
    Franco, Horacio
    COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 319 - 334
  • [32] A speaker-independent continuous speech recognition system using biomimetic pattern recognition
    Wang Shoujue
    Qin Hong
    CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (03): : 460 - 462
  • [33] Computer-independent and speaker-independent real time speech recognition system
    Dianxin Kexue/Telecommunications Science, 13 (11): : 28 - 31
  • [34] Practical speaker-independent voice recognition using segmental features
    Kimura, T
    Ashida, A
    Niyada, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (02): : 73 - 81
  • [35] SPEAKER-INDEPENDENT CONTINUOUS SPEECH DICTATION
    GAUVAIN, JL
    LAMEL, LF
    ADDA, G
    ADDADECKER, M
    SPEECH COMMUNICATION, 1994, 15 (1-2) : 21 - 37
  • [36] The study on continuous speech of speaker-independent
    Ye Hong
    CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (4A): : 921 - 924
  • [37] HMM-based integrated method for speaker-independent speech recognition
    Tsinghua Univ, Beijing, China
    Int Conf Signal Process Proc, (613-616):
  • [38] SPEAKER-INDEPENDENT SPEECH RECOGNITION UNIT DEVELOPMENT FOR TELEPHONE LINE USE
    ISHII, N
    IMAI, Y
    NAKATSU, R
    ANDO, M
    JAPAN TELECOMMUNICATIONS REVIEW, 1982, 24 (03): : 267 - 274
  • [39] REFERENCE TEMPLATE ADAPTATION IN SPEAKER-INDEPENDENT ISOLATED WORD SPEECH RECOGNITION
    MCINNES, FR
    JACK, MA
    ELECTRONICS LETTERS, 1987, 23 (24) : 1304 - 1305
  • [40] NORMALIZING THE VOCAL-TRACT LENGTH FOR SPEAKER-INDEPENDENT SPEECH RECOGNITION
    LIN, QG
    CHE, CW
    IEEE SIGNAL PROCESSING LETTERS, 1995, 2 (11) : 201 - 203