Invariant-integration method for robust feature extraction in speaker-independent speech recognition

被引：0

作者：

Mueller, Florian ^{[1
]}

Mertins, Alfred ^{[1
]}

机构：

[1] Univ Lubeck, Inst Signal Proc, Lubeck, Germany

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

speech recognition; speaker-independency; invariant integration; monomials; HIDDEN MARKOV-MODELS; NORMALIZATION; TRANSFORM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The vocal tract length (VTL) is one of the variabilities that speaker-independent automatic speech recognition (ASR) systems encounter. Standard methods to compensate for the effects of different VTLs within the processing stages of the ASR systems often have a high computational effort. By using an appropriate warping scheme for the frequency centers of the time-frequency analysis, a change in VTL can be approximately described by a translation in the subband-index space. We present a new type of features that is based on the principle of invariant integration, and an according feature selection method is described. ASR experiments show the increased robustness of the proposed features in comparison to standard MFCCs.

引用

页码：2939 / 2942

页数：4

共 50 条

[41] A speaker-independent continuous speech recognition system using biomimetic pattern recognition
Wang Shoujue
Qin Hong
CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (03): : 460 - 462
[42] Computer-independent and speaker-independent real time speech recognition system
Dianxin Kexue/Telecommunications Science, 13 (11): : 28 - 31
[43] IMPROVED SPEAKER-INDEPENDENT EMOTION RECOGNITION FROM SPEECH USING TWO-STAGE FEATURE REDUCTION
Nazid, Hasrul Mohd
Muthusamy, Hariharan
Vijean, Vikneswaran
Yaacob, Sazali
JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2015, 14 : 57 - 76
[44] SPEAKER-INDEPENDENT CONTINUOUS SPEECH DICTATION
GAUVAIN, JL
LAMEL, LF
ADDA, G
ADDADECKER, M
SPEECH COMMUNICATION, 1994, 15 (1-2) : 21 - 37
[45] The study on continuous speech of speaker-independent
Ye Hong
CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (4A): : 921 - 924
[46] SPEAKER-INDEPENDENT SPEECH RECOGNITION UNIT DEVELOPMENT FOR TELEPHONE LINE USE
ISHII, N
IMAI, Y
NAKATSU, R
ANDO, M
JAPAN TELECOMMUNICATIONS REVIEW, 1982, 24 (03): : 267 - 274
[47] REFERENCE TEMPLATE ADAPTATION IN SPEAKER-INDEPENDENT ISOLATED WORD SPEECH RECOGNITION
MCINNES, FR
JACK, MA
ELECTRONICS LETTERS, 1987, 23 (24) : 1304 - 1305
[48] NORMALIZING THE VOCAL-TRACT LENGTH FOR SPEAKER-INDEPENDENT SPEECH RECOGNITION
LIN, QG
CHE, CW
IEEE SIGNAL PROCESSING LETTERS, 1995, 2 (11) : 201 - 203
[49] DSP-based large vocabulary speaker-independent speech recognition
Hirayama, H
Yoshida, K
Koga, S
Hattori, H
NEC RESEARCH & DEVELOPMENT, 1996, 37 (04): : 528 - 534
[50] SPEAKER-INDEPENDENT SPEECH-RECOGNITION SYSTEM BASED ON LINEAR PREDICTION
GUPTA, VN
BRYAN, JK
GOWDY, JN
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (01): : 27 - 33

← 1 2 3 4 5 →