Invariant-integration method for robust feature extraction in speaker-independent speech recognition

被引：0

作者：

Mueller, Florian ^{[1
]}

Mertins, Alfred ^{[1
]}

机构：

[1] Univ Lubeck, Inst Signal Proc, Lubeck, Germany

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

speech recognition; speaker-independency; invariant integration; monomials; HIDDEN MARKOV-MODELS; NORMALIZATION; TRANSFORM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The vocal tract length (VTL) is one of the variabilities that speaker-independent automatic speech recognition (ASR) systems encounter. Standard methods to compensate for the effects of different VTLs within the processing stages of the ASR systems often have a high computational effort. By using an appropriate warping scheme for the frequency centers of the time-frequency analysis, a change in VTL can be approximately described by a translation in the subband-index space. We present a new type of features that is based on the principle of invariant integration, and an according feature selection method is described. ASR experiments show the increased robustness of the proposed features in comparison to standard MFCCs.

引用

页码：2939 / 2942

页数：4

共 50 条

[11] Japanese Speaker-Independent Homonyms Speech Recognition
Murakami, Jin'ichi
Hotta, Haseo
COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 306 - 313
[12] The self-organizing feature map used for speaker-independent speech recognition
Yuan, L
Zhou, LQ
Liu, ZM
ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 733 - 736
[13] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
Hariharan, R
Viikki, O
SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361
[14] HMM-based integrated method for speaker-independent speech recognition
Tsinghua Univ, Beijing, China
Int Conf Signal Process Proc, (613-616):
[15] ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
Mueller, Florian
Mertins, Alfred
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4905 - 4908
[16] A HMM-based integrated method for speaker-independent speech recognition
Zhang, YY
Zhu, XY
ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 613 - 616
[17] ON USING THE AUDITORY IMAGE MODEL AND INVARIANT-INTEGRATION FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
Mueller, Florian
Mertins, Alfred
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4905 - 4908
[18] Speaker-Independent Speech Recognition using Visual Features
Pooventhiran, G.
Sandeep, A.
Manthiravalli, K.
Harish, D.
Renuka, Karthika D.
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 616 - 620
[19] Generalized Cyclic Transformations in Speaker-Independent Speech Recognition
Mueller, Florian
Belilovsky, Eugene
Mertins, Alfred
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 211 - 215
[20] Improved Emotion Recognition With a Novel Speaker-Independent Feature
Kim, Eun Ho
Hyun, Kyung Hak
Kim, Soo Hyun
Kwak, Yoon Keun
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2009, 14 (03) : 317 - 325

← 1 2 3 4 5 →