Estimation of general identifiable linear dynamic models with an application in speech recognition

Cited by: 0

Authors:

Tsontzos, G. [1]

Diakoloukas, V. [1]

Koniaris, Ch. [1]

Digalakis, V. [1]

Affiliation:

[1] Tech Univ Crete, Dept Elect & Comp Engn, GR-73100 Khania, Greece

Source:

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007

Keywords:

speech recognition; modeling; identification;

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification code:

070206; 082403;

Abstract:

Although Hidden Markov Models (HMMs) provide a relatively efficient modeling framework for speech recognition, they suffer from several shortcomings that set upper bounds on the performance that can be achieved. Alternatively, linear dynamic models (LDMs) can be used to model speech segments. Several implementations of LDMs have been proposed in the literature. However, all had a restricted structure to satisfy identifiability constraints. In this paper, we relax all these constraints and use a general, canonical form for a linear state-space system that guarantees identifiability for arbitrary state and observation vector dimensions. For this system, we present a novel, element-wise Maximum Likelihood (ML) estimation method. Classification experiments on the AURORA2 speech database show performance gains compared to HMMs, particularly in highly noisy conditions.
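
For orientation, a linear dynamic model of the kind the abstract refers to is commonly written as a linear-Gaussian state-space system. The block below is a generic textbook form only. The symbols F, H, P, R and the time index k are notation assumed here for illustration and are not taken from the paper, and the specific canonical structure the authors impose on the matrices to guarantee identifiability is not reproduced.

```latex
% Generic linear-Gaussian state-space model (assumed notation, not the paper's):
%   x_k : hidden state vector        y_k : observation (feature) vector
%   F   : state transition matrix    H   : observation matrix
%   P,R : state and observation noise covariances
\begin{align}
  x_{k+1} &= F\,x_k + w_k, & w_k &\sim \mathcal{N}(0, P), \\
  y_k     &= H\,x_k + v_k, & v_k &\sim \mathcal{N}(0, R).
\end{align}
```

In this generic form, identifiability is an issue because any invertible change of state coordinates, replacing F with T F T^{-1}, H with H T^{-1}, and P with T P T^{T}, yields the same distribution over observations, which is why the model structure has to be constrained in some way.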


Pages: 453 / +

Page count: 2

