Estimation of general identifiable linear dynamic models with an application in speech recognition

被引:0
|
作者
Tsontzos, G. [1 ]
Diakoloukas, V. [1 ]
Koniaris, Ch. [1 ]
Digalakis, V. [1 ]
机构
[1] Tech Univ Crete, Dept Elect & Comp Engn, GR-73100 Khania, Greece
关键词
speech recognition; modeling; identification;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Although Hidden Markov Models (HMMs) provide a relatively efficient modeling framework for speech recognition, they suffer from several shortcomings which set upper bounds in the performance that can be achieved. Alternatively, linear dynamic models (LDM) can be used to model speech segments. Several implementations of LDM have been proposed in the literature. However, all had a restricted structure to satisfy identifiability constraints. In this paper, we relax all these constraints and use a general, canonical form for a linear state-space system that guarantees identifiability for arbitrary state and observation vector dimensions. For this system, we present a novel, element-wise Maximum Likelihood (ML) estimation method. Classification experiments on the AURORA2 speech database show performance gains compared to HMMs, particularly on highly noisy conditions.
引用
收藏
页码:453 / +
页数:2
相关论文
共 50 条
  • [1] Speech recognition using linear dynamic models
    Frankel, Joe
    King, Simon
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 246 - 256
  • [2] Continuous speech recognition using linear dynamic models
    Ma, Tao
    Srinivasan, Sundararajan
    Lazarou, Georgios
    Picone, Joseph
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 11 - 16
  • [3] Estimation of mixtures of stochastic dynamic trajectories: Application to continuous speech recognition
    Afify, M
    Gong, YF
    Haton, JP
    COMPUTER SPEECH AND LANGUAGE, 1996, 10 (01): : 23 - 36
  • [4] Switching linear dynamic models for noise robust in-car speech recognition
    Schuller, Bjoern
    Woellmer, Martin
    Moosmayr, Tobias
    Ruske, Guenther
    Rigoll, Gerhard
    PATTERN RECOGNITION, 2008, 5096 : 244 - +
  • [5] On the Exploitation of Hidden Markov Models and Linear Dynamic Models in a Hybrid Decoder Architecture for Continuous Speech Recognition
    Leutnant, Volker
    Haeb-Umbach, Reinhold
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2946 - 2949
  • [6] ML Estimation of a Stochastic Linear System with the EM Algorithm and Its Application to Speech Recognition
    Digalakis, V.
    Rohlicek, J. R.
    Ostendorf, M.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (04): : 431 - 442
  • [7] Shrinkage estimation in general linear models
    An, Lihua
    Nkurunziza, Severien
    Fung, Karen Y.
    Krewski, Daniel
    Luginaah, Isaac
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (07) : 2537 - 2549
  • [8] Speech Emotion Recognition Based on Dynamic Models
    Lv, Guoyun
    Hu, Shuixian
    Lu, Xipan
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 480 - 484
  • [9] Identifiable reparametrizations of linear compartment models
    Meshkat, Nicolette
    Sullivant, Seth
    JOURNAL OF SYMBOLIC COMPUTATION, 2014, 63 : 46 - 67
  • [10] The Application of Hidden Markov Models in Speech Recognition
    Gales, Mark
    Young, Steve
    FOUNDATIONS AND TRENDS IN SIGNAL PROCESSING, 2007, 1 (03): : 195 - 304