Estimation of general identifiable linear dynamic models with an application in speech recognition

被引:0
|
作者
Tsontzos, G. [1 ]
Diakoloukas, V. [1 ]
Koniaris, Ch. [1 ]
Digalakis, V. [1 ]
机构
[1] Tech Univ Crete, Dept Elect & Comp Engn, GR-73100 Khania, Greece
关键词
speech recognition; modeling; identification;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Although Hidden Markov Models (HMMs) provide a relatively efficient modeling framework for speech recognition, they suffer from several shortcomings which set upper bounds in the performance that can be achieved. Alternatively, linear dynamic models (LDM) can be used to model speech segments. Several implementations of LDM have been proposed in the literature. However, all had a restricted structure to satisfy identifiability constraints. In this paper, we relax all these constraints and use a general, canonical form for a linear state-space system that guarantees identifiability for arbitrary state and observation vector dimensions. For this system, we present a novel, element-wise Maximum Likelihood (ML) estimation method. Classification experiments on the AURORA2 speech database show performance gains compared to HMMs, particularly on highly noisy conditions.
引用
收藏
页码:453 / +
页数:2
相关论文
共 50 条
  • [41] Voicing-character estimation of speech spectra:: Application to noise robust speech recognition
    Jancovic, Peter
    Kokuer, Munevver
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 257 - 260
  • [42] Predictive models for sequence modelling, application to speech and character recognition
    Gallinari, P
    ADAPTIVE PROCESSING OF SEQUENCES AND DATA STRUCTURES, 1998, 1387 : 418 - 434
  • [43] Competitive Robust Estimation for Uncertain Linear Dynamic Models
    Correa, Gilberto Oliveira
    Talavera, Alvaro
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (18) : 4847 - 4861
  • [44] ANALYTIC DERIVATIVES FOR ESTIMATION OF LINEAR DYNAMIC-MODELS
    ZADROZNY, PA
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1989, 18 (6-7) : 539 - 553
  • [45] Localized Bayes estimation for non-identifiable models
    Takamatsu, Shingo
    Nakajima, Shinichi
    Watanabe, Sumio
    NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2006, 4232 : 650 - 659
  • [46] Estimation of Nonparametric Noise Models for Linear Dynamic Systems
    Schoukens, Johan
    Pintelon, Rik
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2009, 58 (08) : 2468 - 2474
  • [47] GQL estimation in linear dynamic models for panel data
    Sun, Bingrui
    Sutradhar, Brajendra C.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2013, 83 (03) : 568 - 580
  • [48] Quality Estimation of Speech Recognition Features for Dynamic Time Warping Classifier
    Lileikyte, Rasa
    Telksnys, Laimutis
    INFORMATION TECHNOLOGY AND CONTROL, 2012, 41 (03): : 268 - 273
  • [49] Automatic Estimation of Scaling Factors Among Probabilistic Models in Speech Recognition
    Emori, Tadashi
    Onishi, Yoshifumi
    Shinoda, Koichi
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1229 - +
  • [50] Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
    Axelrod, Scott
    Goel, Vaibhava
    Gopinath, Ramesh
    Olsen, Peder
    Visweswariah, Karthik
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 172 - 189