Adaptation of Hidden Markov Models for Recognizing Speech of Reduced Frame Rate

被引:20
|
作者
Lee, Lee-Min [1 ]
Jean, Fu-Rong [2 ]
机构
[1] Dayeh Univ, Dept Elect Engn, Changhua 51591, Taiwan
[2] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
关键词
Adaptation; distributed speech recognition (DSR); hidden Markov model (HMM); reduced frame rate (RFR); RECOGNITION;
D O I
10.1109/TCYB.2013.2240450
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The frame rate of the observation sequence in distributed speech recognition applications may be reduced to suit a resource-limited front-end device. In order to use models trained using full-frame-rate data in the recognition of reduced-frame-rate (RFR) data, we propose a method for adapting the transition probabilities of hidden Markov models (HMMs) to match the frame rate of the observation. Experiments on the recognition of clean and noisy connected digits are conducted to evaluate the proposed method. Experimental results show that the proposed method can effectively compensate for the frame-rate mismatch between the training and the test data. Using our adapted model to recognize the RFR speech data, one can significantly reduce the computation time and achieve the same level of accuracy as that of a method, which restores the frame rate using data interpolation.
引用
收藏
页码:2114 / 2121
页数:8
相关论文
共 50 条
  • [31] BAYESIAN SENSING HIDDEN MARKOV MODELS FOR SPEECH RECOGNITION
    Saon, George
    Chien, Jen-Tzung
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5056 - 5059
  • [32] Speech emotion recognition using hidden Markov models
    Nwe, TL
    Foo, SW
    De Silva, LC
    SPEECH COMMUNICATION, 2003, 41 (04) : 603 - 623
  • [33] Speech animation using coupled hidden Markov models
    Xie, Lei
    Liu, Zhi-Qiang
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 1128 - +
  • [34] Fuzzy hidden Markov models for speech and speaker recognition
    Tran, Dat
    Wagner, Michael
    Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS, 1999, : 426 - 430
  • [35] Speech defect analysis using Hidden Markov Models
    Chaloupka, Zdenek
    Uhlir, Jan
    RADIOENGINEERING, 2007, 16 (01) : 67 - 72
  • [36] REVISITING HIDDEN MARKOV MODELS FOR SPEECH EMOTION RECOGNITION
    Mao, Shuiyang
    Tao, Dehua
    Zhang, Guangyan
    Ching, P. C.
    Lee, Tan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6715 - 6719
  • [37] IMPROVED HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION
    AUBERT, X
    BOURLARD, H
    KAMP, Y
    WELLEKENS, CJ
    PHILIPS JOURNAL OF RESEARCH, 1988, 43 (3-4) : 224 - 245
  • [38] Fuzzy Hidden Markov Models for Indonesian Speech Classification
    Yulita, Intan Nurma
    The, Houw Liong
    Adiwijaya
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2012, 16 (03) : 381 - 387
  • [39] Modeling and recognizing human trajectories with beta process hidden Markov models
    Sun, Shiliang
    Zhao, Jing
    Gao, Qingbin
    PATTERN RECOGNITION, 2015, 48 (08) : 2407 - 2417
  • [40] Using hidden Markov models for recognizing action primitives in complex actions
    Kruger, Volker
    Grest, Daniel
    IMAGE ANALYSIS, PROCEEDINGS, 2007, 4522 : 203 - +