Fast Speech Keyword Recognition Based on Improved Filler Model

被引:0
|
作者
Wang, Yang [1 ,2 ]
Yang, Jie [1 ,2 ]
Zhang, Le [3 ]
机构
[1] Wuhan Univ Technol, Coll Informat Engn, Wuhan, Hubei, Peoples R China
[2] Wuhan Univ Technol, Key Lab Fiber Opt Sensing Technol & Informat Proc, Minist Educ, Wuhan, Hubei, Peoples R China
[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield, S Yorkshire, England
基金
中国国家自然科学基金;
关键词
spoken keywords detection; filler model; HMM; LDA;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most traditional template matching based keyword recognition methods don't need training data, just rely on frame matching. However, the recognition speed is relatively slow and it can't be used in practice. The LVCSR-based method needs to convert the speech signal into text signal before recognition, which has an important impact on the final recognition performance. In this paper, we propose a method based on the filler model framework, which selects the syllable instead of using words as the modelling unit. The search space of our method is composed of all the syllables rather than words. By fixing a part of the Hidden Markov Model (HMM) state probability matrix parameters, our method can obtain important model parameters for a more sufficient training. Meanwhile, a two-stage model training strategy is proposed to reduce the artificial markings of training speech and Linear Discriminant Analysis (LDA) is introduced to improve the efficiency of system identification. Experimental results show that our method can effectively improve the detection rate of keywords and achieve similar detection time under the same conditions.
引用
收藏
页码:530 / 534
页数:5
相关论文
共 50 条
  • [41] An Improved Parallel Model Combination Method for Noisy Speech Recognition
    Veisi, Hadi
    Sameti, Hossein
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 237 - 242
  • [42] An improved noisy channel model for speech recognition error correction
    Li, Baoxiang
    Liu, Gang
    Guo, Jun
    Lu, Yueming
    International Journal of Advancements in Computing Technology, 2012, 4 (12) : 110 - 118
  • [43] Improved hybrid ANN/HMM model applies in speech recognition
    Xi, XJ
    Lin, KH
    ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 1373 - 1376
  • [44] An improved model of masking effects for robust speech recognition system
    Dai, Peng
    Soon, Ing Yann
    SPEECH COMMUNICATION, 2013, 55 (03) : 387 - 396
  • [45] Improved hidden Markov model for speech recognition and POS tagging
    袁里驰
    Journal of Central South University, 2012, 19 (02) : 511 - 516
  • [46] Improved hidden Markov model for speech recognition and POS tagging
    Li-chi Yuan
    Journal of Central South University, 2012, 19 : 511 - 516
  • [47] Improved hidden Markov model for speech recognition and POS tagging
    Yuan Li-chi
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2012, 19 (02) : 511 - 516
  • [48] FAST SEGMENT SEARCH FOR CORPUS-BASED SPEECH ENHANCEMENT BASED ON SPEECH RECOGNITION TECHNOLOGY
    Ogawa, Atsunori
    Kinoshita, Keisuke
    Hori, Takaaki
    Nakatani, Tomohiro
    Nakamura, Atsushi
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [49] Low-Resource Speech Recognition and Keyword-Spotting
    Gales, Mark J. F.
    Knill, Kate M.
    Ragni, Anton
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 3 - 19
  • [50] Keyword Spotting for Google Assistant Using Contextual Speech Recognition
    Michaely, Assaf Hurwitz
    Zhang, Xuedong
    Simko, Gabor
    Parada, Carolina
    Aleksic, Petar
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 272 - 278