Fast Speech Keyword Recognition Based on Improved Filler Model

被引:0
|
作者
Wang, Yang [1 ,2 ]
Yang, Jie [1 ,2 ]
Zhang, Le [3 ]
机构
[1] Wuhan Univ Technol, Coll Informat Engn, Wuhan, Hubei, Peoples R China
[2] Wuhan Univ Technol, Key Lab Fiber Opt Sensing Technol & Informat Proc, Minist Educ, Wuhan, Hubei, Peoples R China
[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield, S Yorkshire, England
基金
中国国家自然科学基金;
关键词
spoken keywords detection; filler model; HMM; LDA;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most traditional template matching based keyword recognition methods don't need training data, just rely on frame matching. However, the recognition speed is relatively slow and it can't be used in practice. The LVCSR-based method needs to convert the speech signal into text signal before recognition, which has an important impact on the final recognition performance. In this paper, we propose a method based on the filler model framework, which selects the syllable instead of using words as the modelling unit. The search space of our method is composed of all the syllables rather than words. By fixing a part of the Hidden Markov Model (HMM) state probability matrix parameters, our method can obtain important model parameters for a more sufficient training. Meanwhile, a two-stage model training strategy is proposed to reduce the artificial markings of training speech and Linear Discriminant Analysis (LDA) is introduced to improve the efficiency of system identification. Experimental results show that our method can effectively improve the detection rate of keywords and achieve similar detection time under the same conditions.
引用
收藏
页码:530 / 534
页数:5
相关论文
共 50 条
  • [31] Speech Keyword Spotting Method Based on Swin-Transformer Model
    Chengli Sun
    Bikang Chen
    Feilong Chen
    Yan Leng
    Qiaosheng Guo
    International Journal of Computational Intelligence Systems, 17
  • [32] Fast-LSTM Acoustic Model for Distant Speech Recognition
    Trianto, Rezki
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [33] A fast algorithm for parallel model combination for noisy speech recognition
    Hwang, TH
    Wang, HC
    COMPUTER SPEECH AND LANGUAGE, 2000, 14 (02): : 81 - 100
  • [34] Speech Recognition for Keyword Spotting using a Set of Modulation Based Features - Preliminary Results
    Gopalan, Kaliappan
    Chu, Tao
    IMCIC 2010: INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL II, 2010, : 32 - 36
  • [35] Confidence estimation and keyword extraction from speech recognition result based on Web information
    Kensuke, Hara
    Hideki, Sekiya
    Tetsuya, Kawase
    Satoshi, Tamura
    Satoru, Hayamizu
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [36] Improved Phoneme-Based Myoelectric Speech Recognition
    Zhou, Quan
    Jiang, Ning
    Englehart, Kevin
    Hudgins, Bernard
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (08) : 2016 - 2023
  • [37] Speech Recognition of Accented Mandarin Based on Improved Conformer
    Yang, Xing-Yao
    Zhang, Shao-Dong
    Xiao, Rui
    Yu, Jiong
    Li, Zi-Yang
    SENSORS, 2023, 23 (08)
  • [38] An improved speech recognition algorithm based on difference subspace
    Zhang, XY
    Wu, JP
    Zhang, QS
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 1116 - 1119
  • [39] Improved Keyword Spotting based on Keyword/Garbage Models
    Chen, Qiyu
    Zhang, Weibin
    Xu, Xiangmin
    Xing, Xiaofen
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [40] Robust recognition of fast speech
    Lee, Ki-Seung
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (08) : 2456 - 2459