Fast Speech Keyword Recognition Based on Improved Filler Model

Authors
Wang, Yang [1, 2]
Yang, Jie [1, 2]
Zhang, Le [3]
Affiliations
[1] Wuhan Univ Technol, Coll Informat Engn, Wuhan, Hubei, Peoples R China
[2] Wuhan Univ Technol, Key Lab Fiber Opt Sensing Technol & Informat Proc, Minist Educ, Wuhan, Hubei, Peoples R China
[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield, S Yorkshire, England
Funding
National Natural Science Foundation of China
Keywords
spoken keywords detection; filler model; HMM; LDA
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Most traditional keyword recognition methods based on template matching need no training data and rely only on frame matching. However, their recognition speed is relatively slow, which makes them impractical. Methods based on large-vocabulary continuous speech recognition (LVCSR) must first convert the speech signal into text, and this conversion has an important impact on the final recognition performance. In this paper, we propose a method based on the filler model framework that uses syllables rather than words as the modelling unit, so the search space consists of all syllables rather than words. By fixing part of the parameters of the Hidden Markov Model (HMM) state probability matrix, our method allows the important model parameters to be trained more sufficiently. In addition, a two-stage model training strategy is proposed to reduce the manual labelling of training speech, and Linear Discriminant Analysis (LDA) is introduced to improve the efficiency of system identification. Experimental results show that our method effectively improves the keyword detection rate while achieving similar detection time under the same conditions.
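The general idea of a syllable-level keyword/filler comparison can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes one state per syllable, ignores transition probabilities, and takes per-frame syllable log-likelihoods as given. All function names and the toy data are hypothetical.

```python
import numpy as np

def keyword_alignment_score(loglik, keyword):
    """Best left-to-right alignment of `keyword` (syllable indices) over all frames.

    loglik[t, s] is the assumed log-likelihood of syllable s at frame t.
    Each keyword syllable must cover at least one frame.
    """
    T, K = loglik.shape[0], len(keyword)
    score = np.full((T, K), -np.inf)
    score[0, 0] = loglik[0, keyword[0]]
    for t in range(1, T):
        for k in range(K):
            stay = score[t - 1, k]
            advance = score[t - 1, k - 1] if k > 0 else -np.inf
            score[t, k] = max(stay, advance) + loglik[t, keyword[k]]
    return score[T - 1, K - 1]

def filler_score(loglik):
    """Free syllable loop (filler): pick the best syllable at every frame."""
    return loglik.max(axis=1).sum()

def keyword_confidence(loglik, keyword):
    """Per-frame log-likelihood ratio of keyword path vs. filler path (always <= 0)."""
    T = loglik.shape[0]
    return (keyword_alignment_score(loglik, keyword) - filler_score(loglik)) / T

# Toy segment: 6 frames, 3 syllables; the spoken sequence is syllables 0, 2, 1.
loglik = np.full((6, 3), -5.0)
for t, s in enumerate([0, 0, 2, 2, 1, 1]):
    loglik[t, s] = -1.0

conf_match = keyword_confidence(loglik, [0, 2, 1])
conf_mismatch = keyword_confidence(loglik, [1, 0, 2])
print(conf_match, conf_mismatch)
```

A segment would be accepted as a keyword when the confidence exceeds a tuned threshold; a keyword with more syllables than the segment has frames scores negative infinity in this sketch.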
Pages: 530-534
Page count: 5