Fast Speech Keyword Recognition Based on Improved Filler Model

被引:0
|
作者
Wang, Yang [1 ,2 ]
Yang, Jie [1 ,2 ]
Zhang, Le [3 ]
机构
[1] Wuhan Univ Technol, Coll Informat Engn, Wuhan, Hubei, Peoples R China
[2] Wuhan Univ Technol, Key Lab Fiber Opt Sensing Technol & Informat Proc, Minist Educ, Wuhan, Hubei, Peoples R China
[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield, S Yorkshire, England
基金
中国国家自然科学基金;
关键词
spoken keywords detection; filler model; HMM; LDA;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most traditional template matching based keyword recognition methods don't need training data, just rely on frame matching. However, the recognition speed is relatively slow and it can't be used in practice. The LVCSR-based method needs to convert the speech signal into text signal before recognition, which has an important impact on the final recognition performance. In this paper, we propose a method based on the filler model framework, which selects the syllable instead of using words as the modelling unit. The search space of our method is composed of all the syllables rather than words. By fixing a part of the Hidden Markov Model (HMM) state probability matrix parameters, our method can obtain important model parameters for a more sufficient training. Meanwhile, a two-stage model training strategy is proposed to reduce the artificial markings of training speech and Linear Discriminant Analysis (LDA) is introduced to improve the efficiency of system identification. Experimental results show that our method can effectively improve the detection rate of keywords and achieve similar detection time under the same conditions.
引用
收藏
页码:530 / 534
页数:5
相关论文
共 50 条
  • [21] Based on STM32 of CNN Speech Keyword Command Recognition System
    KUANG Wenbo
    LUO Weiping
    Instrumentation, 2023, 10 (01) : 17 - 22
  • [22] Study of the design and implementation of speech keyword recognition system based on streaming media
    Zhang Chenyan
    Lu Shuqin
    Sun Chengli
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 760 - 763
  • [23] Improvements on Speech Recognition for Fast Speech
    Lee, Ki-Seung
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (02): : 88 - 95
  • [24] CSF images fast recognition model based on improved convolutional Neural Network
    Huang, Wenming
    Leng, Jinqiang
    Deng, Zhenrong
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 516 - 522
  • [25] SPELL MY NAME: KEYWORD BOOSTED SPEECH RECOGNITION
    Jung, Namkyu
    Kim, Geonmin
    Chung, Joon Son
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6642 - 6646
  • [26] Deep Autoencoder based Speech Features for Improved Dysarthric Speech Recognition
    Vachhani, Bhavik
    Bhat, Chitralekha
    Das, Biswajit
    Kopparapu, Sunil Kumar
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1854 - 1858
  • [27] A Metaphor Recognition Model based on LSTM and Keyword Similarity Computation
    Chen, Zhiheng
    Fu, Lijun
    Wang, Hongjun
    Liu, YuJiang
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2347 - 2352
  • [28] An anti-noise speech recognition model based on improved Wiener filter and PUM
    Lu Xuanmin
    Gao Yue
    Xiao Peng
    Duan Chao
    PROCEEDINGS OF 2016 SIXTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2016), 2016, : 5 - 10
  • [29] An improved maximum model distance approach for HMM-based speech recognition systems
    He, QH
    Kwong, S
    Man, KF
    Tang, KS
    PATTERN RECOGNITION, 2000, 33 (10) : 1749 - 1758
  • [30] Research on Acoustic Model of Speech Recognition Based on Neural Network with Improved Gating Unit
    Liu, Wei
    Yan, Yan
    Yu, Jianqiang
    Sun, Yiming
    2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 2364 - 2368