Fast Speech Keyword Recognition Based on Improved Filler Model

被引：0

作者：

Wang, Yang ^{[1
,2
]}

Yang, Jie ^{[1
,2
]}

Zhang, Le ^{[3
]}

机构：

[1] Wuhan Univ Technol, Coll Informat Engn, Wuhan, Hubei, Peoples R China

[2] Wuhan Univ Technol, Key Lab Fiber Opt Sensing Technol & Informat Proc, Minist Educ, Wuhan, Hubei, Peoples R China

[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield, S Yorkshire, England

来源：

2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC) | 2017年

基金：

中国国家自然科学基金;

关键词：

spoken keywords detection; filler model; HMM; LDA;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Most traditional template matching based keyword recognition methods don't need training data, just rely on frame matching. However, the recognition speed is relatively slow and it can't be used in practice. The LVCSR-based method needs to convert the speech signal into text signal before recognition, which has an important impact on the final recognition performance. In this paper, we propose a method based on the filler model framework, which selects the syllable instead of using words as the modelling unit. The search space of our method is composed of all the syllables rather than words. By fixing a part of the Hidden Markov Model (HMM) state probability matrix parameters, our method can obtain important model parameters for a more sufficient training. Meanwhile, a two-stage model training strategy is proposed to reduce the artificial markings of training speech and Linear Discriminant Analysis (LDA) is introduced to improve the efficiency of system identification. Experimental results show that our method can effectively improve the detection rate of keywords and achieve similar detection time under the same conditions.

引用

页码：530 / 534

页数：5

共 50 条

[21] Based on STM32 of CNN Speech Keyword Command Recognition System
KUANG Wenbo
LUO Weiping
Instrumentation, 2023, 10 (01) : 17 - 22
[22] Study of the design and implementation of speech keyword recognition system based on streaming media
Zhang Chenyan
Lu Shuqin
Sun Chengli
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 760 - 763
[23] Improvements on Speech Recognition for Fast Speech
Lee, Ki-Seung
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (02): : 88 - 95
[24] CSF images fast recognition model based on improved convolutional Neural Network
Huang, Wenming
Leng, Jinqiang
Deng, Zhenrong
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 516 - 522
[25] SPELL MY NAME: KEYWORD BOOSTED SPEECH RECOGNITION
Jung, Namkyu
Kim, Geonmin
Chung, Joon Son
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6642 - 6646
[26] Deep Autoencoder based Speech Features for Improved Dysarthric Speech Recognition
Vachhani, Bhavik
Bhat, Chitralekha
Das, Biswajit
Kopparapu, Sunil Kumar
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1854 - 1858
[27] A Metaphor Recognition Model based on LSTM and Keyword Similarity Computation
Chen, Zhiheng
Fu, Lijun
Wang, Hongjun
Liu, YuJiang
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2347 - 2352
[28] An anti-noise speech recognition model based on improved Wiener filter and PUM
Lu Xuanmin
Gao Yue
Xiao Peng
Duan Chao
PROCEEDINGS OF 2016 SIXTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2016), 2016, : 5 - 10
[29] An improved maximum model distance approach for HMM-based speech recognition systems
He, QH
Kwong, S
Man, KF
Tang, KS
PATTERN RECOGNITION, 2000, 33 (10) : 1749 - 1758
[30] Research on Acoustic Model of Speech Recognition Based on Neural Network with Improved Gating Unit
Liu, Wei
Yan, Yan
Yu, Jianqiang
Sun, Yiming
2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 2364 - 2368

← 1 2 3 4 5 →