Fast Speech Keyword Recognition Based on Improved Filler Model

被引：0

作者：

Wang, Yang ^{[1
,2
]}

Yang, Jie ^{[1
,2
]}

Zhang, Le ^{[3
]}

机构：

[1] Wuhan Univ Technol, Coll Informat Engn, Wuhan, Hubei, Peoples R China

[2] Wuhan Univ Technol, Key Lab Fiber Opt Sensing Technol & Informat Proc, Minist Educ, Wuhan, Hubei, Peoples R China

[3] Univ Sheffield, Dept Elect & Elect Engn, Sheffield, S Yorkshire, England

来源：

2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC) | 2017年

基金：

中国国家自然科学基金;

关键词：

spoken keywords detection; filler model; HMM; LDA;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Most traditional template matching based keyword recognition methods don't need training data, just rely on frame matching. However, the recognition speed is relatively slow and it can't be used in practice. The LVCSR-based method needs to convert the speech signal into text signal before recognition, which has an important impact on the final recognition performance. In this paper, we propose a method based on the filler model framework, which selects the syllable instead of using words as the modelling unit. The search space of our method is composed of all the syllables rather than words. By fixing a part of the Hidden Markov Model (HMM) state probability matrix parameters, our method can obtain important model parameters for a more sufficient training. Meanwhile, a two-stage model training strategy is proposed to reduce the artificial markings of training speech and Linear Discriminant Analysis (LDA) is introduced to improve the efficiency of system identification. Experimental results show that our method can effectively improve the detection rate of keywords and achieve similar detection time under the same conditions.

引用

页码：530 / 534

页数：5

共 50 条

[1] Keyword recognition based on twice fusion of Posteriorgram and filler model
Chen T.-B.
Zhang C.-F.
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (06): : 1170 - 1176
[2] FALSE ALARM REDUCTION BY IMPROVED FILLER MODEL AND POST-PROCESSING IN SPEECH KEYWORD SPOTTING
Tavanaei, Amirhossein
Sameti, Hossein
Mohammadi, Seyyed Hamidreza
2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
[3] Evaluating Spoken Language Model Based on Filler Prediction Model in Speech Recognition
Ohta, Kengo
Tsuchiya, Masatoshi
Nakagawa, Seiichi
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1558 - +
[4] KEYWORD DETECTION IN CONVERSATIONAL SPEECH UTTERANCES USING HIDDEN MARKOV MODEL-BASED CONTINUOUS SPEECH RECOGNITION
ROSE, RC
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04): : 309 - 333
[5] Competing set based verification method in speech keyword recognition
Sun, Cheng-Li
Liu, Gang
Guo, Jun
PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3349 - 3353
[6] An improved HMM speech recognition model
Yuan, Lichi
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1311 - 1315
[7] Fast Keyword Spotting in Telephone Speech
Nouza, Jan
Silovsky, Jan
RADIOENGINEERING, 2009, 18 (04) : 665 - 670
[8] Improved lattice-based speech keyword spotting algorithm
Department of Electronic Engineer, Tsinghua University, Beijing
100084, China
Qinghua Daxue Xuebao, 5 (508-513): : 508 - 513
[9] Keyword Guided Target Speech Recognition
Shi, Ying
Li, Lantian
Wang, Dong
Han, Jiqing
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1945 - 1949
[10] Model-based Articulatory Phonetic Features for Improved Speech Recognition
Huang, Guangpu
Er, Meng Joo
2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,

← 1 2 3 4 5 →