A kind of continuous digit speech recognition method

Cited by: 0

Authors:

Cao, WM [1]

Affiliation:

[1] Zhejiang Univ Technol, Inst Intelligent Informat Syst, Informat Coll, Hangzhou 310032, Peoples R China

Source:

ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS II | 2005 / Vol. 187

Keywords:

high-dimension space; high-dimension space covering theory; continuous speech of speaker-independent;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Drawing on descriptive geometry and notions from set theory, this paper redefines basic spatial elements such as curves and surfaces, and presents some fundamental notions of point covering based on the High-Dimension Space (HDS) point covering theory. It then maps parts of speech signals to points in HDS in order to analyze the distribution of these speech points in HDS, as well as the various geometric covering objects for speech points and their relationships. The paper also proposes a new algorithm for speaker-independent continuous digit speech recognition, based on the HDS point dynamic searching theory, that requires neither endpoint detection nor segmentation. First, from the different digit syllables in real continuous digit speech, a covering area in feature space is established for continuous speech. During recognition, the point covering dynamic searching theory in HDS is used, and satisfactory recognition results are obtained. Finally, in a comparison with an HMM-based method, the trend of the results indicates that as the number of samples increases, the difference in recognition rate between the two methods slowly decreases, and as the number of samples becomes very large, both recognition rates gradually approach 100%. The results show that the recognition rate of the HDS point covering method is higher than that of the HMM-based method. The paper attributes this to the point covering describing the morphological distribution of speech in HDS, whereas the HMM-based method describes only a probability distribution, whose accuracy it considers inferior to that of point covering.
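The abstract does not specify the covering objects or the search procedure, so the following is only a simplified illustration of the general idea, not the paper's algorithm. It assumes each digit class is covered by fixed-radius hyperspheres centred on its training points, and that a stream of feature frames is scanned in order with no endpoint detection or segmentation. The function names, the radius, and the toy 2-D data are all hypothetical.

```python
import math


def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_covering(samples_by_label, radius):
    """Cover each class with spheres of a fixed radius (an assumption)
    centred on that class's training points."""
    return {
        label: [(tuple(point), radius) for point in points]
        for label, points in samples_by_label.items()
    }


def classify_point(covering, point):
    """Return the label of the sphere the point lies deepest inside,
    or None if no sphere covers the point."""
    best_label, best_margin = None, 0.0
    for label, spheres in covering.items():
        for centre, radius in spheres:
            margin = distance(point, centre) - radius  # <= 0 means covered
            if margin <= 0 and (best_label is None or margin < best_margin):
                best_label, best_margin = label, margin
    return best_label


def recognise_stream(covering, frames):
    """Scan frames in order and emit a label each time the stream enters
    a covered region from an uncovered frame or from another class."""
    output, previous = [], None
    for frame in frames:
        label = classify_point(covering, frame)
        if label is not None and label != previous:
            output.append(label)
        previous = label
    return output


# Toy 2-D "feature space" with two digit classes (hypothetical data).
training = {"1": [(0.0, 0.0)], "2": [(5.0, 5.0)]}
covering = build_covering(training, radius=1.0)
frames = [(0.1, 0.0), (0.2, 0.1), (2.5, 2.5), (5.0, 4.8), (9.0, 9.0), (5.1, 5.0)]
result = recognise_stream(covering, frames)
print(result)
```

In this sketch, uncovered frames act as implicit boundaries between digits, which is one simple way to avoid an explicit endpoint detector. A real system would use high-dimensional acoustic features and the covering objects defined in the paper.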


Pages: 213-222

Page count: 10
