A kind of continuous digit speech recognition method

被引:0
|
作者
Cao, WM [1 ]
机构
[1] Zhejiang Univ Technol, Inst Intelligent Informat Syst, Informat Coll, Hangzhou 310032, Peoples R China
关键词
high-dimension space; high-dimension space covering theory; continuous speech of speaker-independent;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the light of descriptive geometry and notions in set theory, this paper redefines the basic elements in space such as curve and surface and so on, presents some fundamental notions with respect to the point cover based oil the High-Dimension Space(HDS) point covering theory, finally takes points from mapping part of speech signals to HDS, so as to analyze distribution information of these speech points in HDS, and various geometric covering objects for speech points and their relationship. Besides, this paper also proposes a new algorithm for speaker independent continuous digit speech recognition based on the HDS point dynamic searching theory without endpoints detection and segmentation. First from the different digit syllables in real continuous digit speech, we establish the covering area in feature space for continuous speech. During recognition, we make use of the point covering dynamic searching theory in HDS to do recognition, and then get the satisfying recognized results. At last, compared to HMM-based method, from the development trend of the comparing results, as sample amount increasing, the difference of recognition rate between two methods will decrease slowly, while sample amount approaching to be very large, two recognition rates all close to 100% little by little. As seen from the results, the recognition rate of HDS point covering method is higher than that of in HMM-based method, because, the point covering describes the morphological distribution for speech in HDS, whereas HMM-based method is only a probability distribution. whose accuracy is certainly inferior to point covering.
引用
收藏
页码:213 / 222
页数:10
相关论文
共 50 条
  • [21] Information geometry theory of high-dimension space and application for speaker independent continuous digit speech recognition
    Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
    不详
    不详
    Chin J Electron, 2006, 4 A (768-784):
  • [22] Robust continuous digit recognition using Reservoir Computing
    Jalalvand, Azarakhsh
    Triefenbach, Fabian
    Demuynck, Kris
    Martens, Jean-Pierre
    COMPUTER SPEECH AND LANGUAGE, 2015, 30 (01): : 135 - 158
  • [23] HMM/ANN system for Vietnamese continuous digit recognition
    Duc, DN
    Hosom, JP
    Mai, LC
    DEVELOPMENTS IN APPLIED ARTIFICIAL INTELLIGENCE, 2003, 2718 : 481 - 486
  • [24] The time-sliced paradigm - A connectionist method for continuous speech recognition
    Kirschning, I
    Tomabechi, H
    Koyama, M
    Aoe, JI
    INFORMATION SCIENCES, 1996, 93 (1-2) : 133 - 158
  • [25] Novel statistical language modeling method for continuous Chinese speech recognition
    Tian, Bin
    Tian, Hongxin
    Fu, Qiang
    Yi, Kechu
    International Conference on Signal Processing Proceedings, ICSP, 1998, 1 : 734 - 737
  • [26] A novel statistical language modeling method for continuous Chinese speech recognition
    Tian, B
    Tian, HX
    Fu, Q
    Yi, KC
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 734 - 737
  • [27] A modular RNN-based method for continuous Mandarin speech recognition
    Liao, YF
    Chen, SH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03): : 252 - 263
  • [28] Extensions to the word graph method for large vocabulary continuous speech recognition
    Ney, H
    Ortmanns, S
    Lindam, I
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1791 - 1794
  • [29] Speech adaptation using neural networks for connected digit recognition
    Cheng, XL
    Wang, H
    Li, ZG
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2401 - 2404
  • [30] AUDIO-VISUAL ISOLATED DIGIT RECOGNITION FOR WHISPERED SPEECH
    Fan, Xing
    Busso, Carlos
    Hansen, John H. L.
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1500 - 1503