Tree-Based HMM State Tying for Arabic Continuous Speech Recognition

Cited by: 0
Authors
Azim, Mona A. [1 ]
Hamid, A. Aziz A. [1 ]
Badr, Nagwa L. [1 ]
Tolba, M. F. [1 ]
Affiliations
[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt
Keywords
Arabic phonemes; Tri-phone HMMs; Speech recognition; Similarity
DOI
10.1007/978-3-319-48308-5_10
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
One of the major challenges in building Hidden Markov Models (HMMs) for continuous speech recognition systems is balancing the size of the available training set against recognition performance. For large-vocabulary recognition systems, context-dependent models are usually required to obtain higher recognition accuracy. This balance is crucial because most of the language contexts may not occur in the training set. This paper proposes an Arabic phonetic decision tree for building tied-state tri-phone HMMs. Experimental results based on the proposed decision tree show promising recognition accuracy compared with traditional context-independent models using the same training and testing sets. The maximum recognition accuracy achieved by the proposed approach was 92.8%, whereas context-independent HMMs reached 61.5%.
Pages: 96-103
Page count: 8
Related papers
50 in total
  • [31] Feature-table-based automatic question generation for tree-based state tying: A practical implementation
    Kanokphara, S
    Carson-Berndsen, J
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 95 - 97
  • [32] Context-dependent HMM modeling using tree-based clustering for the recognition of handwritten words
    Bianne, Anne-Laure
    Kermorvant, Christopher
    Likforman-Sulem, Laurence
    DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
  • [33] Improving Arabic HMM Based Speech Synthesis Quality
    Abdel-Hamid, Ossama
    Abdou, Sherif
    Rashwan, Mohsen
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1332 - +
  • [34] Scalable HMM based Inference Engine in Large Vocabulary Continuous Speech Recognition
    Chong, Jike
    You, Kisun
    Yi, Youngmin
    Gonina, Ekaterina
    Hughes, Christopher
    Sung, Wonyong
    Keutzer, Kurt
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1793 - +
  • [35] A Covariance-Tying Technique for HMM-Based Speech Synthesis
    Oura, Keiichiro
    Zen, Heiga
    Nankaku, Yoshihiko
    Lee, Akinobu
    Tokuda, Keiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (03): : 595 - 601
  • [36] Implicit State-Tying for Support Vector Machines Based Speech Recognition
    Bolanos, Daniel
    Ward, Wayne
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 924 - 927
  • [37] HMM Based Continuous EOG Recognition for Eye-input Speech Interface
    Fang, Fuming
    Shinozaki, Takahiro
    Horiuchi, Yasuo
    Kuroiwa, Shingo
    Furui, Sadaoki
    Musha, Toshimitsu
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 734 - 737
  • [38] A study on rescoring using HMM-based detectors for continuous speech recognition
    Fu, Qiang
    Juang, Biing-Hwang
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 570 - 575
  • [39] Graphical Models for the Recognition of Arabic continuous speech based Triphones modeling
    Zarrouk, Elyes
    Benayed, Yassine
    Gargouri, Faiez
    2015 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2015, : 603 - 608
  • [40] A TREE-BASED STATISTICAL LANGUAGE MODEL FOR NATURAL-LANGUAGE SPEECH RECOGNITION
    BAHL, LR
    BROWN, PF
    DESOUZA, PV
    MERCER, RL
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (07): : 1001 - 1008