Tree-Based HMM State Tying for Arabic Continuous Speech Recognition

被引：0

作者：

Azim, Mona A. ^{[1
]}

Hamid, A. Aziz A. ^{[1
]}

Badr, Nagwa L. ^{[1
]}

Tolba, M. F. ^{[1
]}

机构：

[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt

来源：

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016 | 2017年 / 533卷

关键词：

Arabic phonemes; Tri-phones hmms; Speech recognition; SIMILARITY;

D O I：

10.1007/978-3-319-48308-5_10

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

One of the major challenges in building Hidden Markov Models (HMMs) for continuous speech recognition systems is the balance between the available training set and the recognition performance. For large vocabulary recognition systems, context dependent models are usually required to obtain higher recognition accuracy. This is crucial as most of the language contexts may not occur in the training set. This paper proposes an Arabic phonetic decision tree necessary to build tied state tri-phone HMMs. Experimental results based on the proposed decision tree show a promising recognition accuracy when compared with the traditional context independent models using the same training and testing sets. The maximum recognition accuracy achieved by the proposed approach was 92.8 % whereas it reached 61.5 % when tested using context independent HMMs.

引用

页码：96 / 103

页数：8

共 50 条

[31] Feature-table-based automatic question generation for tree-based state tying: A practical implementation
Kanokphara, S
Carson-Berndsen, J
INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 95 - 97
[32] Context-dependent HMM modeling using tree-based clustering for the recognition of handwritten words
Bianne, Anne-Laure
Kermorvant, Christopher
Likforman-Sulem, Laurence
DOCUMENT RECOGNITION AND RETRIEVAL XVII, 2010, 7534
[33] Improving Arabic HMM Based Speech Synthesis Quality
Abdel-Hamid, Ossama
Abdou, Sherif
Rashwan, Mohsen
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1332 - +
[34] Scalable HMM based Inference Engine in Large Vocabulary Continuous Speech Recognition
Chong, Jike
You, Kisun
Yi, Youngmin
Gonina, Ekaterina
Hughes, Christopher
Sung, Wonyong
Keutzer, Kurt
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1793 - +
[35] A Covariance-Tying Technique for HMM-Based Speech Synthesis
Oura, Keiichiro
Zen, Heiga
Nankaku, Yoshihiko
Lee, Akinobu
Tokuda, Keiichi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (03): : 595 - 601
[36] Implicit State-Tying for Support Vector Machines Based Speech Recognition
Bolanos, Daniel
Ward, Wayne
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 924 - 927
[37] HMM Based Continuous EOG Recognition for Eye-input Speech Interface
Fang, Fuming
Shinozaki, Takahiro
Horiuchi, Yasuo
Kuroiwa, Shingo
Furui, Sadaoki
Musha, Toshimitsu
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 734 - 737
[38] A study on rescoring using HMM-based detectors for continuous speech recognition
Fu, Qiang
Juang, Biing-Hwang
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 570 - 575
[39] Graphical Models for the Recognition of Arabic continuous speech based Triphones modeling
Zarrouk, Elyes
Benayed, Yassine
Gargouri, Faiez
2015 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2015, : 603 - 608
[40] A TREE-BASED STATISTICAL LANGUAGE MODEL FOR NATURAL-LANGUAGE SPEECH RECOGNITION
BAHL, LR
BROWN, PF
DESOUZA, PV
MERCER, RL
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (07): : 1001 - 1008

← 1 2 3 4 5 →