PHMM BASED ASYNCHRONOUS ACOUSTIC MODEL FOR CHINESE LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION

被引:0
|
作者
Wu, Hao [1 ]
Wu, Xihong [1 ]
Chi, Huisheng [1 ]
机构
[1] Peking Univ, Minist Educ, Key Lab Machine Percept, Hearing Res Ctr, Beijing 100871, Peoples R China
关键词
tonal language; multiple stream; PHMM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we presented an asynchronous multiple stream based Chinese tonal acoustic modeling framework. In this framework, toneless phonetic units and tones are modeled separately with different acoustic features. During the training, and decoding process, a set of models are coupled together with a product hidden Markov models (PHMM) to represent whole tonal phonetic units. Through this, a compound context dependent tonal model can be generated from a few simple models. Experiments show that such model scheme generates more compact and accurate model presentation and brings improvement on the performance for large vocabulary speech recognition tasks.
引用
收藏
页码:4477 / 4480
页数:4
相关论文
共 50 条
  • [31] IMPROVING LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION BY COMBINING GMM-BASED AND RESERVOIR-BASED ACOUSTIC MODELING
    Triefenbach, Fabian
    Demuynck, Kris
    Martens, Jean-Pierre
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 107 - 112
  • [32] Large-Vocabulary Continuous Speech Recognition Systems
    Saon, George
    Chien, Jen-Tzung
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 18 - 33
  • [33] Boosting systems for large vocabulary continuous speech recognition
    Saon, George
    Soltau, Hagen
    SPEECH COMMUNICATION, 2012, 54 (02) : 212 - 218
  • [34] Experimenting with lipreading for large vocabulary continuous speech recognition
    Karel Paleček
    Journal on Multimodal User Interfaces, 2018, 12 : 309 - 318
  • [35] Recent Developments in Large Vocabulary Continuous Speech Recognition
    Saon, George
    Chien, Jen-Tzung
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [36] Development of Large Vocabulary Continuous Speech Recognition for Polish
    Demenko, G.
    Szymanski, M.
    Cecko, R.
    Kusmierek, E.
    Lange, M.
    Wegner, K.
    Klessa, K.
    Owsianny, M.
    ACTA PHYSICA POLONICA A, 2012, 121 (1A) : A86 - A91
  • [37] A Myanmar Large Vocabulary Continuous Speech Recognition System
    Naing, Hay Mar Soe
    Hlaing, Aye Mya
    Pa, Win Pa
    Hu, Xinhui
    Thu, Ye Kyaw
    Hori, Chiori
    Kawai, Hisashi
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 320 - 327
  • [38] Investigation on large vocabulary continuous Kannada speech recognition
    Vanajakshi, Puttaswamy Gowda
    Mathivanan, M.
    Kumaran, T. Senthil
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2021, 36 (01) : 1 - 24
  • [39] Korean large vocabulary continuous speech recognition with morpheme-based recognition units
    Kwon, OW
    Park, J
    SPEECH COMMUNICATION, 2003, 39 (3-4) : 287 - 300
  • [40] Specifics of hidden Markov model modifications for large vocabulary continuous speech recognition
    Silingas, D
    Telksnys, L
    INFORMATICA, 2004, 15 (01) : 93 - 110