Context-dependent phoneme duration modeling with tree-based state tying

被引:1
|
作者
Park, SJ [1 ]
Koo, MW
Jhon, CS
机构
[1] Serv Dev Lab KT, Seoul, South Korea
[2] Seoul Natl Univ, Sch Comp Sci & Engn, Seoul, South Korea
来源
关键词
duration model; gamma distribution; tree-based state tying;
D O I
10.1093/ietisy/e88-d.3.662
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This letter presents two methods of modeling phoneme durations. One is the context-independent phoneme duration modeling in which duration parameters are stored in each phoneme. The other is the context-dependent duration modeling in which duration parameters are stored in each state shared by context-dependent phonemes. The phoneme duration model is compared with a without-duration model and a state duration model. Experiments are performed on a database collected over the telephone network. Experimental results show that duration information rejects out-of-task (OOT) words, well and that the context-dependent duration model yields the best performance among the tested models.
引用
收藏
页码:662 / 666
页数:5
相关论文
共 50 条
  • [31] Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling
    Liu, MK
    Xu, B
    Huang, TY
    Deng, YG
    Li, CR
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1025 - 1028
  • [32] MDL-based context-dependent subword modeling for speech recognition
    Shinoda, Koichi
    Watanabe, Takao
    Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 2000, 21 (02): : 79 - 86
  • [33] A frame-based context-dependent acoustic modeling for speech recognition
    Terashima R.
    Zen H.
    Nankaku Y.
    Tokuda K.
    IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (10) : 1856 - 1864+24
  • [34] On Tree-Based Neural Sentence Modeling
    Shi, Haoyue
    Zhou, Hao
    Chen, Jiaze
    Li, Lei
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4631 - 4641
  • [35] Analysis of context-dependent segmental duration for automatic speech recognition
    Wang, X
    Pols, LCW
    tenBosch, LFM
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1181 - 1184
  • [36] A Tree-Based Context Model for Object Recognition
    Choi, Myung Jin
    Torralba, Antonio
    Willsky, Alan S.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 240 - 252
  • [37] Novel lookahead decision tree state tying for acoustic modeling
    Xue, Jian
    Zhao, Yunxin
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1133 - +
  • [38] Tree-based Phone Duration Modelling of the Serbian Language
    Sovilj-Nikic, S.
    Delic, V.
    Sovilj-Nikic, I.
    Markovic, M.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2014, 20 (03) : 77 - 82
  • [39] Decision tree-based context dependent sublexical units for Continuous Speech Recognition of Basque
    de Ipiña, KL
    Graña, M
    Ezeiza, N
    Hernández, M
    Zulueta, E
    Ezeiza, A
    PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 259 - 265
  • [40] Modeling context-dependent conformation parameters of DNA duplexes
    Vorobjev Y.N.
    Emel'Yanov D.Yu.
    Biophysics, 2006, 51 (Suppl 1) : 28 - 34