Context-dependent phoneme duration modeling with tree-based state tying

被引：1

作者：

Park, SJ ^{[1
]}

Koo, MW

Jhon, CS

机构：

[1] Serv Dev Lab KT, Seoul, South Korea

[2] Seoul Natl Univ, Sch Comp Sci & Engn, Seoul, South Korea

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2005年 / E88D卷 / 03期

关键词：

duration model; gamma distribution; tree-based state tying;

D O I：

10.1093/ietisy/e88-d.3.662

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This letter presents two methods of modeling phoneme durations. One is the context-independent phoneme duration modeling in which duration parameters are stored in each phoneme. The other is the context-dependent duration modeling in which duration parameters are stored in each state shared by context-dependent phonemes. The phoneme duration model is compared with a without-duration model and a state duration model. Experiments are performed on a database collected over the telephone network. Experimental results show that duration information rejects out-of-task (OOT) words, well and that the context-dependent duration model yields the best performance among the tested models.

引用

页码：662 / 666

页数：5

共 50 条

[31] Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling
Liu, MK
Xu, B
Huang, TY
Deng, YG
Li, CR
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1025 - 1028
[32] MDL-based context-dependent subword modeling for speech recognition
Shinoda, Koichi
Watanabe, Takao
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 2000, 21 (02): : 79 - 86
[33] A frame-based context-dependent acoustic modeling for speech recognition
Terashima R.
Zen H.
Nankaku Y.
Tokuda K.
IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (10) : 1856 - 1864+24
[34] On Tree-Based Neural Sentence Modeling
Shi, Haoyue
Zhou, Hao
Chen, Jiaze
Li, Lei
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4631 - 4641
[35] Analysis of context-dependent segmental duration for automatic speech recognition
Wang, X
Pols, LCW
tenBosch, LFM
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1181 - 1184
[36] A Tree-Based Context Model for Object Recognition
Choi, Myung Jin
Torralba, Antonio
Willsky, Alan S.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 240 - 252
[37] Novel lookahead decision tree state tying for acoustic modeling
Xue, Jian
Zhao, Yunxin
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1133 - +
[38] Tree-based Phone Duration Modelling of the Serbian Language
Sovilj-Nikic, S.
Delic, V.
Sovilj-Nikic, I.
Markovic, M.
ELEKTRONIKA IR ELEKTROTECHNIKA, 2014, 20 (03) : 77 - 82
[39] Decision tree-based context dependent sublexical units for Continuous Speech Recognition of Basque
de Ipiña, KL
Graña, M
Ezeiza, N
Hernández, M
Zulueta, E
Ezeiza, A
PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 259 - 265
[40] Modeling context-dependent conformation parameters of DNA duplexes
Vorobjev Y.N.
Emel'Yanov D.Yu.
Biophysics, 2006, 51 (Suppl 1) : 28 - 34

← 1 2 3 4 5 →