Full expansion of context-dependent networks in large vocabulary speech recognition

被引：0

作者：

Mohri, M ^{[1
]}

Riley, M ^{[1
]}

Hindle, D ^{[1
]}

Ljolje, A ^{[1
]}

Pereira, F ^{[1
]}

机构：

[1] AT&T Bell Labs, Res, Florham Park, NJ 07932 USA

来源：

PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We combine our earlier approach to context-dependent network representation with our algorithm for determinizing weighted networks to build optimized networks for large-vocabulary speech recognition combining an n-gram language model, a pronunciation dictionary and context-dependency modeling. While fully-expanded networks have been used before in restrictive settings (medium vocabulary or no cross-word contexts), we demonstrate that our network determinization method makes it practical to use fully-expanded networks also in large-vocabulary recognition with full cross-word context modeling. For the DARPA North American Business News task (NAB), we give network sizes and recognition speeds and accuracies using bigram and trigram grammars with vocabulary sizes ranging from 10,000 to 160,000 words. With our construction, the fully-expanded NAB context-dependent networks contain only about twice as many arcs as the corresponding language models. Interestingly, we also find that, with these networks, real-time word accuracy is improved by increasing vocabulary size and n-gram order.

引用

页码：665 / 668

页数：4

共 50 条

[1] Context-dependent acoustic modeling using graphemes for large vocabulary speech recognition
Kanthak, S
Ney, H
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 845 - 848
[2] LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION WITH CONTEXT-DEPENDENT DBN-HMMS
Dahl, George E.
Yu, Dong
Deng, Li
Acero, Alex
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4688 - 4691
[3] Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
Dahl, George E.
Yu, Dong
Deng, Li
Acero, Alex
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01): : 30 - 42
[4] Context-dependent units for vocabulary-independent Spanish speech recognition
Villarrubia, L
Gomez, LH
Elvira, JM
Torrecilla, JC
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 451 - 454
[5] ADAPTATION OF CONTEXT-DEPENDENT DEEP NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION
Yao, Kaisheng
Yu, Dong
Seide, Frank
Su, Hang
Deng, Li
Gong, Yifan
2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 366 - 369
[6] Context-Dependent Deep Neural Networks for Commercial Mandarin Speech Recognition Applications
Niu, Jianwei
Xie, Lei
Jia, Lei
Hu, Na
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
[7] Hybrid methodological approach to context-dependent speech recognition
Miskovic, Dragisa
Gnjatovic, Milan
Strbac, Perica
Trenkic, Branimir
Jakovljevic, Niksa
Delic, Vlado
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2017, 14 (01):
[8] Context-dependent quantization for distributed and/or robust speech recognition
Wan, Chia-Yu
Chen, Yi
Lee, Lin-Shan
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4413 - 4416
[9] Context-dependent acoustic models for Chinese speech recognition
Ma, B
Huang, TY
Xu, B
Zhang, XJ
Qu, F
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 455 - 458
[10] WithYou: Automated Adaptive Speech Tutoring With Context-Dependent Speech Recognition
Zhang, Xinlei
Miyaki, Takashi
Rekimoto, Jun
PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,

← 1 2 3 4 5 →