Improved Language Models for ASR using Written Language Text

被引：1

作者：

Mukherji, Kaustuv ^{[1
]}

Pandharipande, Meghna ^{[1
]}

Kopparapu, Sunil Kumar ^{[1
]}

机构：

[1] Tata Consultancy Serv Ltd, TCS Res, Mumbai, Maharashtra, India

来源：

2022 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC) | 2022年

关键词：

Language Model; Speech Recognition; SPEECH;

D O I：

10.1109/NCC55593.2022.9806803

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

The performance of an Automatic Speech Recognition (ASR) engine primarily depends on (a) the acoustic model (AM), (b) the language model (LM) and (c) the lexicon (Lx). While the contribution of each block to the overall performance of an ASR cannot be measured separately , a good LM helps in performance improvement in case of a domain specific ASR at a smaller cost. Generally, LM is greener compared to building AM and is much easier to build, for a domain specific ASR because it requires only domain specific text corpora. Traditionally, because of its ready availability, written language text (WLT) corpora has been used to build LM though there is an agreement that there a significant difference between WLT and spoken language text (SLT). In this paper, we explore methods and techniques that can be used to convert WLT into a form that realizes a better LM to support ASR performance.

引用

页码：362 / 366

页数：5

共 50 条

[41] WRITTEN LANGUAGE
RICKHEIT, G
ZEITSCHRIFT FUR GERMANISTISCHE LINGUISTIK, 1983, 11 (02): : 221 - 225
[42] Hierarchical Pitman-Yor language models for ASR in meetings
Huang, Songfang
Renals, Steve
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 124 - 129
[43] Phrase classes in two-level language models for ASR
Justo, Raquel
Torres, M. Ines
PATTERN ANALYSIS AND APPLICATIONS, 2009, 12 (04) : 427 - 437
[44] INFORMATION-WEIGHTED NEURAL CACHE LANGUAGE MODELS FOR ASR
Verwimp, Lyan
Pelemans, Joris
Van Hamme, Hugo
Wambacq, Patrick
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 756 - 762
[45] Study of Morphological Factors of Factored Language Models for Russian ASR
Kipyatkova, Irina
Karpov, Alexey
SPEECH AND COMPUTER, 2014, 8773 : 451 - 458
[46] Paraphrasing predicates from written language to spoken language using the web
Kaji, N
Okamoto, M
Kurohashi, S
HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2004, : 241 - 248
[47] INTRODUCTION + WRITTEN LANGUAGE, SPOKEN LANGUAGE
KLEIN, W
LILI-ZEITSCHRIFT FUR LITERATURWISSENSCHAFT UND LINGUISTIK, 1985, 15 (59): : 7 - 8
[48] From oral language to written language
Billard, C
Gillet, P
Barthez, MA
ARCHIVES DE PEDIATRIE, 1999, 6 : 387S - 388S
[49] Language play and learning of written language
Dominguez, Paola
Nasini, Stefano
Teberosky, Ana
INFANCIA Y APRENDIZAJE, 2013, 36 (04): : 501 - 515
[50] Discriminative Language Modeling Using Simulated ASR Errors
Jyothi, Preethi
Fosler-Lussier, Eric
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1049 - 1052

← 1 2 3 4 5 →