Improved Language Models for ASR using Written Language Text

Cited by: 1
Authors
Mukherji, Kaustuv [1]
Pandharipande, Meghna [1]
Kopparapu, Sunil Kumar [1]
Affiliations
[1] Tata Consultancy Services Ltd, TCS Research, Mumbai, Maharashtra, India
Keywords
Language Model; Speech Recognition; Speech
DOI
10.1109/NCC55593.2022.9806803
Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
The performance of an Automatic Speech Recognition (ASR) engine depends primarily on (a) the acoustic model (AM), (b) the language model (LM), and (c) the lexicon (Lx). While the contribution of each block to the overall performance of an ASR engine cannot be measured separately, a good LM improves the performance of a domain-specific ASR at a smaller cost. Generally, building an LM is greener than building an AM, and it is much easier for a domain-specific ASR because it requires only domain-specific text corpora. Traditionally, because of their ready availability, written language text (WLT) corpora have been used to build LMs, though it is agreed that there is a significant difference between WLT and spoken language text (SLT). In this paper, we explore methods and techniques that can be used to convert WLT into a form that realizes a better LM to support ASR performance.
Pages: 362-366
Number of pages: 5