Improved Language Models for ASR using Written Language Text

被引:1
|
作者
Mukherji, Kaustuv [1 ]
Pandharipande, Meghna [1 ]
Kopparapu, Sunil Kumar [1 ]
机构
[1] Tata Consultancy Serv Ltd, TCS Res, Mumbai, Maharashtra, India
关键词
Language Model; Speech Recognition; SPEECH;
D O I
10.1109/NCC55593.2022.9806803
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The performance of an Automatic Speech Recognition (ASR) engine primarily depends on (a) the acoustic model (AM), (b) the language model (LM) and (c) the lexicon (Lx). While the contribution of each block to the overall performance of an ASR cannot be measured separately , a good LM helps in performance improvement in case of a domain specific ASR at a smaller cost. Generally, LM is greener compared to building AM and is much easier to build, for a domain specific ASR because it requires only domain specific text corpora. Traditionally, because of its ready availability, written language text (WLT) corpora has been used to build LM though there is an agreement that there a significant difference between WLT and spoken language text (SLT). In this paper, we explore methods and techniques that can be used to convert WLT into a form that realizes a better LM to support ASR performance.
引用
收藏
页码:362 / 366
页数:5
相关论文
共 50 条
  • [21] Uyghur Morpheme-based Language Models and ASR
    Ablimit, Mijit
    Neubig, Graham
    Mimura, Masato
    Mori, Shinsuke
    Kawahara, Tatsuya
    Hamdulla, Askar
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 581 - +
  • [22] Integration of complex language models in ASR and LU systems
    Justo, Raquel
    Torres, M. Ines
    PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (03) : 493 - 505
  • [23] Language and task independent text categorization with simple language models
    Peng, FC
    Schuurmans, D
    Wang, SJ
    HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 189 - 196
  • [24] SPOKEN LANGUAGE, WRITTEN LANGUAGE
    KLEIN, W
    LILI-ZEITSCHRIFT FUR LITERATURWISSENSCHAFT UND LINGUISTIK, 1985, 15 (59): : 9 - 35
  • [25] THE CHILDS LANGUAGE AND WRITTEN LANGUAGE
    BERKO, J
    EDUCATION, 1965, 86 (03): : 151 - 153
  • [26] ASR for Romanian language
    Gavat, Inge
    Dumitru, C. O.
    2007 14TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNALS, & IMAGE PROCESSING & EURASIP CONFERENCE FOCUSED ON SPEECH & IMAGE PROCESSING, MULTIMEDIA COMMUNICATIONS & SERVICES, 2007, : 52 - 55
  • [27] Personalized text snippet extraction using statistical language models
    Li, Qing
    Chen, Yuanzhu Peter
    PATTERN RECOGNITION, 2010, 43 (01) : 378 - 386
  • [28] Predicting Numerals in Text Using Nearest Neighbor Language Models
    Sakamoto, Taku
    Aizawa, Akiko
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 4795 - 4809
  • [29] Unsupervised Text Style Transfer using Language Models as Discriminators
    Yang, Zichao
    Hu, Zhiting
    Dyer, Chris
    Xing, Eric P.
    Berg-Kirkpatrick, Taylor
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [30] Oral language, written language and language awareness
    Parisse, C
    JOURNAL OF CHILD LANGUAGE, 2002, 29 (02) : 478 - 481