Pronunciation modeling using a finite-state transducer representation

被引:17
|
作者
Hazen, TJ [1 ]
Hetherington, IL [1 ]
Shu, H [1 ]
Livescu, K [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
D O I
10.1016/j.specom.2005.03.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The MIT SUMMIT speech recognition system models pronunciation using a phonemic baseform dictionary along with rewrite rules for modeling phonological variation and multi-word reductions. Each pronunciation component is encoded within a finite-state transducer (FST) representation whose transition weights can be trained using an EM algorithm for finite-state networks. This paper explains the modeling approach we use and the details of its realization. We demonstrate the benefits and weaknesses of the approach both conceptually and empirically using the recognizer for our JUPITER weather information system. Our experiments demonstrate that the use of phonological rewrite rules within our system achieves word error rate reductions between 4% and 9% over different test sets when compared against a system using no phonological rewrite rules. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:189 / 203
页数:15
相关论文
共 50 条
  • [11] A Sequential Minimization Algorithm for Finite-State Pronunciation Lexicon Models
    Dobrisek, Simon
    Vesnicer, Bostjan
    Mihelic, France
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 720 - 723
  • [12] The design principles of a weighted finite-state transducer library
    Mohri, M
    Pereira, F
    Riley, M
    THEORETICAL COMPUTER SCIENCE, 2000, 231 (01) : 17 - 32
  • [13] An Expanded Finite-State Transducer for Tsuut'ina Verbs
    Holden, Joshua
    Cox, Christopher
    Arppe, Antti
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5143 - 5152
  • [14] Juicer: A weighted finite-state transducer speech decoder
    Moore, Darren
    Dines, John
    Doss, Mathew Magimai
    Vepa, Jithendra
    Cheng, Octavian
    Hain, Thomas
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 285 - +
  • [15] A rational design for a weighted finite-state transducer library
    Mohri, M
    Pereira, F
    Riley, M
    AUTOMATA IMPLEMENTATION, 1998, 1436 : 144 - 158
  • [16] Weighted finite-state transducer-based dysarthric speech recognition error correction using context-dependent pronunciation variation modelling
    Seong, Woo Kyeong
    Park, Ji Hun
    INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2014, 6 (1-2) : 4 - 11
  • [17] Formal Modeling of RESTful Systems Using Finite-State Machines
    Zuzak, Ivan
    Budiselic, Ivan
    Delac, Goran
    WEB ENGINEERING, ICWE 2011, 2011, 6757 : 346 - 360
  • [18] APPROACHES TO FINITE-STATE MACHINE MODELING
    ODELL, J
    JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1995, 7 (08): : 14 - &
  • [19] OpenFst: A general and efficient weighted finite-state transducer library
    Allauzen, Cyril
    Riley, Michael
    Schalkwyk, Johan
    Skut, Wojciech
    Mohri, Mehryar
    IMPLEMENTATION AND APPLICATION OF AUTOMATA, 2007, 4783 : 11 - +
  • [20] Learning a Discriminative Weighted Finite-State Transducer for Speech Recognition
    Lehr, Maider
    Shafran, Izhak
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1360 - 1367