Pronunciation modeling using a finite-state transducer representation

被引：17

作者：

Hazen, TJ ^{[1
]}

Hetherington, IL ^{[1
]}

Shu, H ^{[1
]}

Livescu, K ^{[1
]}

机构：

[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA

来源：

SPEECH COMMUNICATION | 2005年 / 46卷 / 02期

关键词：

D O I：

10.1016/j.specom.2005.03.004

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The MIT SUMMIT speech recognition system models pronunciation using a phonemic baseform dictionary along with rewrite rules for modeling phonological variation and multi-word reductions. Each pronunciation component is encoded within a finite-state transducer (FST) representation whose transition weights can be trained using an EM algorithm for finite-state networks. This paper explains the modeling approach we use and the details of its realization. We demonstrate the benefits and weaknesses of the approach both conceptually and empirically using the recognizer for our JUPITER weather information system. Our experiments demonstrate that the use of phonological rewrite rules within our system achieves word error rate reductions between 4% and 9% over different test sets when compared against a system using no phonological rewrite rules. (c) 2005 Elsevier B.V. All rights reserved.

引用

页码：189 / 203

页数：15

共 50 条

[11] A Sequential Minimization Algorithm for Finite-State Pronunciation Lexicon Models
Dobrisek, Simon
Vesnicer, Bostjan
Mihelic, France
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 720 - 723
[12] The design principles of a weighted finite-state transducer library
Mohri, M
Pereira, F
Riley, M
THEORETICAL COMPUTER SCIENCE, 2000, 231 (01) : 17 - 32
[13] An Expanded Finite-State Transducer for Tsuut'ina Verbs
Holden, Joshua
Cox, Christopher
Arppe, Antti
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5143 - 5152
[14] Juicer: A weighted finite-state transducer speech decoder
Moore, Darren
Dines, John
Doss, Mathew Magimai
Vepa, Jithendra
Cheng, Octavian
Hain, Thomas
MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 285 - +
[15] A rational design for a weighted finite-state transducer library
Mohri, M
Pereira, F
Riley, M
AUTOMATA IMPLEMENTATION, 1998, 1436 : 144 - 158
[16] Weighted finite-state transducer-based dysarthric speech recognition error correction using context-dependent pronunciation variation modelling
Seong, Woo Kyeong
Park, Ji Hun
INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2014, 6 (1-2) : 4 - 11
[17] Formal Modeling of RESTful Systems Using Finite-State Machines
Zuzak, Ivan
Budiselic, Ivan
Delac, Goran
WEB ENGINEERING, ICWE 2011, 2011, 6757 : 346 - 360
[18] APPROACHES TO FINITE-STATE MACHINE MODELING
ODELL, J
JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1995, 7 (08): : 14 - &
[19] OpenFst: A general and efficient weighted finite-state transducer library
Allauzen, Cyril
Riley, Michael
Schalkwyk, Johan
Skut, Wojciech
Mohri, Mehryar
IMPLEMENTATION AND APPLICATION OF AUTOMATA, 2007, 4783 : 11 - +
[20] Learning a Discriminative Weighted Finite-State Transducer for Speech Recognition
Lehr, Maider
Shafran, Izhak
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1360 - 1367

← 1 2 3 4 5 →