COPING WITH OUT-OF-VOCABULARY WORDS: OPEN VERSUS HUGE VOCABULARY ASR

被引：3

作者：

Gerosa, Matteo ^{[1
]}

Federico, Marcello ^{[1
]}

机构：

[1] FBK Irst Fdn Bruno Kessler, I-38100 Povo, TN, Italy

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年

关键词：

Automatic Speech Recognition; Open-vocabulary speech recognition; OOV words;

D O I：

10.1109/ICASSP.2009.4960583

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper investigates methods for coping with out-of-vocabulary words in a large vocabulary speech recognition task, namely the automatic transcription of Italian broadcast news. Two alternative ways for augmenting a 64K(thousand)-word recognition vocabulary and language model are compared: introducing extra words with their phonetic transcription up to 1.2M (million) words, or extending the language model with so-called graphones, i.e. sub-word units made of phone-character sequences. Graphones and phonetic transcriptions of words are automatically generated by adapting an off-the-shelf statistical machine translation toolkit. We found that the word-based and graphone-based extensions allow both for better recognition performance, with the former performing significantly better than the latter. In addition, the word-based extension approach shows interesting potential even under conditions of little supervision. In fact, by training the grapheme to phoneme translation system with only 2K manually verified transcriptions, the final word error rate increases by just 3% relative, with respect to starting from a lexicon of 64K Words.

引用

页码：4313 / 4316

页数：4

共 50 条

[1] Detection of Out-of-Vocabulary Words in Posterior Based ASR
Ketabdar, Hamed
Hannemann, Mirko
Hermansky, Hynek
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2772 - 2775
[2] Finding Recurrent Out-of-Vocabulary Words
Qin, Long
Rudnicky, Alexander
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2241 - 2245
[3] Lexicon Stratification for Translating Out-of-Vocabulary Words
Tsvetkov, Yulia
Dyer, Chris
PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 125 - 131
[4] Out-of-vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System
Egorova, Ekaterina
Vydana, Hari Krishna
Burget, Lukas
Cernocky, Jan
INTERSPEECH 2021, 2021, : 2901 - 2905
[5] RNN Language Model Estimation for Out-of-Vocabulary Words
Illina, Irina
Fohr, Dominique
HUMAN LANGUAGE TECHNOLOGY. CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, LTC 2017, 2020, 12598 : 199 - 211
[6] Handling Out-of-Vocabulary Words in Lexicons to Polarity Classification
Nascimento, Gabriel
Duarte, Fellipe
Guedes, Gustavo Paiva
PROCEEDINGS OF THE 17TH BRAZILIAN SYMPOSIUM ON HUMAN FACTORS IN COMPUTING SYSTEMS (IHC 2018), 2015,
[7] WASSUP? LOL : Characterizing Out-of-Vocabulary Words in Twitter
Maity, Suman Kalyan
Chaudhary, Anshit
Kumar, Shraman
Mukherjee, Animesh
Sarda, Chaitanya
Patil, Abhijeet
Mondal, Akash
PROCEEDINGS OF THE 19TH ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING COMPANION, 2016, : 341 - 344
[8] A category based approach for recognition of out-of-vocabulary words
Gallwitz, F
Noth, E
Niemann, H
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 228 - 231
[9] Similarity Scoring for Recognizing Repeated Out-of-Vocabulary Words
Hannemann, Mirko
Kombrink, Stefan
Karafiat, Martin
Burget, Lukas
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 897 - 900
[10] Phoneme-to-grapheme conversion for out-of-vocabulary words in large vocabulary speech recognition
Decadt, B
Duchateau, J
Daelemans, W
Wambacq, P
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 413 - 416

← 1 2 3 4 5 →