A CONFIGURABLE MULTILINGUAL MODEL IS ALL YOU NEED TO RECOGNIZE ALL LANGUAGES

被引：4

作者：

Zhou, Long ^{[1
]}

Li, Jinyu ^{[2
]}

Sun, Eric ^{[2
]}

Liu, Shujie ^{[1
]}

机构：

[1] Microsoft Res Asia, Beijing, Peoples R China

[2] Microsoft Speech & Language Grp, Beijing, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

multilingual speech recognition; configurable multilingual model; transformer-transducer;

D O I：

10.1109/ICASSP43922.2022.9747905

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Multilingual automatic speech recognition models have shown great promise in recent years because of the simple model training and deployment process. Conventional methods either train a universal multilingual model without taking any language information or with a 1-hot language ID (LID) vector to guide the recognition of the target language. In practice, a multilingual user can be prompted to pre-select several languages he/she can speak. The multilingual model without LID cannot well utilize the language information set by the user while the multilingual model with 1-hot LID can only handle one pre-selected language. In this paper, we propose a novel configurable multilingual model (CMM) which is trained only once but can be configured as different models based on users' choices by extracting language-specific modules together with a universal module from the trained CMM. Particularly, a single CMM can be deployed to any user scenario where the users can pre-select any combination of languages. Trained with 75K hours of transcribed anonymized Microsoft multilingual data and evaluated with 10-language test sets, the proposed CMM improves from the universal multilingual model by 26.0%, 16.9%, and 10.4% relative word error reduction when the user selects 1, 2, or 3 languages, respectively.

引用

页码：6422 / 6426

页数：5

共 50 条

[1] All you need for all you do
British Dental Journal, 2015, 219 (11) : 555 - 555
[2] Are All Languages Created Equal in Multilingual BERT?
Wu, Shijie
Dredze, Mark
5TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2020), 2020, : 120 - 130
[3] Knowledge Is All You Need
Miracchi, Lisa
PHILOSOPHICAL ISSUES, 2015, 25 (01) : 353 - 378
[4] Love Is All You Need
Stables, Kate
SIGHT AND SOUND, 2013, 23 (05): : 100 - 100
[5] All the Tools You Need
Shook, Ray
WELDING JOURNAL, 2010, 89 (10) : 4 - 4
[6] ALL YOU NEED IS A BOWL
HANNIGAN, KJ
FOOD ENGINEERING, 1969, 41 (07): : 46 - &
[7] All You Need is Love
Hartwig, Mervyn
JOURNAL OF CRITICAL REALISM, 2015, 14 (02) : 205 - 224
[8] All you need is compassion
Pnueli, Amir
Sa'ar, Yaniv
VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, 2008, 4905 : 233 - +
[9] ALL YOU NEED IS DEATH
Luckhurst, Roger
SIGHT AND SOUND, 2024, 34 (04): : 74 - 74
[10] 'All you need is love'
Palmer, Tony
SIGHT AND SOUND, 2008, 18 (09): : 90 - 90

← 1 2 3 4 5 →