LEARNING FROM THE BEST: A TEACHER-STUDENT MULTILINGUAL FRAMEWORK FOR LOW-RESOURCE LANGUAGES

被引:0
|
作者
Bagchi, Deblin [1 ,2 ]
Hartmann, William [2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Raytheon BBN Technol, Cambridge, MA USA
关键词
Teacher-student learning; Low-resource speech; Multilingual training; Automatic speech recognition;
D O I
10.1109/icassp.2019.8683491
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The traditional method of pretraining neural acoustic models in low-resource languages consists of initializing the acoustic model parameters with a large, annotated multilingual corpus and can be a drain on time and resources. In an attempt to reuse TDNN-LSTMs already pre-trained using multilingual training, we have applied Teacher-Student ( TS) learning as a method of pretraining to transfer knowledge from a multilingual TDNN-LSTM to a TDNN. The pretraining time is reduced by an order of magnitude with the use of language-specific data during the teacher-student training. Additionally, the TS architecture allows us to leverage untranscribed data, previously untouched during supervised training. The best student TDNN achieves a WER within 1% of the teacher TDNN-LSTM performance and shows consistent improvement in recognition over TDNNs trained using the traditional pipeline over all the evaluation languages. Switching to TDNN from TDNN-LSTM also allows sub-real time decoding.
引用
收藏
页码:6051 / 6055
页数:5
相关论文
共 50 条
  • [31] General Sequence Teacher-Student Learning
    Wong, Jeremy Heng Meng
    Gales, Mark John Francis
    Wan, Yu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1725 - 1736
  • [32] Hypernymy Detection for Low-Resource Languages via Meta Learning
    Yu, Changlong
    Hang, Jialong
    Zhang, Haisong
    Ng, Wilfred
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3651 - 3656
  • [33] Lifelong Teacher-Student Network Learning
    Ye, Fei
    Bors, Adrian G.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6280 - 6296
  • [34] Multilingual Meta-Transfer Learning for Low-Resource Speech Recognition
    Zhou, Rui
    Koshikawa, Takaki
    Ito, Akinori
    Nose, Takashi
    Chen, Chia-Ping
    IEEE ACCESS, 2024, 12 : 158493 - 158504
  • [35] A Teacher-Student Framework for Maintainable Dialog Manager
    Wang, Weikang
    Zhang, Jiajun
    Zhang, Han
    Hwang, Mei-Yuh
    Zong, Chengqing
    Li, Zhifei
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3803 - 3812
  • [36] How to choose the best pivot language for automatic translation of low-resource languages
    Paul, Michael
    Finch, Andrew
    Sumita, Eiichrio
    ACM Transactions on Asian Language Information Processing, 2013, 12 (04):
  • [37] Parcing of low-resource languages from embedding of multilingual words Application to Northern Sami and Komi-Zyrian
    Lim, KyungTae
    Partanen, Niko
    Poibeau, Thierry
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2018, 59 (03): : 67 - 91
  • [38] AI-Tutor: Interactive Learning of Ancient Knowledge from Low-Resource Languages
    Dalal, Siddhartha
    Aditya, Rahul
    Raghuram, Vethavikashini Chithrra
    Koratamaddi, Prahlad
    WAT 2024 - 11th Workshop on Asian Translation, Proceedings of the Workshop, 2024, : 56 - 66
  • [39] Learning by reusing previous advice: a memory-based teacher-student framework
    Zhu, Changxi
    Cai, Yi
    Hu, Shuyue
    Leung, Ho-fung
    Chiu, Dickson K. W.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2023, 37 (01)
  • [40] School as teacher-student education context: learning from collaboration
    Rivas Flores, Jose I.
    Leite Mendez, Analia E.
    Cortes Gonzalez, Pablo
    PROFESORADO-REVISTA DE CURRICULUM Y FORMACION DE PROFESORADO, 2015, 19 (01): : 228 - 242