LEARNING FROM THE BEST: A TEACHER-STUDENT MULTILINGUAL FRAMEWORK FOR LOW-RESOURCE LANGUAGES

被引:0
|
作者
Bagchi, Deblin [1 ,2 ]
Hartmann, William [2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Raytheon BBN Technol, Cambridge, MA USA
关键词
Teacher-student learning; Low-resource speech; Multilingual training; Automatic speech recognition;
D O I
10.1109/icassp.2019.8683491
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The traditional method of pretraining neural acoustic models in low-resource languages consists of initializing the acoustic model parameters with a large, annotated multilingual corpus and can be a drain on time and resources. In an attempt to reuse TDNN-LSTMs already pre-trained using multilingual training, we have applied Teacher-Student ( TS) learning as a method of pretraining to transfer knowledge from a multilingual TDNN-LSTM to a TDNN. The pretraining time is reduced by an order of magnitude with the use of language-specific data during the teacher-student training. Additionally, the TS architecture allows us to leverage untranscribed data, previously untouched during supervised training. The best student TDNN achieves a WER within 1% of the teacher TDNN-LSTM performance and shows consistent improvement in recognition over TDNNs trained using the traditional pipeline over all the evaluation languages. Switching to TDNN from TDNN-LSTM also allows sub-real time decoding.
引用
收藏
页码:6051 / 6055
页数:5
相关论文
共 50 条
  • [41] Voice Activation for Low-Resource Languages
    Kolesau, Aliaksei
    Sesok, Dmitrij
    APPLIED SCIENCES-BASEL, 2021, 11 (14):
  • [42] Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon
    Koto, Fajri
    Beck, Tilman
    Talat, Zeerak
    Gurevych, Iryna
    Baldwin, Timothy
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 298 - 320
  • [43] Optimizing Multilingual Sentiment Analysis in Low-Resource Languages with Adaptive Pretraining and Strategic Language Selection
    Raychawdhary, Nilanjana
    Das, Amit
    Bhattacharya, Sutanu
    Dozier, Gerry
    Seals, Cheryl D.
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [44] Traffic signal control using reinforcement learning based on the teacher-student framework
    Liu, Junxiu
    Qin, Sheng
    Su, Min
    Luo, Yuling
    Zhang, Shunsheng
    Wang, Yanhu
    Yang, Su
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [45] A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages
    Zennaki, O.
    Semmar, N.
    Besacier, L.
    NATURAL LANGUAGE ENGINEERING, 2019, 25 (01) : 43 - 67
  • [46] Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification
    Joyanta Basu
    Soma Khan
    Rajib Roy
    Tapan Kumar Basu
    Swanirbhar Majumder
    Circuits, Systems, and Signal Processing, 2021, 40 : 4986 - 5013
  • [47] Multilingual Dependency Parsing for Low-Resource African Languages: Case Studies on Bambara, Wolof, and Yoruba
    Dione, Cheikh Bamba
    IWPT 2021: THE 17TH INTERNATIONAL CONFERENCE ON PARSING TECHNOLOGIES: PROCEEDINGS OF THE CONFERENCE (INCLUDING THE IWPT 2021 SHARED TASK), 2021, : 84 - 92
  • [48] A Systematic Review and Analysis of Multilingual Data Strategies in Text-to-Speech for Low-Resource Languages
    Do, Phat
    Coler, Matt
    Dijkstra, Jelske
    Klabbers, Esther
    INTERSPEECH 2021, 2021, : 16 - 20
  • [49] Teacher-Student Relationship and Learning Difficulty in Distance Learning
    Yang, Kai-Hsiang
    Wu, I-Hwey
    2013 INTERNATIONAL CONFERENCE ON EDUCATION AND EDUCATIONAL RESEARCH (EER 2013), VOL 2, 2013, 2 : 133 - 138
  • [50] Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification
    Basu, Joyanta
    Khan, Soma
    Roy, Rajib
    Basu, Tapan Kumar
    Majumder, Swanirbhar
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (10) : 4986 - 5013