MULTILINGUAL DEEP NEURAL NETWORK BASED ACOUSTIC MODELING FOR RAPID LANGUAGE ADAPTATION

Authors
Vu, Ngoc Thang [1]
Imseng, David
Povey, Daniel
Motlicek, Petr
Schultz, Tanja [1]
Bourlard, Herve
Affiliations
[1] Karlsruhe Institute of Technology, D-76021 Karlsruhe, Germany
Keywords
Multilingual DNN; phone merging; rapid language adaptation; KL-HMM
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents a study on multilingual deep neural network (DNN) based acoustic modeling and its application to new languages. We investigate the effect of phone merging on multilingual DNNs in the context of rapid language adaptation. Moreover, the combination of multilingual DNNs with Kullback-Leibler divergence based acoustic modeling (KL-HMM) is explored. Using ten different languages from the GlobalPhone database, our studies reveal that crosslingual acoustic model transfer through multilingual DNNs is superior to unsupervised RBM pre-training and greedy layer-wise supervised training. We also found that KL-HMM based decoding consistently outperforms conventional hybrid decoding, especially in low-resource scenarios. Furthermore, the experiments indicate that multilingual DNN training benefits equally from simple phoneset concatenation and manually derived universal phonesets.
Pages: 5