GMM-derived features for effective unsupervised adaptation of deep neural network acoustic models

被引:0
|
作者
Tomashenko, Natalia [1 ,2 ]
Khokhlov, Yuri [3 ]
机构
[1] Speech Technol Ctr, St Petersburg, Russia
[2] ITMO Univ, St Petersburg, Russia
[3] STC Innovat Ltd, St Petersburg, Russia
关键词
speaker adaptation; deep neural networks (DNN); MAP; fMLLR; CD-DNN-HMM; GMM-derived (GMMD) features; speaker adaptive training (SAT);
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we investigate GMM-derived features recently introduced for adaptation of context-dependent deep neural network HMM (CD-DNN-HMM) acoustic models. We improve the previously proposed adaptation algorithm by applying the concept of speaker adaptive training (SAT) to DNNs built on GMM-derived features and by using fMLLR-adapted features for training an auxiliary GMM model. Traditional adaptation algorithms, such as maximum a posteriori adaptation (MAP) and feature space maximum likelihood linear regression (fMLLR) are performed for the auxiliary GMM model used in a SAT procedure for a DNN. Experimental results on the Wall Street Journal (WSJ0) corpus show that the proposed adaptation technique can provide, on average, a 17-28% relative word error rate (WER) reduction on different adaptation sets under an unsupervised adaptation setup, compared to speaker independent (SI) DNN-HMM systems built on conventional features. We found that fMLLR adaptation for the SAT DNN trained on GMM-derived features outperforms fMLLR adaptation for the SAT DNN trained on conventional features by up to 14% of relative WER reduction.
引用
收藏
页码:2882 / 2886
页数:5
相关论文
共 50 条
  • [21] Benchmarking Test-Time Unsupervised Deep Neural Network Adaptation on Edge Devices
    Bhardwaj, Kshitij
    Diffenderfer, James
    Kailkhura, Bhavya
    Gokhale, Maya
    2022 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2022), 2022, : 236 - 238
  • [22] Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling
    Samarakoon, Lahiru
    Sim, Khe Chai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2241 - 2250
  • [23] MULTILINGUAL DEEP NEURAL NETWORK BASED ACOUSTIC MODELING FOR RAPID LANGUAGE ADAPTATION
    Ngoc Thang Vu
    Imseng, David
    Povey, Daniel
    Motlicek, Petr
    Schultz, Tanja
    Bourlard, Herve
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [24] Restructuring of Deep Neural Network Acoustic Models with Singular Value Decomposition
    Xue, Jian
    Li, Jinyu
    Gong, Yifan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2364 - 2368
  • [25] Speaker Adaptation of Neural Network Acoustic Models Using I-Vectors
    Saon, George
    Soltau, Hagen
    Nahamoo, David
    Picheny, Michael
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 55 - 59
  • [26] DEEP NEURAL NETWORK DERIVED BOTTLENECK FEATURES FOR ACCURATE AUDIO CLASSIFICATION
    Zhang, Bihong
    Xie, Lei
    Yuan, Yougen
    Ming, Huaiping
    Huang, Dongyan
    Song, Mingli
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2016,
  • [27] Batch Normalization based Unsupervised Speaker Adaptation for Acoustic Models
    Yi, Jiangyan
    Tao, Jianhua
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 176 - 180
  • [28] A Hybrid Dynamic Time Warping-Deep Neural Network Architecture for Unsupervised Acoustic Modeling
    Thiolliere, Roland
    Dunbar, Ewan
    Synnaeve, Gabriel
    Versteegh, Maarten
    Dupoux, Emmanuel
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3179 - 3183
  • [29] On Fine-Tuned Deep Features for Unsupervised Domain Adaptation
    Wang, Qian
    Meng, Fanlin
    Breckon, Toby P.
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [30] Unsupervised domain adaptation based on deep adapted features alignment
    Zhou, Shaokang
    Shi, Xiasheng
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (01)