Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation

Authors
Gales, MJF
Pye, D
Woodland, PC
Affiliations
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper investigates the use of maximum likelihood linear regression (MLLR) for both speaker and environment adaptation. MLLR transforms the mean and variance parameters of a set of HMMs. In this paper, a number of different types of linear transformations of the variances are examined, including full, block diagonal, and diagonal transformation matrices. Experiments on large-vocabulary, speaker-independent data sets are described. On all the data sets examined, the use of MLLR mean and variance compensation reduced the error rate compared to mean-only compensation. Furthermore, the use of a block diagonal or full transformation of the variances on the clean data task showed slight improvements over the diagonal case. However, when some environmental mismatch was present, there was no difference in performance between using multiple diagonal variance transformations and a more complex single variance transform.
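
The difference between diagonal, block diagonal, and full variance transforms can be illustrated with a minimal NumPy sketch. It assumes the affine mean update mu_hat = A mu + b and the variance update Sigma_hat = H Sigma H^T; the abstract does not state the paper's exact parameterization, so this form, the helper names `adapt_gaussian` and `block_diagonal`, and all numeric values are illustrative assumptions rather than the authors' method. Estimating A, b, and H from adaptation data is not shown.

```python
import numpy as np


def adapt_gaussian(mean, cov, A, b, H):
    """Apply an affine mean transform and a linear variance transform.

    Assumed forms (not taken from the abstract):
        mu_hat    = A @ mu + b
        Sigma_hat = H @ Sigma @ H.T
    """
    new_mean = A @ mean + b
    new_cov = H @ cov @ H.T
    return new_mean, new_cov


def block_diagonal(blocks):
    """Assemble a block-diagonal matrix from a list of square blocks."""
    dim = sum(blk.shape[0] for blk in blocks)
    out = np.zeros((dim, dim))
    start = 0
    for blk in blocks:
        n = blk.shape[0]
        out[start:start + n, start:start + n] = blk
        start += n
    return out


# Toy 4-dimensional Gaussian with a diagonal covariance (hypothetical values).
mean = np.array([1.0, -2.0, 0.5, 3.0])
cov = np.diag([1.0, 4.0, 0.25, 9.0])

# Mean transform: identity rotation plus a bias (hypothetical values).
A = np.eye(4)
b = np.array([0.1, 0.2, -0.1, 0.0])

# Diagonal variance transform: rescales each dimension independently.
H_diag = np.diag([2.0, 0.5, 1.0, 1.0])

# Block-diagonal variance transform: couples dimensions within each block.
H_block = block_diagonal([np.array([[1.0, 0.5], [0.0, 1.0]]), np.eye(2)])

mean_d, cov_d = adapt_gaussian(mean, cov, A, b, H_diag)
mean_b, cov_b = adapt_gaussian(mean, cov, A, b, H_block)
```

Under these assumptions, a diagonal H leaves a diagonal covariance diagonal, while a block diagonal or full H introduces off-diagonal terms between the dimensions it couples, at the cost of more parameters to estimate.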
Pages: 1832-1835
Number of pages: 4