Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation

Cited by: 0

Authors:

Gales, MJF

Pye, D

Woodland, PC

Institution:

Source:

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996

Keywords:

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper investigates the use of maximum likelihood linear regression (MLLR) for both speaker and environment adaptation. MLLR transforms the mean and variance parameters of a set of HMMs. In this paper a number of different types of linear transformations of the variances are examined, including full, block diagonal, and diagonal transformation matrices. Experiments on large-vocabulary, speaker-independent data sets are described. On all the data sets examined, the use of MLLR mean and variance compensation reduced the error rate compared to mean-only compensation. Furthermore, the use of a block diagonal or full transformation of the variances on the clean data task showed slight improvements over the diagonal case. However, when some environmental mismatch was present, there was no difference in performance between using multiple diagonal variance transformations and a more complex single variance transform.
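
The abstract names three structures for the variance transformation (full, block diagonal, and diagonal) without giving formulas. The following is a minimal sketch of how such structures differ when applied to one Gaussian, not the paper's estimation procedure. It assumes an affine mean update `mu_hat = A mu + b` and a covariance update `sigma_hat = H sigma H^T`; both forms, and the helper names `adapt_mean`, `adapt_covariance`, and `restrict`, are illustrative assumptions rather than anything stated in the record.

```python
# Illustrative sketch only. The transform forms used here are assumptions:
#   mean:       mu_hat    = A @ mu + b
#   covariance: sigma_hat = H @ sigma @ H^T
# The abstract does not state the exact parameterisation used in the paper.

def matvec(A, x):
    """Multiply matrix A (list of rows) by vector x."""
    return [sum(a * v for a, v in zip(row, x)) for row in A]

def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def adapt_mean(A, b, mu):
    """Assumed affine mean transform."""
    return [m + o for m, o in zip(matvec(A, mu), b)]

def adapt_covariance(H, sigma):
    """Assumed linear covariance transform H sigma H^T."""
    return matmul(matmul(H, sigma), transpose(H))

def restrict(H, structure, block_size=1):
    """Zero the entries of H that fall outside the chosen structure.

    structure is one of "full", "block", or "diagonal" (hypothetical helper).
    """
    n = len(H)

    def keep(i, j):
        if structure == "full":
            return True
        if structure == "diagonal":
            return i == j
        if structure == "block":
            return i // block_size == j // block_size
        raise ValueError(f"unknown structure: {structure}")

    return [[H[i][j] if keep(i, j) else 0.0 for j in range(n)] for i in range(n)]

# Toy two-dimensional Gaussian with a diagonal covariance.
mu = [1, 2]
sigma = [[1, 0], [0, 4]]
H = [[2, 1], [0, 3]]

new_mu = adapt_mean([[1, 0], [0, 1]], [0.5, -1.0], mu)
sigma_diag = adapt_covariance(restrict(H, "diagonal"), sigma)
sigma_full = adapt_covariance(restrict(H, "full"), sigma)
```

In this toy case the diagonal transform only rescales each variance, so the adapted covariance stays diagonal, while the full transform introduces off-diagonal terms. A block diagonal transform sits between the two: it mixes dimensions only within each block.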


Pages: 1832-1835

Page count: 4
