Robust model for speaker verification against session-dependent utterance variation

被引：0

作者：

Matsui, T ^{[1
]}

Aikawa, K

机构：

[1] Inst Stat Math, Tokyo 1068569, Japan

[2] NTT Corp, NTT Commun Sci Labs, Tokyo 1008116, Japan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2003年 / E86D卷 / 04期

关键词：

speaker verification; speaker model; session dependent; utterance variation; handset dependent distortion;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates a new method for creating robust speaker models to cope with inter-session variation of a speaker in a continuous HMM-based speaker verification system. The new method estimates session-independent parameters by decomposing inter-session variations into two distinct parts: session-dependent and -independent. The parameters of the speaker models are estimated using the speaker adaptive training algorithm in conjunction with the equalization of session-dependent variation. The resultant models capture the session-independent speaker characteristics more reliably than the conventional models and their discriminative power improves accordingly. Moreover we have made our models more invariant to handset variations in a public switched telephone network (PSTN) by focusing on session-dependent variation and handset-dependent distortion separately. Text-independent speech data recorded by 20 speakers in seven sessions over 16 months was used to evaluate the new approach. The proposed method reduces the error rate by 15% relatively. When compared with the popular cepstral mean normalization, the error rate is reduced by 24% relatively when the speaker models were recreated using speech data recorded in four or more sessions.

引用

页码：712 / 718

页数：7

共 49 条

[1] Robust model for speaker verification against session-dependent utterance variation
Matsui, T
Aikawa, K
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 117 - 120
[2] Robust speaker recognition against utterance variations
Lee, JJ
Rheem, JY
Lee, KY
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 2, PROCEEDINGS, 2003, 2668 : 624 - 630
[3] Model selection and score normalization for text-dependent single utterance speaker verification
Buyuk, Osman
Arslan, Mustafa Levent
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2012, 20 : 1277 - 1295
[4] A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification
Aronowitz, H
Burshtein, D
Amir, A
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 733 - 736
[5] Robust Session Variability Compensation for SVM Speaker Verification
Seo, Hyunson
Jung, Chi-Sang
Kang, Hong-Goo
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1631 - 1641
[6] Robust Speaker Verification Against Additive Noise
Wang, Ming-He
Zhang, Er-Hua
Tang, Zhen-Min
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2019, 35 (02) : 291 - 305
[7] Robust Methods for Text-Dependent Speaker Verification
Bhukya, Ramesh K.
Prasanna, S. R. Mahadeva
Sarma, Biswajit Dev
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (11) : 5253 - 5288
[8] Robust Methods for Text-Dependent Speaker Verification
Ramesh K. Bhukya
S. R. Mahadeva Prasanna
Biswajit Dev Sarma
Circuits, Systems, and Signal Processing, 2019, 38 : 5253 - 5288
[9] Robust Training for Speaker Verification against Noisy Labels
Fang, Zhihua
He, Liang
Ma, Hanhan
Guo, Xiaochen
Li, Lin
INTERSPEECH 2023, 2023, : 3192 - 3196
[10] Phoneme dependent inter-session variability reduction for speaker verification
Lu, Haoze
Zhang, Wenbin
Horiuchi, Yasuo
Kuroiwa, Shingo
INTERNATIONAL JOURNAL OF BIOMETRICS, 2015, 7 (02) : 83 - 96

← 1 2 3 4 5 →