Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition

被引：0

作者：

Lei, Howard ^{[1
]}

机构：

[1] Int Comp Sci Inst, Berkeley, CA 94704 USA

来源：

ADVANCES IN BIOMETRICS | 2009年 / 5558卷

关键词：

Text-dependent speaker recognition; mutual information; relevance; redundancy; data selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We developed measures relating feature vector distributions to speaker recognition (SR) performances for performance prediction and potential arbitrary data selection for SR. We examined the measures of mutual information, kurtosis, correlation, and measures pretaining to intra- and inter-speaker variability. We applied the measures on feature vectors of phones to determine which measures gave good SR, performance prediction of phones standalone and in combination. We found that mutual information had an -83.5% correlation with the Equal Error Rates (EERs) of each phone. Also, Pearson's correlation between the feature vectors of two phones had a -48.6% correalation with relative EER improvement of the score-level combination of the phones. When implemented in our new data-selection scheme (which does not require a SR system to be run), the measures allowed us to select data with 2.13% overall EER improvement (on SRE08) over data selected via a brute-force approach, at a fifth of the computational costs.

引用

页码：513 / 522

页数：10

共 50 条

[1] Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction
Lei, Howard
Lopez-Gonzalo, Eduardo
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 892 - 895
[2] Towards improving the performance of speaker recognition systems
Johnson, Neethu
George, Kuruvachan K.
Kumar, Santhosh C.
Raj, Reghu P. C.
2014 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC), 2014, : 38 - 41
[3] Maximum Entropy based Data Selection for Speaker Recognition
Huang, Chien-Lin
Ma, Bin
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2724 - 2727
[4] Data Selection with Kurtosis and Nasality features for Speaker Recognition
Lei, Howard
Mirghafori, Nikki
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2764 - +
[5] Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition
Thatphithakkul, Nattanun
Kruatrachue, Boontee
Wutiwiwatchai, Chai
Marukatat, Sanparith
Boonpiam, Vataya
2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1570 - +
[6] Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data
Silovsky, Jan
Cerva, Petr
Zdansky, Jindrich
RADIOENGINEERING, 2009, 18 (03) : 307 - 316
[7] Evaluation on Data - Speaker Dependability Approaches for Speech Recognition Tasks
Saod, Aini Hafizah Mohd
Sulaiman, Siti Noraini
Harron, Nur Athiqah
Ahmad, Azizah
Ramlan, Siti Azura
Ramli, Dzati Athiar
2012 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2012), 2012, : 254 - 258
[8] Speaker recognition - general and data fusion classifier approaches methods
Ramachandran, RP
Farrell, KR
Ramachandran, R
Mammone, RJ
PATTERN RECOGNITION, 2002, 35 (12) : 2801 - 2821
[9] Ensemble based speaker recognition using unsupervised data selection
Huang, Chien-Lin
Wang, Jia-Ching
Ma, Bin
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2016, 5
[10] Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition
Huang, Chien-Lin
Hori, Chiori
Kashioka, Hideki
Ma, Bin
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2665 - +

← 1 2 3 4 5 →