Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition

被引:0
|
作者
Lei, Howard [1 ]
机构
[1] Int Comp Sci Inst, Berkeley, CA 94704 USA
来源
ADVANCES IN BIOMETRICS | 2009年 / 5558卷
关键词
Text-dependent speaker recognition; mutual information; relevance; redundancy; data selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We developed measures relating feature vector distributions to speaker recognition (SR) performances for performance prediction and potential arbitrary data selection for SR. We examined the measures of mutual information, kurtosis, correlation, and measures pretaining to intra- and inter-speaker variability. We applied the measures on feature vectors of phones to determine which measures gave good SR, performance prediction of phones standalone and in combination. We found that mutual information had an -83.5% correlation with the Equal Error Rates (EERs) of each phone. Also, Pearson's correlation between the feature vectors of two phones had a -48.6% correalation with relative EER improvement of the score-level combination of the phones. When implemented in our new data-selection scheme (which does not require a SR system to be run), the measures allowed us to select data with 2.13% overall EER improvement (on SRE08) over data selected via a brute-force approach, at a fifth of the computational costs.
引用
收藏
页码:513 / 522
页数:10
相关论文
共 50 条
  • [1] Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction
    Lei, Howard
    Lopez-Gonzalo, Eduardo
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 892 - 895
  • [2] Towards improving the performance of speaker recognition systems
    Johnson, Neethu
    George, Kuruvachan K.
    Kumar, Santhosh C.
    Raj, Reghu P. C.
    2014 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC), 2014, : 38 - 41
  • [3] Maximum Entropy based Data Selection for Speaker Recognition
    Huang, Chien-Lin
    Ma, Bin
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2724 - 2727
  • [4] Data Selection with Kurtosis and Nasality features for Speaker Recognition
    Lei, Howard
    Mirghafori, Nikki
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2764 - +
  • [5] Tree-structured model selection and simulated-data adaptation for environmental and speaker robust speech recognition
    Thatphithakkul, Nattanun
    Kruatrachue, Boontee
    Wutiwiwatchai, Chai
    Marukatat, Sanparith
    Boonpiam, Vataya
    2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1570 - +
  • [6] Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data
    Silovsky, Jan
    Cerva, Petr
    Zdansky, Jindrich
    RADIOENGINEERING, 2009, 18 (03) : 307 - 316
  • [7] Evaluation on Data - Speaker Dependability Approaches for Speech Recognition Tasks
    Saod, Aini Hafizah Mohd
    Sulaiman, Siti Noraini
    Harron, Nur Athiqah
    Ahmad, Azizah
    Ramlan, Siti Azura
    Ramli, Dzati Athiar
    2012 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2012), 2012, : 254 - 258
  • [8] Speaker recognition - general and data fusion classifier approaches methods
    Ramachandran, RP
    Farrell, KR
    Ramachandran, R
    Mammone, RJ
    PATTERN RECOGNITION, 2002, 35 (12) : 2801 - 2821
  • [9] Ensemble based speaker recognition using unsupervised data selection
    Huang, Chien-Lin
    Wang, Jia-Ching
    Ma, Bin
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2016, 5
  • [10] Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition
    Huang, Chien-Lin
    Hori, Chiori
    Kashioka, Hideki
    Ma, Bin
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2665 - +