Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition

被引:0
|
作者
Lei, Howard [1 ]
机构
[1] Int Comp Sci Inst, Berkeley, CA 94704 USA
来源
ADVANCES IN BIOMETRICS | 2009年 / 5558卷
关键词
Text-dependent speaker recognition; mutual information; relevance; redundancy; data selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We developed measures relating feature vector distributions to speaker recognition (SR) performances for performance prediction and potential arbitrary data selection for SR. We examined the measures of mutual information, kurtosis, correlation, and measures pretaining to intra- and inter-speaker variability. We applied the measures on feature vectors of phones to determine which measures gave good SR, performance prediction of phones standalone and in combination. We found that mutual information had an -83.5% correlation with the Equal Error Rates (EERs) of each phone. Also, Pearson's correlation between the feature vectors of two phones had a -48.6% correalation with relative EER improvement of the score-level combination of the phones. When implemented in our new data-selection scheme (which does not require a SR system to be run), the measures allowed us to select data with 2.13% overall EER improvement (on SRE08) over data selected via a brute-force approach, at a fifth of the computational costs.
引用
收藏
页码:513 / 522
页数:10
相关论文
共 50 条
  • [31] Speaker Recognition Exploiting D2D Communications Paradigm: Performance Evaluation of Multiple Observations Approaches
    Igor Bisio
    Fabio Lavagetto
    Chiara Garibotto
    Andrea Sciarrone
    Mobile Networks and Applications, 2017, 22 : 1045 - 1057
  • [32] Speaker Recognition Exploiting D2D Communications Paradigm: Performance Evaluation of Multiple Observations Approaches
    Bisio, Igor
    Lavagetto, Fabio
    Garibotto, Chiara
    Sciarrone, Andrea
    MOBILE NETWORKS & APPLICATIONS, 2017, 22 (06): : 1045 - 1057
  • [33] Towards the Selection of Best Machine Learning Model for Student Performance Analysis and Prediction
    Masood, Muhammad Faisal
    Khan, Aimal
    Hussain, Farhan
    Shaukat, Arslan
    Zeb, Babar
    Ullah, Rana Muhammad Kaleem
    2019 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2019), 2019, : 12 - 17
  • [34] Influence of Different Feature Selection Approaches on the Performance of Emotion Recognition Methods Based on SVM
    Belkov, Daniil
    Purtov, Konstantin
    Kublanov, Vladimir
    PROCEEDINGS OF THE 20TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT 2017), 2017, : 40 - 45
  • [35] Efficient Approaches for Solving the Large-Scale k-Medoids Problem: Towards Structured Data
    Martino, Alessio
    Rizzi, Antonello
    Mascioli, Fabio Massimo Frattale
    COMPUTATIONAL INTELLIGENCE, IJCCI 2017, 2019, 829 : 199 - 219
  • [36] Performance prediction model for cloud service selection from smart data
    Al-Faifi, Abdullah Mohammed
    Song, Biao
    Hassan, Mohammad Mehedi
    Alamri, Atif
    Gumaei, Abdu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 85 : 97 - 106
  • [37] PepAls: Performance Prediction and Algorithm Selection Framework for Data Mining Applications
    You, Mingyu
    Xu, Xuanhui
    Wang, Zheng
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4648 - 4657
  • [38] Towards the Prediction of the Performance and Energy Efficiency of Distributed Data Management Systems
    Niemann, Raik
    ICPE'16 COMPANION: PROCEEDINGS OF THE 2016 COMPANION PUBLICATION FOR THE ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, 2016, : 23 - 28
  • [39] Performance Analysis of Named Entity Recognition Approaches on Code-Mixed Data
    Gaddamidi, Sreeja
    Prasath, Rajendra
    INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY (ICICCT 2021), 2021, 1417 : 153 - 167
  • [40] Effective background data selection for SVM-based speaker recognition with unseen test environments: More is not always better
    Hansen J.H.L.
    Suh J.-W.
    Angkititrakul P.
    Lei Y.
    International Journal of Speech Technology, 2014, 17 (03) : 211 - 221