Rank-Based Frame Classification for Usable Speech Detection in Speaker Identification Systems

被引:0
|
作者
Ethridge, James [1 ]
Ramachandran, Ravi P. [1 ]
机构
[1] Rowan Univ, Dept Elect & Comp Engn, Glassboro, NJ 08028 USA
关键词
speaker identification; usable frames; Gaussian mixture model; Mahalanobis distance; decision tree; boosting; additive noise;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of a speaker identification (SID) system degrades substantially when there is a mismatch between the training and testing conditions. Discriminating between temporal sections of speech signals which are speech-like (SID usable) and noise-like (SID unusable) while only retaining frames labeled SID usable can augment SID performance substantially. In this paper, a novel labeling system for SID usable and SID unusable frames is presented for a GMM based SID system. This is motivated by a control experiment demonstrating that very high SID accuracies are theoretically achievable by removing frames that contribute more to the scores of competing speakers rather than the true speaker. To blindly identify these SID usable and unusable frames, the Mahalanobis distance and an ensemble of decision tree classifiers (with boosting) were trained on a dataset which was different from the enrollment database for the SID system. The classifier based techniques yielded improvements over the base speaker identification system (all frames used) in all cases when the speech signal was corrupted with additive white or additive pink noise.
引用
收藏
页码:292 / 296
页数:5
相关论文
共 50 条
  • [41] Rank-based energy scheduling strategy of networked microgrids in distribution systems
    Funde, Nitesh
    Yoon, Sung-Guk
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2022, 16 (01) : 84 - 98
  • [42] Rank-Based quality measurement of software systems in standardized source code
    Masud, Md. Raihan
    Abu Khaer, Md.
    Hashem, M. M. A.
    PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 424 - 429
  • [43] The DKU Speech Activity Detection and Speaker Identification Systems for Fearless Steps Challenge Phase-02
    Lin, Qingjian
    Li, Tingle
    Li, Ming
    INTERSPEECH 2020, 2020, : 2607 - 2611
  • [44] An Intelligent Technique for Android Malware Identification Using Fuzzy Rank-Based Fusion
    Taha, Altyeb
    Osman, Ahmed Hamza
    Baguda, Yakubu Suleiman
    TECHNOLOGIES, 2025, 13 (02)
  • [45] Speaker normalisation for speech-based emotion detection
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathainby
    Epps, Julien
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 611 - +
  • [46] INVENTORY BASED SPEECH ENHANCEMENT FOR SPEAKER DEDICATED SPEECH COMMUNICATION SYSTEMS
    Xiao, Xiaoqiang
    Lee, Peng
    Nickel, Robert M.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3877 - +
  • [47] RANK-BASED MULTIPLE CHANGE-POINT DETECTION IN MULTIVARIATE TIME SERIES
    Harle, F.
    Chatelain, F.
    Gouy-Pailler, C.
    Achard, S.
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1337 - 1341
  • [48] Ensemble methods of rank-based trees for single sample classification with gene expression profiles
    Lu, Min
    Yin, Ruijie
    Chen, X. Steven
    JOURNAL OF TRANSLATIONAL MEDICINE, 2024, 22 (01)
  • [49] Efficient pseudo-Gaussian and rank-based detection of random regression coefficients
    Fihri, Mohamed
    Akharif, Abdelhadi
    Mellouk, Amal
    Hallin, Marc
    JOURNAL OF NONPARAMETRIC STATISTICS, 2020, 32 (02) : 367 - 402
  • [50] Distribution-Free Detection of Structured Anomalies: Permutation and Rank-Based Scans
    Arias-Castro, Ery
    Castro, Rui M.
    Tanczos, Ervin
    Wang, Meng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 789 - 801