Rank-Based Frame Classification for Usable Speech Detection in Speaker Identification Systems

被引:0
|
作者
Ethridge, James [1 ]
Ramachandran, Ravi P. [1 ]
机构
[1] Rowan Univ, Dept Elect & Comp Engn, Glassboro, NJ 08028 USA
关键词
speaker identification; usable frames; Gaussian mixture model; Mahalanobis distance; decision tree; boosting; additive noise;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The performance of a speaker identification (SID) system degrades substantially when there is a mismatch between the training and testing conditions. Discriminating between temporal sections of speech signals which are speech-like (SID usable) and noise-like (SID unusable) while only retaining frames labeled SID usable can augment SID performance substantially. In this paper, a novel labeling system for SID usable and SID unusable frames is presented for a GMM based SID system. This is motivated by a control experiment demonstrating that very high SID accuracies are theoretically achievable by removing frames that contribute more to the scores of competing speakers rather than the true speaker. To blindly identify these SID usable and unusable frames, the Mahalanobis distance and an ensemble of decision tree classifiers (with boosting) were trained on a dataset which was different from the enrollment database for the SID system. The classifier based techniques yielded improvements over the base speaker identification system (all frames used) in all cases when the speech signal was corrupted with additive white or additive pink noise.
引用
收藏
页码:292 / 296
页数:5
相关论文
共 50 条
  • [1] A novel rank-based classifier combination scheme for speaker identification
    Altinçay, H
    Demirekler, M
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1209 - 1212
  • [2] Rank-based ordinal classification
    Serrat, Joan
    Ruiz, Idoia
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8069 - 8076
  • [3] Enhancing speaker identification in criminal investigations through clusterization and rank-based scoring
    Moura, Antonio Artur
    Nepomuceno, Napoleao
    Furtado, Vasco
    FORENSIC SCIENCE INTERNATIONAL-DIGITAL INVESTIGATION, 2024, 49
  • [4] Developing usable speech criteria for speaker identification technology
    Lovekin, JM
    Yantorno, RE
    Krishnamachari, KR
    Benincasa, DS
    Wenndt, SJ
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 421 - 424
  • [5] Rank-based outlier detection
    Huang, Huaming
    Mehrotra, Kishan
    Mohan, Chilukuri K.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2013, 83 (03) : 518 - 531
  • [6] Rank-based autoregressive order identification
    Garel, B
    Hallin, M
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (448) : 1357 - 1371
  • [7] Rank-Based Nondomination Set Identification with Preprocessing
    Palakonda, Vikas
    Mallipeddi, Rammohan
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2016, PT II, 2016, 9713 : 150 - 157
  • [8] Rank-Based Classification Using Robust Discriminant Functions
    Abebe, Asheber
    Nudurupati, Sai V.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2009, 38 (02) : 199 - 214
  • [9] RANK-BASED SLOCC CLASSIFICATION FOR ODD N QUBITS
    Li, Xiangrong
    Li, Dafa
    QUANTUM INFORMATION & COMPUTATION, 2011, 11 (7-8) : 695 - 705
  • [10] On the Performance of Variable Selection and Classification via Rank-Based Classifier
    Sarker, Md Showaib Rahman
    Pokojovy, Michael
    Kim, Sangjin
    MATHEMATICS, 2019, 7 (05)