PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION

Cited by: 0
Authors
Wang, Jianglin [1 ]
Johnson, Michael T. [1 ]
Affiliations
[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53233 USA
Keywords
Speaker distinctive feature; Speaker identification; Glottal source excitation and GMM-UBM; VERIFICATION; PHASE; MFCC;
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
This paper introduces three physiologically-motivated features for speaker identification: Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC), and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these physiologically-driven features are both more accurate than, and complementary to, traditional mel-frequency cepstral coefficients (MFCC). In particular, incorporating the proposed glottal source features significantly improves the overall robustness and accuracy of speaker identification.
Pages: 5