PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION

Cited by: 0
Authors
Wang, Jianglin [1 ]
Johnson, Michael T. [1 ]
Affiliation
[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53233 USA
Keywords
Speaker distinctive feature; Speaker identification; Glottal source excitation and GMM-UBM; VERIFICATION; PHASE; MFCC;
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper introduces three physiologically motivated features for speaker identification: Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC), and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, incorporating the proposed glottal source features significantly improves the robustness and accuracy of speaker identification.
Pages: 5
Related papers
50 in total
  • [41] The Research of Feature Extraction Based on MFCC for Speaker Recognition
    Zhang Wanli
    Li Guoxin
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1074 - 1077
  • [42] A general framework of feature extraction: Application to speaker recognition
    Liu, CS
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 669 - 672
  • [43] Identity feature extraction scheme of curvelets for speaker recognition
    Wang Jinfang
    Wang Jinbao
    Zhao Xiaojing
    2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 29 - +
  • [44] Feature Extraction and Classification Techniques for Speaker Recognition: A Review
    Dhameliya, Kinnal
    Bhatt, Ninad
    2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,
  • [45] A pitch synchronous feature extraction method for speaker recognition
    Kim, S
    Eriksson, T
    Kang, HG
    Youn, DH
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 405 - 408
  • [46] MULTI-FEATURE INTEGRATION FOR SPEAKER EMBEDDING EXTRACTION
    Sankala, Sreekanth
    Rafi, Shaik Mohammad B.
    Murty, Sri Rama K.
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7957 - 7961
  • [47] A Multiscale Chaotic Feature Extraction Method for Speaker Recognition
    Jiang Lin
    Yi Yumei
    Zhang Maosheng
    Chen Defeng
    Wang Chao
    Wang Tonghan
    COMPLEXITY, 2020, 2020
  • [48] An Auditory Feature Extraction Method for Robust Speaker Recognition
    Hu, Fengsong
    Cao, Xiaoyu
    PROCEEDINGS OF 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, 2012, : 1067 - 1071
  • [49] Privacy-preserving Variational Information Feature Extraction for Domestic Activity Monitoring Versus Speaker Identification
    Nelus, Alexandru
    Ebbers, Janek
    Haeb-Umbach, Reinhold
    Martin, Rainer
    INTERSPEECH 2019, 2019, : 3710 - 3714
  • [50] Physiological feature extraction for text independent speaker identification using non-uniform subband processing
    Lu, Xugang
    Dang, Jianwu
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 461 - +