PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION

Cited by: 0
Authors
Wang, Jianglin [1 ]
Johnson, Michael T. [1 ]
Affiliations
[1] Marquette Univ, Dept Elect & Comp Engn, Speech & Signal Proc Lab, Milwaukee, WI 53233 USA
Keywords
Speaker distinctive feature; Speaker identification; Glottal source excitation and GMM-UBM; VERIFICATION; PHASE; MFCC
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper introduces three physiologically motivated features for speaker identification: Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC), and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed features give better results at lower model complexity and also provide complementary information that can improve overall system performance even with larger amounts of data. Speaker identification results on the YOHO corpus show that these features are both more accurate than, and complementary to, traditional mel-frequency cepstral coefficients (MFCC). In particular, incorporating the proposed glottal source features significantly improves the robustness and accuracy of speaker identification.
Pages: 5