Long span prosodic features for speaker recognition

被引:0
|
作者
Zhang, Jianping [1 ]
Li, Ming [1 ]
Suo, Hongbin [1 ]
Yang, Lin [1 ]
Fu, Qiang [1 ]
Yan, Yonghong [1 ]
机构
[1] ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing 100190, China
来源
Shengxue Xuebao/Acta Acustica | 2010年 / 35卷 / 02期
关键词
Continuous speech recognition - Polynomial approximation;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we first give an introduction about speaker recognition techniques. Then a novel speaker verification method based on long span prosodic features is proposed. After speech is pre-processed by a voice activity detection module, and basic prosody features are extracted for each speech unit, we carried out an approximation of the pitch, formant, time domain energy and harmonic energy contours by taking the leading terms in a Legendre polynomial expansion. HLDA is used to reduce the feature dimension and mean supervector in each individual Gaussian is used to represent the distribution of the time-frequency features. Experiments on NIST06 show that the proposed method can reduce the EER from 4.9% to 4.6% when fusing with the traditional MFCC-featured system. © Right.
引用
收藏
页码:267 / 269
相关论文
共 50 条
  • [31] Local features for speaker recognition
    Paredes, R
    Vidal, E
    Casacuberta, F
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 1087 - 1095
  • [32] Parameterization of prosodic feature distributions for SVM modeling in speaker recognition
    Ferrer, Luciana
    Shriberg, Elizabeth
    Kajarekar, Sachin
    Soenmez, Kemal
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 233 - +
  • [33] Emerging features for speaker recognition
    Ambikairajah, Eliathamby
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1690 - 1696
  • [34] AN ADAPTIVE INITIALIZATION METHOD FOR SPEAKER DIARIZATION BASED ON PROSODIC FEATURES
    Imseng, David
    Friedland, Gerald
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4946 - 4949
  • [35] Modeling prosodic features with joint factor analysis for speaker verification
    Dehak, Najim
    Dumouchel, Pierre
    Kenny, Patrick
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2095 - 2103
  • [36] Using prosodic and conversational features for high-performance speaker recognition: Report from JHU WS'02
    Peskin, B
    Navratil, J
    Abramson, J
    Jones, D
    Klusacek, D
    Reynolds, DA
    Xiang, B
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: SIGNAL PROCESSING FOR COMMUNICATIONS SPECIAL SESSIONS, 2003, : 792 - 795
  • [37] SPEAKER RECOGNITION BY STATISTICAL FEATURES AND DYNAMIC FEATURES
    FURUI, S
    REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1982, 30 (03): : 467 - 482
  • [38] Improving Named Entity Recognition with Prosodic Features
    Katerenchuk, Denys
    Rosenberg, Andrew
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 293 - 297
  • [39] Prosodic and Voice Quality Features for Speaker Verification Over Coded Channel
    Polacky, Jozef
    Chmulik, Michal
    Jarina, Roman
    2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2016, : 327 - 330
  • [40] Modeling prosodic features with probabilistic linear discriminant analysis for speaker verification
    Liang, Chunyan
    Yang, Lin
    Zhou, Ruohua
    Yan, Yonghong
    Shengxue Xuebao/Acta Acustica, 2015, 40 (01): : 28 - 33