Long span prosodic features for speaker recognition

被引:0
|
作者
Zhang, Jianping [1 ]
Li, Ming [1 ]
Suo, Hongbin [1 ]
Yang, Lin [1 ]
Fu, Qiang [1 ]
Yan, Yonghong [1 ]
机构
[1] ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing 100190, China
来源
Shengxue Xuebao/Acta Acustica | 2010年 / 35卷 / 02期
关键词
Continuous speech recognition - Polynomial approximation;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we first give an introduction about speaker recognition techniques. Then a novel speaker verification method based on long span prosodic features is proposed. After speech is pre-processed by a voice activity detection module, and basic prosody features are extracted for each speech unit, we carried out an approximation of the pitch, formant, time domain energy and harmonic energy contours by taking the leading terms in a Legendre polynomial expansion. HLDA is used to reduce the feature dimension and mean supervector in each individual Gaussian is used to represent the distribution of the time-frequency features. Experiments on NIST06 show that the proposed method can reduce the EER from 4.9% to 4.6% when fusing with the traditional MFCC-featured system. © Right.
引用
收藏
页码:267 / 269
相关论文
共 50 条
  • [21] Pertinent Prosodic Features for Speaker Identification by Voice
    Sayoud, Halim
    Ouamour, Siham
    INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2010, 2 (02) : 18 - 33
  • [22] Effect of VoIP on Prosodic Features for Speaker Verification
    Cherian, Athira Jess
    Antony, Anil P.
    Mary, Leena
    2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 487 - 490
  • [23] Improvement of speaker identification by combining prosodic features with acoustic features
    Zheng, R
    Zhang, SW
    Xu, B
    ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2004, 3338 : 569 - 576
  • [24] SPEAKER-SPECIFIC PROSODIC FEATURES IN CONFLICT DISCOURSE
    Seyranyan, Margarita Yuryevna
    VESTNIK VOLGOGRADSKOGO GOSUDARSTVENNOGO UNIVERSITETA-SERIYA 2-YAZYKOZNANIE, 2015, 14 (01): : 150 - 157
  • [25] The Detection of Overlapping Speech with Prosodic Features for Speaker Diarization
    Zelenak, Martin
    Hernando, Javier
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1048 - 1051
  • [26] iVector Fusion of Prosodic and Cepstral Features for Speaker Verification
    Kockmann, Marcel
    Ferrer, Luciana
    Burget, Lukas
    Cernocky, Jan Honza
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 272 - 275
  • [27] Speaker Dependent Emotion Recognition Using Prosodic Supervectors
    Lopez-Moreno, Ignacio
    Ortego-Resa, Carlos
    Gonzalez-Rodriguez, Joaquin
    Ramos, Daniel
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1939 - 1942
  • [28] Affect-insensitive speaker recognition systems via emotional speech clustering using prosodic features
    Dongdong Li
    Yubo Yuan
    Zhaohui Wu
    Yingchun Yang
    Neural Computing and Applications, 2015, 26 : 473 - 484
  • [29] Affect-insensitive speaker recognition systems via emotional speech clustering using prosodic features
    Li, Dongdong
    Yuan, Yubo
    Wu, Zhaohui
    Yang, Yingchun
    NEURAL COMPUTING & APPLICATIONS, 2015, 26 (02): : 473 - 484
  • [30] Spoken Language Recognition With Prosodic Features
    Ng, Raymond W. M.
    Lee, Tan
    Leung, Cheung-Chi
    Ma, Bin
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1841 - 1853