Long span prosodic features for speaker recognition

被引：0

作者：

Zhang, Jianping ^{[1
]}

Li, Ming ^{[1
]}

Suo, Hongbin ^{[1
]}

Yang, Lin ^{[1
]}

Fu, Qiang ^{[1
]}

Yan, Yonghong ^{[1
]}

机构：

[1] ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing 100190, China

来源：

Shengxue Xuebao/Acta Acustica | 2010年 / 35卷 / 02期

关键词：

Continuous speech recognition - Polynomial approximation;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we first give an introduction about speaker recognition techniques. Then a novel speaker verification method based on long span prosodic features is proposed. After speech is pre-processed by a voice activity detection module, and basic prosody features are extracted for each speech unit, we carried out an approximation of the pitch, formant, time domain energy and harmonic energy contours by taking the leading terms in a Legendre polynomial expansion. HLDA is used to reduce the feature dimension and mean supervector in each individual Gaussian is used to represent the distribution of the time-frequency features. Experiments on NIST06 show that the proposed method can reduce the EER from 4.9% to 4.6% when fusing with the traditional MFCC-featured system. © Right.

引用

页码：267 / 269

共 50 条

[21] Pertinent Prosodic Features for Speaker Identification by Voice
Sayoud, Halim
Ouamour, Siham
INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2010, 2 (02) : 18 - 33
[22] Effect of VoIP on Prosodic Features for Speaker Verification
Cherian, Athira Jess
Antony, Anil P.
Mary, Leena
2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 487 - 490
[23] Improvement of speaker identification by combining prosodic features with acoustic features
Zheng, R
Zhang, SW
Xu, B
ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2004, 3338 : 569 - 576
[24] SPEAKER-SPECIFIC PROSODIC FEATURES IN CONFLICT DISCOURSE
Seyranyan, Margarita Yuryevna
VESTNIK VOLGOGRADSKOGO GOSUDARSTVENNOGO UNIVERSITETA-SERIYA 2-YAZYKOZNANIE, 2015, 14 (01): : 150 - 157
[25] The Detection of Overlapping Speech with Prosodic Features for Speaker Diarization
Zelenak, Martin
Hernando, Javier
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1048 - 1051
[26] iVector Fusion of Prosodic and Cepstral Features for Speaker Verification
Kockmann, Marcel
Ferrer, Luciana
Burget, Lukas
Cernocky, Jan Honza
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 272 - 275
[27] Speaker Dependent Emotion Recognition Using Prosodic Supervectors
Lopez-Moreno, Ignacio
Ortego-Resa, Carlos
Gonzalez-Rodriguez, Joaquin
Ramos, Daniel
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1939 - 1942
[28] Affect-insensitive speaker recognition systems via emotional speech clustering using prosodic features
Dongdong Li
Yubo Yuan
Zhaohui Wu
Yingchun Yang
Neural Computing and Applications, 2015, 26 : 473 - 484
[29] Affect-insensitive speaker recognition systems via emotional speech clustering using prosodic features
Li, Dongdong
Yuan, Yubo
Wu, Zhaohui
Yang, Yingchun
NEURAL COMPUTING & APPLICATIONS, 2015, 26 (02): : 473 - 484
[30] Spoken Language Recognition With Prosodic Features
Ng, Raymond W. M.
Lee, Tan
Leung, Cheung-Chi
Ma, Bin
Li, Haizhou
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1841 - 1853

← 1 2 3 4 5 →