Extraction of Glottal Features for Speaker Recognition

被引：0

作者：

Ostrogonac, Stevan ^{[1
]}

Secujski, Milan ^{[1
]}

Knezevic, Dragan ^{[1
]}

Suzic, Sinisa ^{[1
]}

机构：

[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia

来源：

IEEE 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL CYBERNETICS (ICCC 2013) | 2013年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an extension to the SEDREAMS algorithm for extracting the information about glottal opening and glottal closure instants (GCI and GOI) directly from the speech signal. Accurate detection of GCIs and GOIs is crucial for estimating the glottal features which are to be used in speaker recognition systems. Many different approaches resulted in a variety of algorithms dealing with this problem. The algorithm that showed the best results so far consists of two steps. First, a mean-based signal is computed to determine the intervals in which the GCI and GOI moments should be searched for. Then, discontinuities are sought in the LP residual of the speech signal and they represent the estimation of glottal events. This algorithm (in literature found under the name SEDREAMS) is widely used in glottal excitation estimation systems. However, the mean-based signal calculated in the first step of the process sometimes contains unwanted spectral components which significantly degrade the performances. This paper describes one way to address this problem. By applying an adaptive filter to the mean-based signal significant improvement has been achieved in glottal features estimation. This was confirmed by a speaker recognition experiment which showed very encouraging results.

引用

页码：369 / 373

页数：5

共 50 条

[31] Fusion of acoustic and tokenization features for speaker recognition
Tong, Rong
Ma, Bin
Lee, Kong-Aik
You, Changhuai
Zhu, Donglai
Kinnunen, Tomi
Sun, Hanwu
Dong, Minghui
Chng, Eng-Siong
Li, Haizhou
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 566 - +
[32] PARALLEL TRANSFORMATION NETWORK FEATURES FOR SPEAKER RECOGNITION
Abad, Alberto
Luque, Jordi
Trancoso, Isabel
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5300 - 5303
[33] Speaker Recognition using Spectral Dimension Features
Chen, Wen-Shiung
Huang, Jr-Feng
2009 FOURTH INTERNATIONAL MULTI-CONFERENCE ON COMPUTING IN THE GLOBAL INFORMATION TECHNOLOGY (ICCGI 2009), 2009, : 132 - 137
[34] Long span prosodic features for speaker recognition
Zhang, Jianping
Li, Ming
Suo, Hongbin
Yang, Lin
Fu, Qiang
Yan, Yonghong
Shengxue Xuebao/Acta Acustica, 2010, 35 (02): : 267 - 269
[35] On the use of complementary spectral features for speaker recognition
Hosseinzadeh, Danoush
Krishnan, Sridhar
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
[36] FM Features for Automatic Forensic Speaker Recognition
Thiruvaran, Tharmarajah
Ambikairajah, Eliathamby
Epps, Julien
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1497 - 1500
[37] SURVEY AND EVALUATION OF ACOUSTIC FEATURES FOR SPEAKER RECOGNITION
Lawson, A.
Vabishchevich, P.
Huggins, M.
Ardis, P.
Battles, B.
Stauffer, A.
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5444 - 5447
[38] Looking for relevant features for speaker role recognition
IRIT, Unversité de Toulouse, 118 route de Narbonne, F-31062 Toulouse Cedex 9, France
Proc. Annu. Conf. Int. Speech Commun. Assoc., INTERSPEECH, (1057-1060):
[39] Speaker recognition using prosodic and lexical features
Kajarekar, S
Ferrer, L
Venkataraman, A
Sonmez, K
Shriberg, E
Stolcke, A
Bratt, H
Gadde, RR
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 19 - 24
[40] Integration of complementary acoustic features for speaker recognition
Zheng, Nengheng
Lee, Tan
Ching, P. C.
IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (03) : 181 - 184

← 1 2 3 4 5 →