Extraction of Glottal Features for Speaker Recognition

被引：0

作者：

Ostrogonac, Stevan ^{[1
]}

Secujski, Milan ^{[1
]}

Knezevic, Dragan ^{[1
]}

Suzic, Sinisa ^{[1
]}

机构：

[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia

来源：

IEEE 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL CYBERNETICS (ICCC 2013) | 2013年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an extension to the SEDREAMS algorithm for extracting the information about glottal opening and glottal closure instants (GCI and GOI) directly from the speech signal. Accurate detection of GCIs and GOIs is crucial for estimating the glottal features which are to be used in speaker recognition systems. Many different approaches resulted in a variety of algorithms dealing with this problem. The algorithm that showed the best results so far consists of two steps. First, a mean-based signal is computed to determine the intervals in which the GCI and GOI moments should be searched for. Then, discontinuities are sought in the LP residual of the speech signal and they represent the estimation of glottal events. This algorithm (in literature found under the name SEDREAMS) is widely used in glottal excitation estimation systems. However, the mean-based signal calculated in the first step of the process sometimes contains unwanted spectral components which significantly degrade the performances. This paper describes one way to address this problem. By applying an adaptive filter to the mean-based signal significant improvement has been achieved in glottal features estimation. This was confirmed by a speaker recognition experiment which showed very encouraging results.

引用

页码：369 / 373

页数：5

共 50 条

[1] On the Potential of Glottal Signatures for Speaker Recognition
Drugman, Thomas
Dutoit, Thierry
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2106 - 2109
[2] Recuperating spectral features using glottal information and its application to speaker recognition
Yang, P
Yang, YC
Wu, ZH
2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 2943 - 2946
[3] Speaker Discrimination Ability of Glottal Waveform Features
Torres, Juan Felix
Moore, Elliot
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1650 - 1653
[4] Extraction and representation of prosodic features for language and speaker recognition
Mary, Leena
Yegnanarayana, B.
SPEECH COMMUNICATION, 2008, 50 (10) : 782 - 796
[5] Exploiting glottal information in speaker recognition using parallel GMMs
Yang, P
Yang, YC
Wu, ZH
AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 804 - 812
[6] Combining the Glottal Mixture Model (GLOMM) with UBM for Speaker Recognition
Baggenstoss, Paul M.
2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2156 - 2160
[7] Optimal MFCC Features Extraction by Differential Evolution Algorithm for Speaker Recognition
Sadeghi, Mohsen
Marvi, Hossein
2017 3RD IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2017, : 169 - 173
[8] Local features for speaker recognition
Paredes, R
Vidal, E
Casacuberta, F
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 1087 - 1095
[9] Glottal Doppler Radar System and Its Applications to Communication and Speaker Recognition
Chang, Chia-Chan
Lin, Chien-San
Chang, Sheng-Fuh
Lin, Chun-Chi
Jiang, Zhen-Qiang
Yu, Sung-Nien
2011 41ST EUROPEAN MICROWAVE CONFERENCE, 2011, : 1261 - 1264
[10] Glottal Doppler Radar System and Its Applications to Communication and Speaker Recognition
Chang, Chia-Chan
Lin, Chien-San
Chang, Sheng-Fuh
Lin, Chun-Chi
Jiang, Zhen-Qiang
Yu, Sung-Nien
2011 8TH EUROPEAN RADAR CONFERENCE, 2011, : 373 - 376

← 1 2 3 4 5 →