Extraction of Glottal Features for Speaker Recognition

被引：0

作者：

Ostrogonac, Stevan ^{[1
]}

Secujski, Milan ^{[1
]}

Knezevic, Dragan ^{[1
]}

Suzic, Sinisa ^{[1
]}

机构：

[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia

来源：

IEEE 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL CYBERNETICS (ICCC 2013) | 2013年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an extension to the SEDREAMS algorithm for extracting the information about glottal opening and glottal closure instants (GCI and GOI) directly from the speech signal. Accurate detection of GCIs and GOIs is crucial for estimating the glottal features which are to be used in speaker recognition systems. Many different approaches resulted in a variety of algorithms dealing with this problem. The algorithm that showed the best results so far consists of two steps. First, a mean-based signal is computed to determine the intervals in which the GCI and GOI moments should be searched for. Then, discontinuities are sought in the LP residual of the speech signal and they represent the estimation of glottal events. This algorithm (in literature found under the name SEDREAMS) is widely used in glottal excitation estimation systems. However, the mean-based signal calculated in the first step of the process sometimes contains unwanted spectral components which significantly degrade the performances. This paper describes one way to address this problem. By applying an adaptive filter to the mean-based signal significant improvement has been achieved in glottal features estimation. This was confirmed by a speaker recognition experiment which showed very encouraging results.

引用

页码：369 / 373

页数：5

共 50 条

[21] Acoustic and facial features for speaker recognition
Roach, MJ
Brand, JD
Mason, JSD
15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 258 - 261
[22] SINGLE CHANNEL TARGET SPEAKER EXTRACTION AND RECOGNITION WITH SPEAKER BEAM
Delcroix, Marc
Zmolikova, Katerina
Kinoshita, Keisuke
Ogawa, Atsunori
Nakatani, Tomohiro
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5554 - 5558
[23] Investigation of Glottal Features and Annotation Procedures for Speech Emotion Recognition
Takebe, Masaaki
Yamamoto, Kazumasa
Nakagawa, Seiichi
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[24] Analysis of Glottal Signals for Speaker Information
Ramesh, K.
Pradhan, Gayadhar
Prasanna, S. R. Mahadeva
2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
[25] Speaker recognition via nonlinear phonetic and speaker-discriminative features
Stoll, Lara
Frankel, Joe
Mirghafori, Nikki
ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 114 - 123
[26] Feature Extraction Methods for Speaker Recognition: A Review
Chaudhary, Gopal
Srivastava, Smriti
Bhardwaj, Saurabh
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (12)
[27] A Novel Feature Extraction Methods for Speaker Recognition
Zou, Muchun
COMMUNICATIONS AND INFORMATION PROCESSING, PT 1, 2012, 288 : 713 - 722
[28] Speaker Verification based on extraction of Deep Features
Mitsianis, Evangelos
Spyrou, Evaggelos
Giannakopoulos, Theodore
10TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2018), 2018,
[29] Optimizing Features Extraction Parameters for Speaker Verification
Impedovo, Donato
Refice, Mario
NEW ASPECTS OF SYSTEMS, PTS I AND II, 2008, : 498 - +
[30] Target Speaker Extraction by Fusing Voiceprint Features
Cheng, Shidan
Shen, Ying
Wang, Dongqing
APPLIED SCIENCES-BASEL, 2022, 12 (16):

← 1 2 3 4 5 →