Extraction of Glottal Features for Speaker Recognition

被引:0
|
作者
Ostrogonac, Stevan [1 ]
Secujski, Milan [1 ]
Knezevic, Dragan [1 ]
Suzic, Sinisa [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an extension to the SEDREAMS algorithm for extracting the information about glottal opening and glottal closure instants (GCI and GOI) directly from the speech signal. Accurate detection of GCIs and GOIs is crucial for estimating the glottal features which are to be used in speaker recognition systems. Many different approaches resulted in a variety of algorithms dealing with this problem. The algorithm that showed the best results so far consists of two steps. First, a mean-based signal is computed to determine the intervals in which the GCI and GOI moments should be searched for. Then, discontinuities are sought in the LP residual of the speech signal and they represent the estimation of glottal events. This algorithm (in literature found under the name SEDREAMS) is widely used in glottal excitation estimation systems. However, the mean-based signal calculated in the first step of the process sometimes contains unwanted spectral components which significantly degrade the performances. This paper describes one way to address this problem. By applying an adaptive filter to the mean-based signal significant improvement has been achieved in glottal features estimation. This was confirmed by a speaker recognition experiment which showed very encouraging results.
引用
收藏
页码:369 / 373
页数:5
相关论文
共 50 条
  • [21] Acoustic and facial features for speaker recognition
    Roach, MJ
    Brand, JD
    Mason, JSD
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 258 - 261
  • [22] SINGLE CHANNEL TARGET SPEAKER EXTRACTION AND RECOGNITION WITH SPEAKER BEAM
    Delcroix, Marc
    Zmolikova, Katerina
    Kinoshita, Keisuke
    Ogawa, Atsunori
    Nakatani, Tomohiro
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5554 - 5558
  • [23] Investigation of Glottal Features and Annotation Procedures for Speech Emotion Recognition
    Takebe, Masaaki
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [24] Analysis of Glottal Signals for Speaker Information
    Ramesh, K.
    Pradhan, Gayadhar
    Prasanna, S. R. Mahadeva
    2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
  • [25] Speaker recognition via nonlinear phonetic and speaker-discriminative features
    Stoll, Lara
    Frankel, Joe
    Mirghafori, Nikki
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2007, 4885 : 114 - 123
  • [26] Feature Extraction Methods for Speaker Recognition: A Review
    Chaudhary, Gopal
    Srivastava, Smriti
    Bhardwaj, Saurabh
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (12)
  • [27] A Novel Feature Extraction Methods for Speaker Recognition
    Zou, Muchun
    COMMUNICATIONS AND INFORMATION PROCESSING, PT 1, 2012, 288 : 713 - 722
  • [28] Speaker Verification based on extraction of Deep Features
    Mitsianis, Evangelos
    Spyrou, Evaggelos
    Giannakopoulos, Theodore
    10TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2018), 2018,
  • [29] Optimizing Features Extraction Parameters for Speaker Verification
    Impedovo, Donato
    Refice, Mario
    NEW ASPECTS OF SYSTEMS, PTS I AND II, 2008, : 498 - +
  • [30] Target Speaker Extraction by Fusing Voiceprint Features
    Cheng, Shidan
    Shen, Ying
    Wang, Dongqing
    APPLIED SCIENCES-BASEL, 2022, 12 (16):