Extraction of Glottal Features for Speaker Recognition

被引:0
|
作者
Ostrogonac, Stevan [1 ]
Secujski, Milan [1 ]
Knezevic, Dragan [1 ]
Suzic, Sinisa [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an extension to the SEDREAMS algorithm for extracting the information about glottal opening and glottal closure instants (GCI and GOI) directly from the speech signal. Accurate detection of GCIs and GOIs is crucial for estimating the glottal features which are to be used in speaker recognition systems. Many different approaches resulted in a variety of algorithms dealing with this problem. The algorithm that showed the best results so far consists of two steps. First, a mean-based signal is computed to determine the intervals in which the GCI and GOI moments should be searched for. Then, discontinuities are sought in the LP residual of the speech signal and they represent the estimation of glottal events. This algorithm (in literature found under the name SEDREAMS) is widely used in glottal excitation estimation systems. However, the mean-based signal calculated in the first step of the process sometimes contains unwanted spectral components which significantly degrade the performances. This paper describes one way to address this problem. By applying an adaptive filter to the mean-based signal significant improvement has been achieved in glottal features estimation. This was confirmed by a speaker recognition experiment which showed very encouraging results.
引用
收藏
页码:369 / 373
页数:5
相关论文
共 50 条
  • [1] On the Potential of Glottal Signatures for Speaker Recognition
    Drugman, Thomas
    Dutoit, Thierry
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2106 - 2109
  • [2] Recuperating spectral features using glottal information and its application to speaker recognition
    Yang, P
    Yang, YC
    Wu, ZH
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 2943 - 2946
  • [3] Speaker Discrimination Ability of Glottal Waveform Features
    Torres, Juan Felix
    Moore, Elliot
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1650 - 1653
  • [4] Extraction and representation of prosodic features for language and speaker recognition
    Mary, Leena
    Yegnanarayana, B.
    SPEECH COMMUNICATION, 2008, 50 (10) : 782 - 796
  • [5] Exploiting glottal information in speaker recognition using parallel GMMs
    Yang, P
    Yang, YC
    Wu, ZH
    AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 804 - 812
  • [6] Combining the Glottal Mixture Model (GLOMM) with UBM for Speaker Recognition
    Baggenstoss, Paul M.
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2156 - 2160
  • [7] Optimal MFCC Features Extraction by Differential Evolution Algorithm for Speaker Recognition
    Sadeghi, Mohsen
    Marvi, Hossein
    2017 3RD IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2017, : 169 - 173
  • [8] Local features for speaker recognition
    Paredes, R
    Vidal, E
    Casacuberta, F
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 1087 - 1095
  • [9] Glottal Doppler Radar System and Its Applications to Communication and Speaker Recognition
    Chang, Chia-Chan
    Lin, Chien-San
    Chang, Sheng-Fuh
    Lin, Chun-Chi
    Jiang, Zhen-Qiang
    Yu, Sung-Nien
    2011 41ST EUROPEAN MICROWAVE CONFERENCE, 2011, : 1261 - 1264
  • [10] Glottal Doppler Radar System and Its Applications to Communication and Speaker Recognition
    Chang, Chia-Chan
    Lin, Chien-San
    Chang, Sheng-Fuh
    Lin, Chun-Chi
    Jiang, Zhen-Qiang
    Yu, Sung-Nien
    2011 8TH EUROPEAN RADAR CONFERENCE, 2011, : 373 - 376