Extraction of Glottal Features for Speaker Recognition

被引:0
|
作者
Ostrogonac, Stevan [1 ]
Secujski, Milan [1 ]
Knezevic, Dragan [1 ]
Suzic, Sinisa [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad 21000, Serbia
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an extension to the SEDREAMS algorithm for extracting the information about glottal opening and glottal closure instants (GCI and GOI) directly from the speech signal. Accurate detection of GCIs and GOIs is crucial for estimating the glottal features which are to be used in speaker recognition systems. Many different approaches resulted in a variety of algorithms dealing with this problem. The algorithm that showed the best results so far consists of two steps. First, a mean-based signal is computed to determine the intervals in which the GCI and GOI moments should be searched for. Then, discontinuities are sought in the LP residual of the speech signal and they represent the estimation of glottal events. This algorithm (in literature found under the name SEDREAMS) is widely used in glottal excitation estimation systems. However, the mean-based signal calculated in the first step of the process sometimes contains unwanted spectral components which significantly degrade the performances. This paper describes one way to address this problem. By applying an adaptive filter to the mean-based signal significant improvement has been achieved in glottal features estimation. This was confirmed by a speaker recognition experiment which showed very encouraging results.
引用
收藏
页码:369 / 373
页数:5
相关论文
共 50 条
  • [31] Fusion of acoustic and tokenization features for speaker recognition
    Tong, Rong
    Ma, Bin
    Lee, Kong-Aik
    You, Changhuai
    Zhu, Donglai
    Kinnunen, Tomi
    Sun, Hanwu
    Dong, Minghui
    Chng, Eng-Siong
    Li, Haizhou
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 566 - +
  • [32] PARALLEL TRANSFORMATION NETWORK FEATURES FOR SPEAKER RECOGNITION
    Abad, Alberto
    Luque, Jordi
    Trancoso, Isabel
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5300 - 5303
  • [33] Speaker Recognition using Spectral Dimension Features
    Chen, Wen-Shiung
    Huang, Jr-Feng
    2009 FOURTH INTERNATIONAL MULTI-CONFERENCE ON COMPUTING IN THE GLOBAL INFORMATION TECHNOLOGY (ICCGI 2009), 2009, : 132 - 137
  • [34] Long span prosodic features for speaker recognition
    Zhang, Jianping
    Li, Ming
    Suo, Hongbin
    Yang, Lin
    Fu, Qiang
    Yan, Yonghong
    Shengxue Xuebao/Acta Acustica, 2010, 35 (02): : 267 - 269
  • [35] On the use of complementary spectral features for speaker recognition
    Hosseinzadeh, Danoush
    Krishnan, Sridhar
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
  • [36] FM Features for Automatic Forensic Speaker Recognition
    Thiruvaran, Tharmarajah
    Ambikairajah, Eliathamby
    Epps, Julien
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1497 - 1500
  • [37] SURVEY AND EVALUATION OF ACOUSTIC FEATURES FOR SPEAKER RECOGNITION
    Lawson, A.
    Vabishchevich, P.
    Huggins, M.
    Ardis, P.
    Battles, B.
    Stauffer, A.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5444 - 5447
  • [38] Looking for relevant features for speaker role recognition
    IRIT, Unversité de Toulouse, 118 route de Narbonne, F-31062 Toulouse Cedex 9, France
    Proc. Annu. Conf. Int. Speech Commun. Assoc., INTERSPEECH, (1057-1060):
  • [39] Speaker recognition using prosodic and lexical features
    Kajarekar, S
    Ferrer, L
    Venkataraman, A
    Sonmez, K
    Shriberg, E
    Stolcke, A
    Bratt, H
    Gadde, RR
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 19 - 24
  • [40] Integration of complementary acoustic features for speaker recognition
    Zheng, Nengheng
    Lee, Tan
    Ching, P. C.
    IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (03) : 181 - 184