Optimization of Gabor features for text-independent speaker identification

被引:3
|
作者
Mildner, Volker [1 ]
Goetze, Stefan [1 ]
Kammeyer, Karl-Dirk [1 ]
Mertins, Alfred [2 ]
机构
[1] Univ Bremen, Dept Commun Engn, D-28334 Bremen, Germany
[2] Carl von Ossietzky Univ Oldenburg, Signal Proc Grp, D-26111 Oldenburg, Germany
关键词
D O I
10.1109/ISCAS.2007.378660
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For text-independent speaker identification a prominent combination is to use Gaussian Mixture Models (GMM) for classification while relying on Mel-Frequency Cepstral Coefficients (MFCC) as features. To take temporal information into account the time difference of features of adjacent speech frames are appended to the initial features. In this paper we investigate the applicability of spectro-temporal features obtained from Gabor-Filters and present an algorithm for optimizing the possible parameters. Simulation results on a database show that spectro-temporal features achieve higher recognition rates than purely temporal features for clean speech as well as for disturbed speech.
引用
收藏
页码:3932 / +
页数:2
相关论文
共 50 条
  • [31] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
    REYNOLDS, DA
    ROSE, RC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01): : 72 - 83
  • [32] Text-independent speaker identification based on support vector machines
    He, Xin
    Liu, Chongqing
    Li, Jiegu
    Jisuanji Gongcheng/Computer Engineering, 2000, 26 (06): : 61 - 63
  • [33] A real-time text-independent speaker identification system
    Cordella, LP
    Foggia, P
    Sansone, C
    Vento, M
    12TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2003, : 632 - 637
  • [34] Text-independent speaker identification using robust statistics estimation
    El Ayadi, Moataz
    Hassan, Abdel-Karim S. O.
    Abdel-Naby, Ahmed
    Elgendy, Omar A.
    SPEECH COMMUNICATION, 2017, 92 : 52 - 63
  • [35] Wavelet entropy and neural network for text-independent speaker identification
    Daqrouq, Khaled
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (05) : 796 - 802
  • [36] A two-level classifier for text-independent speaker identification
    Hadjitodorov, S
    Boyanov, B
    Dalakchieva, N
    SPEECH COMMUNICATION, 1997, 21 (03) : 209 - 217
  • [37] HCRF-UBM approach for text-independent speaker identification
    Hong, Wei-Tyng
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (05) : 1120 - 1127
  • [38] A robust wavelet-based text-independent speaker identification
    Phung Trung Nghia
    Pham Viet Binh
    Nguyen Huu Thai
    Nguyen Thanh Ha
    Kumsawat, Prayoth
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 219 - 223
  • [39] I-vector Based Text-Independent Speaker Identification
    Liu, Tingting
    Kang, Kai
    Guan, Shengxiao
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5420 - 5425
  • [40] Text-Independent Speaker Identification Using the Histogram Transform Model
    Ma, Zhanyu
    Yu, Hong
    Tan, Zheng-Hua
    Guo, Jun
    IEEE ACCESS, 2016, 4 : 9733 - 9739