Optimization of Gabor features for text-independent speaker identification

被引:3
|
作者
Mildner, Volker [1 ]
Goetze, Stefan [1 ]
Kammeyer, Karl-Dirk [1 ]
Mertins, Alfred [2 ]
机构
[1] Univ Bremen, Dept Commun Engn, D-28334 Bremen, Germany
[2] Carl von Ossietzky Univ Oldenburg, Signal Proc Grp, D-26111 Oldenburg, Germany
关键词
D O I
10.1109/ISCAS.2007.378660
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For text-independent speaker identification a prominent combination is to use Gaussian Mixture Models (GMM) for classification while relying on Mel-Frequency Cepstral Coefficients (MFCC) as features. To take temporal information into account the time difference of features of adjacent speech frames are appended to the initial features. In this paper we investigate the applicability of spectro-temporal features obtained from Gabor-Filters and present an algorithm for optimizing the possible parameters. Simulation results on a database show that spectro-temporal features achieve higher recognition rates than purely temporal features for clean speech as well as for disturbed speech.
引用
收藏
页码:3932 / +
页数:2
相关论文
共 50 条
  • [41] Robust text-independent speaker identification over telephone channels
    Murthy, HA
    Beaufays, F
    Heck, LP
    Weintraub, M
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 554 - 568
  • [42] Text-independent speaker identification based on spectral weighting functions
    Ma, JY
    Gao, W
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 267 - 272
  • [43] Principal Component Based Classification for Text-Independent Speaker Identification
    Hanilci, Cemal
    Ertas, Figen
    2009 FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, COMPUTING WITH WORDS AND PERCEPTIONS IN SYSTEM ANALYSIS, DECISION AND CONTROL, 2010, : 39 - 42
  • [44] Text-independent speaker identification utilizing likelihood normalization technique
    Toyohashi Univ of Technology, Toyohashi-shi, Japan
    IEICE Trans Inf Syst, 5 (585-593):
  • [45] Text-independent speaker identification utilizing likelihood normalization technique
    Markov, KP
    Nakagawa, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (05) : 585 - 593
  • [46] Neural network clustering technique for text-independent speaker identification
    Nossair, Zaki B.
    Zahorian, Stephen A.
    Artificial Neural Networks in Engineering - Proceedings (ANNIE'94), 1994, 4 : 453 - 459
  • [47] Robust text-independent speaker identification using bispectrum slice
    Özkurt, TE
    Akgül, T
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 418 - 421
  • [48] Text-independent speaker verification using ant colony optimization-based selected features
    Nemati, Shahla
    Basiri, Mohammad Ehsan
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (01) : 620 - 630
  • [49] Text-independent speaker identification based on MAP channel compensation and pitch-dependent features
    Han, Jiqing
    Gao, Rongchun
    World Academy of Science, Engineering and Technology, 2009, 39 : 659 - 665
  • [50] A tutorial on text-independent speaker verification
    Bimbot, F. (bimbot@irisa.fr), 1600, Hindawi Publishing Corporation (2004):