Optimization of Gabor features for text-independent speaker identification

Cited by: 3

Authors:

Mildner, Volker [1]

Goetze, Stefan [1]

Kammeyer, Karl-Dirk [1]

Mertins, Alfred [2]

Affiliations:

[1] Univ Bremen, Dept Commun Engn, D-28334 Bremen, Germany

[2] Carl von Ossietzky Univ Oldenburg, Signal Proc Grp, D-26111 Oldenburg, Germany

Source:

2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11 | 2007

DOI:

10.1109/ISCAS.2007.378660

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

For text-independent speaker identification, a prominent combination is to use Gaussian Mixture Models (GMM) for classification while relying on Mel-Frequency Cepstral Coefficients (MFCC) as features. To take temporal information into account, the time differences of features of adjacent speech frames are appended to the initial features. In this paper, we investigate the applicability of spectro-temporal features obtained from Gabor filters and present an algorithm for optimizing the possible parameters. Simulation results on a database show that spectro-temporal features achieve higher recognition rates than purely temporal features for clean speech as well as for disturbed speech.
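The baseline mentioned in the abstract, appending time differences of adjacent frames to the initial features, can be illustrated with a minimal sketch. The function name `append_deltas`, the simple first-order difference, and the zero delta for the first frame are assumptions made for illustration; the record does not specify how the paper computes its difference features.

```python
import numpy as np


def append_deltas(features: np.ndarray) -> np.ndarray:
    """Append frame-to-frame differences to a feature matrix.

    features: array of shape (num_frames, num_coeffs), e.g. one MFCC
    vector per speech frame (hypothetical input layout).
    Returns an array of shape (num_frames, 2 * num_coeffs).
    """
    # First-order difference between adjacent frames. Prepending the first
    # frame makes the delta of frame 0 equal to zero (an assumption).
    deltas = np.diff(features, axis=0, prepend=features[:1])
    # Stack the static features and their deltas side by side.
    return np.hstack([features, deltas])


feats = np.array([[1.0, 2.0], [3.0, 5.0], [6.0, 9.0]])
extended = append_deltas(feats)
```

Each row of `extended` holds the original coefficients followed by their change relative to the previous frame, which is the purely temporal information that the abstract contrasts with spectro-temporal Gabor features.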

Pages: 3932-+

Page count: 2

50 items in total

[41] Robust text-independent speaker identification over telephone channels
Murthy, HA
Beaufays, F
Heck, LP
Weintraub, M
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05) : 554 - 568
[42] Text-independent speaker identification based on spectral weighting functions
Ma, JY
Gao, W
AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 267 - 272
[43] Principal Component Based Classification for Text-Independent Speaker Identification
Hanilci, Cemal
Ertas, Figen
2009 FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, COMPUTING WITH WORDS AND PERCEPTIONS IN SYSTEM ANALYSIS, DECISION AND CONTROL, 2010 : 39 - 42
[44] Text-independent speaker identification utilizing likelihood normalization technique
Toyohashi Univ of Technology, Toyohashi-shi, Japan
IEICE Trans Inf Syst, 5 (585-593):
[45] Text-independent speaker identification utilizing likelihood normalization technique
Markov, KP
Nakagawa, S
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (05) : 585 - 593
[46] Neural network clustering technique for text-independent speaker identification
Nossair, Zaki B.
Zahorian, Stephen A.
Artificial Neural Networks in Engineering - Proceedings (ANNIE'94), 1994, 4 : 453 - 459
[47] Robust text-independent speaker identification using bispectrum slice
Özkurt, TE
Akgül, T
PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004 : 418 - 421
[48] Text-independent speaker verification using ant colony optimization-based selected features
Nemati, Shahla
Basiri, Mohammad Ehsan
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (01) : 620 - 630
[49] Text-independent speaker identification based on MAP channel compensation and pitch-dependent features
Han, Jiqing
Gao, Rongchun
World Academy of Science, Engineering and Technology, 2009, 39 : 659 - 665
[50] A tutorial on text-independent speaker verification
Bimbot, F. (bimbot@irisa.fr), 1600, Hindawi Publishing Corporation (2004):
