Optimization of Gabor features for text-independent speaker identification

被引：3

作者：

Mildner, Volker ^{[1
]}

Goetze, Stefan ^{[1
]}

Kammeyer, Karl-Dirk ^{[1
]}

Mertins, Alfred ^{[2
]}

机构：

[1] Univ Bremen, Dept Commun Engn, D-28334 Bremen, Germany

[2] Carl von Ossietzky Univ Oldenburg, Signal Proc Grp, D-26111 Oldenburg, Germany

来源：

2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11 | 2007年

关键词：

D O I：

10.1109/ISCAS.2007.378660

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For text-independent speaker identification a prominent combination is to use Gaussian Mixture Models (GMM) for classification while relying on Mel-Frequency Cepstral Coefficients (MFCC) as features. To take temporal information into account the time difference of features of adjacent speech frames are appended to the initial features. In this paper we investigate the applicability of spectro-temporal features obtained from Gabor-Filters and present an algorithm for optimizing the possible parameters. Simulation results on a database show that spectro-temporal features achieve higher recognition rates than purely temporal features for clean speech as well as for disturbed speech.

引用

页码：3932 / +

页数：2

共 50 条

[31] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
REYNOLDS, DA
ROSE, RC
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01): : 72 - 83
[32] Text-independent speaker identification based on support vector machines
He, Xin
Liu, Chongqing
Li, Jiegu
Jisuanji Gongcheng/Computer Engineering, 2000, 26 (06): : 61 - 63
[33] A real-time text-independent speaker identification system
Cordella, LP
Foggia, P
Sansone, C
Vento, M
12TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2003, : 632 - 637
[34] Text-independent speaker identification using robust statistics estimation
El Ayadi, Moataz
Hassan, Abdel-Karim S. O.
Abdel-Naby, Ahmed
Elgendy, Omar A.
SPEECH COMMUNICATION, 2017, 92 : 52 - 63
[35] Wavelet entropy and neural network for text-independent speaker identification
Daqrouq, Khaled
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (05) : 796 - 802
[36] A two-level classifier for text-independent speaker identification
Hadjitodorov, S
Boyanov, B
Dalakchieva, N
SPEECH COMMUNICATION, 1997, 21 (03) : 209 - 217
[37] HCRF-UBM approach for text-independent speaker identification
Hong, Wei-Tyng
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (05) : 1120 - 1127
[38] A robust wavelet-based text-independent speaker identification
Phung Trung Nghia
Pham Viet Binh
Nguyen Huu Thai
Nguyen Thanh Ha
Kumsawat, Prayoth
ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 219 - 223
[39] I-vector Based Text-Independent Speaker Identification
Liu, Tingting
Kang, Kai
Guan, Shengxiao
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5420 - 5425
[40] Text-Independent Speaker Identification Using the Histogram Transform Model
Ma, Zhanyu
Yu, Hong
Tan, Zheng-Hua
Guo, Jun
IEEE ACCESS, 2016, 4 : 9733 - 9739

← 1 2 3 4 5 →