A two-level classifier for text-independent speaker identification

被引：5

作者：

Hadjitodorov, S

Boyanov, B

Dalakchieva, N

机构：

[1] Ctrl. Lab. of Biomedical Engineering, Bulgarian Academy of Sciences, 1113 Sofia, Acad. G. Bonchev Str.

来源：

SPEECH COMMUNICATION | 1997年 / 21卷 / 03期

关键词：

speaker identification; neural networks; self-organizing map; MLP network; two-level classifier;

D O I：

10.1016/S0167-6393(97)00004-6

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A two-level scheme for speaker identification is proposed. The first classifier level is based on the self-organizing map (SOM) of Kohonen. LPCC coefficients are used as input vectors for this classifier. LPCC coefficients are passed again through the already trained SOMs and as result the prototype distribution maps (PDMs) are obtained. The PDMs are the input for the second classifier level. The second level consists of multilayer perceptron (MLP) networks for each speaker. The first level of the classifier is a preprocessing procedure for the second level, where the final classification is made. The goal of the proposed approach is to combine the advantages of the two type of networks into one classification scheme in order to achieve higher identification accuracy. The experiments show an increased accuracy of the proposed two-level classifier, especially in the case of noise-corrupted signals.

引用

页码：209 / 217

页数：9

共 50 条

[31] Combining Dynamic Features with MFCC for Text-independent Speaker Identification
Chaudhari, Amol
Rahulkar, Amol
Dhonde, S. B.
2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICIP), 2015, : 160 - 164
[32] Text-independent speaker identification using robust statistics estimation
El Ayadi, Moataz
Hassan, Abdel-Karim S. O.
Abdel-Naby, Ahmed
Elgendy, Omar A.
SPEECH COMMUNICATION, 2017, 92 : 52 - 63
[33] Wavelet entropy and neural network for text-independent speaker identification
Daqrouq, Khaled
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (05) : 796 - 802
[34] HCRF-UBM approach for text-independent speaker identification
Hong, Wei-Tyng
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (05) : 1120 - 1127
[35] Text-Independent Speaker Identification by Combining MFCC and MVA Features
Korba, Mohamed Cherif Amara
Bourouba, Houcine
Rafik, Djemili
2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
[36] A robust wavelet-based text-independent speaker identification
Phung Trung Nghia
Pham Viet Binh
Nguyen Huu Thai
Nguyen Thanh Ha
Kumsawat, Prayoth
ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 219 - 223
[37] I-vector Based Text-Independent Speaker Identification
Liu, Tingting
Kang, Kai
Guan, Shengxiao
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5420 - 5425
[38] Text-Independent Speaker Identification Using the Histogram Transform Model
Ma, Zhanyu
Yu, Hong
Tan, Zheng-Hua
Guo, Jun
IEEE ACCESS, 2016, 4 : 9733 - 9739
[39] Robust text-independent speaker identification over telephone channels
Murthy, HA
Beaufays, F
Heck, LP
Weintraub, M
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 554 - 568
[40] Text-independent speaker identification based on spectral weighting functions
Ma, JY
Gao, W
AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 267 - 272

← 1 2 3 4 5 →