A two-level classifier for text-independent speaker identification

被引:5
|
作者
Hadjitodorov, S
Boyanov, B
Dalakchieva, N
机构
[1] Ctrl. Lab. of Biomedical Engineering, Bulgarian Academy of Sciences, 1113 Sofia, Acad. G. Bonchev Str.
关键词
speaker identification; neural networks; self-organizing map; MLP network; two-level classifier;
D O I
10.1016/S0167-6393(97)00004-6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A two-level scheme for speaker identification is proposed. The first classifier level is based on the self-organizing map (SOM) of Kohonen. LPCC coefficients are used as input vectors for this classifier. LPCC coefficients are passed again through the already trained SOMs and as result the prototype distribution maps (PDMs) are obtained. The PDMs are the input for the second classifier level. The second level consists of multilayer perceptron (MLP) networks for each speaker. The first level of the classifier is a preprocessing procedure for the second level, where the final classification is made. The goal of the proposed approach is to combine the advantages of the two type of networks into one classification scheme in order to achieve higher identification accuracy. The experiments show an increased accuracy of the proposed two-level classifier, especially in the case of noise-corrupted signals.
引用
收藏
页码:209 / 217
页数:9
相关论文
共 50 条
  • [31] Combining Dynamic Features with MFCC for Text-independent Speaker Identification
    Chaudhari, Amol
    Rahulkar, Amol
    Dhonde, S. B.
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICIP), 2015, : 160 - 164
  • [32] Text-independent speaker identification using robust statistics estimation
    El Ayadi, Moataz
    Hassan, Abdel-Karim S. O.
    Abdel-Naby, Ahmed
    Elgendy, Omar A.
    SPEECH COMMUNICATION, 2017, 92 : 52 - 63
  • [33] Wavelet entropy and neural network for text-independent speaker identification
    Daqrouq, Khaled
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (05) : 796 - 802
  • [34] HCRF-UBM approach for text-independent speaker identification
    Hong, Wei-Tyng
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (05) : 1120 - 1127
  • [35] Text-Independent Speaker Identification by Combining MFCC and MVA Features
    Korba, Mohamed Cherif Amara
    Bourouba, Houcine
    Rafik, Djemili
    2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
  • [36] A robust wavelet-based text-independent speaker identification
    Phung Trung Nghia
    Pham Viet Binh
    Nguyen Huu Thai
    Nguyen Thanh Ha
    Kumsawat, Prayoth
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 219 - 223
  • [37] I-vector Based Text-Independent Speaker Identification
    Liu, Tingting
    Kang, Kai
    Guan, Shengxiao
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 5420 - 5425
  • [38] Text-Independent Speaker Identification Using the Histogram Transform Model
    Ma, Zhanyu
    Yu, Hong
    Tan, Zheng-Hua
    Guo, Jun
    IEEE ACCESS, 2016, 4 : 9733 - 9739
  • [39] Robust text-independent speaker identification over telephone channels
    Murthy, HA
    Beaufays, F
    Heck, LP
    Weintraub, M
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 554 - 568
  • [40] Text-independent speaker identification based on spectral weighting functions
    Ma, JY
    Gao, W
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 267 - 272