SVM AGAINST GMM/SVM FOR DIALECT INFLUENCE ON AUTOMATIC SPEAKER RECOGNITION TASK

被引:1
|
作者
Zergat, Kawthar [1 ]
Amrouche, Abderrahmane [1 ]
机构
[1] USTHB, Speech Commun & Signal Proc Lab LCPTS, Fac Elect & Comp Sci, Bab Ezzouar 16111, Algeria
关键词
Speaker verification; support vector machines; GMM/SVM; dialect effect; PCA;
D O I
10.1142/S1469026814500126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A big deal for current research on automatic speaker recognition is the er effectiveness of the speaker modeling techniques for the talkers, because they have their own speaking style, depending on their specific accents and dialects. This paper investigates on the influence of the dialect and the size of database on the text independent speaker verification task using the SVM and the hybrid GMM/SVM speaker modeling. The Principal Component Analysis (PCA) technique is used in the front-end part of the speaker recognition system, in order to extract the most representative features. Experimental results show that the size of database has an important impact on the SVM and GMM/SVM based speaker verification performances, while the dialect has no significant effect. Applying PCA dimensionality reduction improves the recognition accuracy for both SVM and GMM/SVM based recognition systems. However, it did not give an obvious observation about the dialect effect.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A Method to Integrate GMM, SVM and DTW for Speaker Recognition
    Ding, Ing-Jr
    Yen, Chih-Ta
    Ou, Da-Cheng
    INTERNATIONAL JOURNAL OF ENGINEERING AND TECHNOLOGY INNOVATION, 2014, 4 (01) : 38 - 47
  • [2] New scheme based on GMM-PCA-SVM modelling for automatic speaker recognition
    Zergat, Kawthar
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 373 - 381
  • [3] Robust PCA-GMM-SVM System for Speaker Verification Task
    Zergat, Kawthar Yasmine
    Amrouche, Abderrahmane
    Asbai, Nassim
    Debyeche, Mohamed
    8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 214 - 217
  • [4] A GMM SUPERVECTOR KERNEL WITH THE BHATTACHARYYA DISTANCE FOR SVM BASED SPEAKER RECOGNITION
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4221 - 4224
  • [5] Dialect Recognition Using a Phone-GMM-Supervector-Based SVM Kernel
    Biadsy, Fadi
    Hirschberg, Julia
    Collins, Michael
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 753 - +
  • [6] Performances Evaluation of GMM-UBM and GMM-SVM for Speaker Recognition in Realistic World
    Asbai, Nassim
    Amrouche, Abderrahmane
    Debyeche, Mohamed
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 284 - 291
  • [7] A new hybrid GMM/SVM for speaker verification
    Liu, Minghui
    Xie, Yanlu
    Yao, Zhiqiang
    Dai, Beiqian
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 314 - +
  • [8] A hybrid GMM/SVM approach to speaker identification
    Fine, S
    Navrátil, J
    Gopinath, RA
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 417 - 420
  • [9] A Robust SVM/GMM Classifier for Speaker Verification
    Cirovic, Zoran
    Cirovic, Natasa
    SPEECH AND COMPUTER, 2014, 8773 : 74 - 80
  • [10] An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 49 - 52