A new genetically optimized GMM for speaker recognition

被引:0
|
作者
Lin, Lin [1 ]
Wang, Shuxun [1 ]
机构
[1] Jilin Univ, Dept Commun Engn, Changchun, Jilin Prov, Peoples R China
关键词
Gaussian mixture models; speaker recognition; niche hybrid genetic algorithms;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The traditional training method of Gaussian mixture model is sensitive to the initial model parameters, and easy to lead to a sub-optimal model in practice. To resolve this problem, it utilized the niche hybrid genetic algorithms (NHGA) to find the optimum model parameters. It provided a new architecture of hybrid algorithms, which organically merged the niche techniques and maximum likelihood (ML) algorithm into GA. It used the niche techniques to make the exploration capabilities of GA stronger, and the ML algorithm to make the exploitation capabilities of GA more powerful. Besides, it used a heuristic updating strategy to control the GA mixture crossover rate P-c and mutation rate P-m. Experiments were based on an independent speaker recognition system. The results from PKU-SRSC database show that this method can obtain more optimum GMM parameters and better results than the traditional and the improved GMM for speaker recognition.
引用
收藏
页码:704 / 704
页数:1
相关论文
共 50 条
  • [41] Optimized One-Bit Quantization for Adapted GMM-Based Speaker Verification
    Tseng, Ivy H.
    Verscheure, Olivier
    Turaga, Deepak S.
    Chaudhari, Upendra V.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2061 - +
  • [42] Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM
    Nakagawa, S
    Zhang, W
    Takahashi, M
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 81 - 84
  • [43] Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
    Nirmalya Sen
    Md Sahidullah
    Hemant A. Patil
    Shyamal Kumar Das Mandal
    Krothapalli Sreenivasa Rao
    Tapan Kumar Basu
    International Journal of Speech Technology, 2021, 24 : 1067 - 1088
  • [44] Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
    Sen, Nirmalya
    Sahidullah, Md
    Patil, Hemant A.
    Das Mandal, Shyamal Kumar
    Rao, Krothapalli Sreenivasa
    Basu, Tapan Kumar
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (04) : 1067 - 1088
  • [45] An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition
    You, Chang Huai
    Lee, Kong Aik
    Li, Haizhou
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) : 49 - 52
  • [46] GMM-UBM Modeling for Speaker Recognition on a Romanian Large Speech Corpora
    Georgescu, Alexandru-Lucian
    Cucu, Horia
    2018 12TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2018, : 547 - 551
  • [47] Comparison of the impact of some Minkowski metrics on VQ/GMM based speaker recognition
    Hanilci, Cemal
    Ertas, Figen
    COMPUTERS & ELECTRICAL ENGINEERING, 2011, 37 (01) : 41 - 56
  • [48] SVM AGAINST GMM/SVM FOR DIALECT INFLUENCE ON AUTOMATIC SPEAKER RECOGNITION TASK
    Zergat, Kawthar
    Amrouche, Abderrahmane
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2014, 13 (02)
  • [49] GMM-based Bhattacharyya kernel Fisher Discriminant Analysis for speaker recognition
    Chao, YH
    Wang, HM
    Chang, RC
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 649 - 652
  • [50] Optimized Active Learning Strategy for Audiovisual Speaker Recognition
    Karlos, Stamatis
    Kaleris, Konstantinos
    Fazakis, Nikos
    Kanas, Vasileios G.
    Kotsiantis, Sotiris
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 281 - 290