An Improved Approach for Text-Independent Speaker Recognition

Cited by: 1
Authors
Chakroun, Rania [1 ,4 ]
Zouari, Leila Beltaifa [1 ,3 ]
Frikha, Mondher [1 ,2 ]
Affiliations
[1] Natl Sch Elect & Telecommun Sfax, ATMS Res Unit, Sfax, Tunisia
[2] Natl Sch Elect & Telecommun Sfax, Sfax, Tunisia
[3] Natl Sch Engn Sousse, Sousse, Tunisia
[4] Natl Sch Engn Sfax, Sfax, Tunisia
Keywords
GMM; speaker verification; speaker recognition; speaker identification
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper presents new speaker identification and speaker verification systems based on new feature vectors extracted from the speech signal. The proposed structure combines the widely used Mel Frequency Cepstral Coefficients (MFCC) with a new feature, the Short Time Zero Crossing Rate of the signal. Speaker recognition systems based on Gaussian mixture models using the well-known MFCC features are compared with the novel systems, which combine reduced MFCC feature vectors with Short Time Zero Crossing Rate features. The comparison shows that the new reduced feature vectors improve system performance and also reduce the system's time and memory complexity, which is required for realistic applications with limited computational resources. The experiments were performed on speakers from the TIMIT database for different training durations. The suggested systems' performance is evaluated against the baseline systems. The proposed systems show a clear performance gain in the identification experiments and a notable decrease in Equal Error Rate in the verification experiments. Experimental results demonstrate the effectiveness of the new approach, which avoids more complex algorithms or combinations of different approaches that require lengthy calculation.
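As a rough illustration of the feature combination described in the abstract, the sketch below computes a short-time zero crossing rate per frame and appends it as one extra column to a matrix of MFCC frames. It is not the authors' implementation: the function names, the 400-sample frame and 160-sample hop (25 ms and 10 ms at TIMIT's 16 kHz sampling rate), and the handling of exact zeros are all assumptions, and the MFCC matrix is taken as given from any front end.

```python
import numpy as np

def short_time_zcr(signal, frame_len=400, hop=160):
    """Fraction of adjacent-sample sign changes in each frame.

    frame_len and hop are illustrative defaults, not values from the paper.
    The signal must contain at least frame_len samples.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop
    zcr = np.empty(n_frames)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len]
        signs = np.sign(frame)
        signs[signs == 0] = 1  # assumption: treat exact zeros as positive
        zcr[i] = np.mean(signs[1:] != signs[:-1])
    return zcr

def append_zcr(mfcc, zcr):
    """Append ZCR as an extra column to a (frames x coefficients) MFCC matrix.

    Both inputs are truncated to the shorter frame count so they line up.
    """
    mfcc = np.asarray(mfcc, dtype=float)
    zcr = np.asarray(zcr, dtype=float)
    n = min(len(mfcc), len(zcr))
    return np.hstack([mfcc[:n], zcr[:n, None]])
```

The combined matrix could then be used to train one Gaussian mixture model per speaker, as in an MFCC-only baseline. How many MFCC coefficients the reduced vectors keep is not stated in the abstract, so that choice is left to the caller.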
Pages: 343-348
Page count: 6
Related papers
50 in total
  • [41] Adaptive Convolutional Neural Network for Text-Independent Speaker Recognition
    Kim, Seong-Hu
    Park, Yong-Hwa
    INTERSPEECH 2021, 2021, : 66 - 70
  • [42] Learning Vector Quantization in text-independent Automatic Speaker Recognition
    Filgueiras, TE
    Messina, RO
    Cabral, EF
    VTH BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, PROCEEDINGS, 1998, : 135 - 139
  • [43] Angular Margin Centroid Loss for Text-independent Speaker Recognition
    Wei, Yuheng
    Du, Junzhao
    Liu, Hui
    INTERSPEECH 2020, 2020, : 3820 - 3824
  • [44] An overview of text-independent speaker recognition: From features to supervectors
    Kinnunen, Tomi
    Li, Haizhou
    SPEECH COMMUNICATION, 2010, 52 (01) : 12 - 40
  • [45] Formant Trajectories in Linguistic Units for Text-Independent Speaker Recognition
    Franco-Pedroso, Javier
    Espinoza-Cuadros, Fernando
    Gonzalez-Rodriguez, Joaquin
    2013 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2013,
  • [46] TEACHER-STUDENT TRAINING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
    Ng, Raymond W. M.
    Liu, Xuechen
    Swietojanski, Pawel
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1044 - 1051
  • [47] Robust features for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (17): : 13863 - 13883
  • [48] Improved spectral subtraction technique for text-independent speaker verification
    Panda, Ashish
    Tripathi, Neha
    Srikanthan, Thambipillai
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 595 - +
  • [49] An Improved Approach to Open Set Text-Independent Speaker Identification (OSTI-SI)
    Chakraborty, ShrutiSarika
    Parekh, Ranjan
    2017 THIRD IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2017, : 51 - 56
  • [50] VQ score normalisation for text-dependent and text-independent speaker recognition
    Finan, RA
    Sapeluk, AT
    Damper, RI
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 211 - 218