An Improved Approach for Text-Independent Speaker Recognition

被引:1
|
作者
Chakroun, Rania [1 ,4 ]
Zouari, Leila Beltaifa [1 ,3 ]
Frikha, Mondher [1 ,2 ]
机构
[1] Natl Sch Elect & Telecommun Sfax, ATMS Res Unit, Sfax, Tunisia
[2] Natl Sch Elect & Telecommun Sfax, Sfax, Tunisia
[3] Natl Sch Engn Sousse, Sousse, Tunisia
[4] Natl Sch Engn Sfax, Sfax, Tunisia
关键词
GMM; speaker verification; speaker recognition; speaker identification;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents new Speaker Identification and Speaker Verification systems based on the use of new feature vectors extracted from the speech signal. The proposed structure combine between the most successful Mel Frequency Cepstral Coefficients and new features which are the Short Time Zero Crossing Rate of the signal. A comparison between speaker recognition systems based on Gaussian mixture models using the well known Mel Frequency Cepstral Coefficients and the novel systems based on the use of a combination between both reduced Mel Frequency Cepstral Coefficients features vectors and Short Time Zero Crossing Rate features is given. This comparison proves that the use of the new reduced feature vectors help to improve the system's performance and also help to reduce the time and memory complexity of the system which is required for realistic applications that suffer from computational resource limitation. The experiments were performed on speakers from TIMIT database for different training durations. The suggested systems performances are evaluated against the baseline systems. The increase of the proposed systems performances are well observed for identification experiments and the decrease of Equal Error Rates are also remarkable for verification experiments. Experimental results demonstrate the effectiveness of the new approach which avoids the use of more complex algorithms or the combination of different approaches requiring lengthy calculation.
引用
收藏
页码:343 / 348
页数:6
相关论文
共 50 条
  • [31] TLS-NAP algorithm for text-independent speaker recognition
    He, Liang
    Yang, Yi
    Liu, Jia
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2012, 25 (06): : 916 - 921
  • [32] A TEXT-INDEPENDENT SPEAKER RECOGNITION SYSTEM BASED ON VOWEL SPOTTING
    FAKOTAKIS, N
    TSOPANOGLOU, A
    KOKKINAKIS, G
    SPEECH COMMUNICATION, 1993, 12 (01) : 57 - 68
  • [33] Data-Model Relationship in Text-Independent Speaker Recognition
    John S. D. Mason
    Nicholas W. D. Evans
    Robert Stapert
    Roland Auckenthaler
    EURASIP Journal on Advances in Signal Processing, 2005
  • [34] Adaptive fuzzy wavelet algorithm for text-independent speaker recognition
    Lung, SY
    PATTERN RECOGNITION, 2004, 37 (10) : 2095 - 2096
  • [35] Investigation of the effect of data duration and speaker gender on text-independent speaker recognition
    Hanilci, Cemal
    Ertas, Figen
    COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (02) : 441 - 452
  • [36] Spin-Image Descriptors for Text-Independent Speaker Recognition
    Mohammed, Suhaila N.
    Jabir, Adnan J.
    Abbas, Zaid Ali
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 216 - 226
  • [37] FREQUENCY AND TEMPORAL CONVOLUTIONAL ATTENTION FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
    Yadav, Sarthak
    Rai, Atul
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6794 - 6798
  • [38] A Multiscale Feature Extraction Method for Text-independent Speaker Recognition
    Chen Zhigao
    Li Peng
    Xiao Runqiu
    Li Ta
    Wang Wenchao
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (11) : 3266 - 3271
  • [39] Ensemble of Support Vector Machine for Text-Independent Speaker Recognition
    Lei, Zhenchun
    Yang, Yingchun
    Wu, Zhaohui
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (5A): : 163 - 167
  • [40] Data-model relationship in text-independent speaker recognition
    Mason, JSD
    Evans, NWD
    Stapert, R
    Auckenthaler, R
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (04) : 471 - 481