A Speaker Identification System using MFCC Features with VQ Technique

被引:22
|
作者
Zulfiqar, Ali [1 ]
Muhammad, Aslam [2 ]
Enriquez A M, Martinez [3 ]
机构
[1] UoG, Dept CS & IT, Gujrat, Pakistan
[2] UET, Dept CS & E, Lahore, Pakistan
[3] CINVESTAV, IPN, Dept CS, Ciudad De Mexico, DF, Mexico
关键词
RECOGNITION;
D O I
10.1109/IITA.2009.420
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of speaker identification systems has improved due to recent advances in speech processing techniques but there is still need of improvement in term of text-independent speaker identification and suitable modelling techniques for voice feature vectors. It becomes difficult for person to recognize a voice when an uncontrollable noise adds in to it. In this paper, feature vectors from speech are extracted by using Mel-Frequency Cepstral Coefficients and Vector Quantization technique is implemented through Linde-Buzo-Gray algorithm. Two purposeful speech databases with added noise, recorded at sampling frequencies 8000 Hz and 11025 Hz, are used to check the accuracy of the developed speaker identification system in non-ideal conditions. An analysis is also provided by performing different experiments on the databases that number of vectors in VQ codebook and sampling frequency influence the identification accuracy significantly.
引用
收藏
页码:115 / +
页数:2
相关论文
共 50 条
  • [21] Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
    Visalakshi, R.
    Dhanalakshmi, P.
    Palanivel, S.
    COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 35 - 41
  • [22] Speaker identification using the VQ-based discriminative kernels
    Lei, ZC
    Yang, YC
    Wu, ZH
    AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 797 - 803
  • [23] Efficient Window for Monolingual and Crosslingual Speaker Identification using MFCC
    Nagaraja, B. G.
    Jayanna, H. S.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
  • [24] An ASR System using MFCC and VQ/GMM with Emphasis on Environmental Dependency
    Barai, Bidhan
    Das, Debayan
    Das, Nibaran
    Basu, Subhadip
    Nasipuri, Mita
    2017 IEEE CALCUTTA CONFERENCE (CALCON), 2017, : 362 - 366
  • [25] ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score
    Bharath K P
    Rajesh Kumar M
    Multimedia Tools and Applications, 2020, 79 : 28859 - 28883
  • [26] Music Identification Using Pitch Histogram and MFCC-VQ Dynamic Pattern
    Park, Chuleui
    Park, Mansoo
    Kim, Sungtak
    Kim, Hoirin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (03): : 178 - 185
  • [27] ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score
    Bharath, K. P.
    Kumar, Rajesh M.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 28859 - 28883
  • [28] Usoidal model based speaker identification using VQ and DHMM
    Senthil, RG
    Dandapat, S
    PROCEEDINGS OF THE IEEE INDICON 2004, 2004, : 338 - 343
  • [29] An FPGA based VQ for speaker identification
    Elmisery, FA
    Khaleil, AH
    Salama, AE
    El-Geldawi, F
    17th ICM 2005: 2005 International Conference on Microelectronics, Proceedings, 2005, : 130 - 132
  • [30] Modelling a Voice Activated Speaker Identification System using MFCC-Pitch-Formant Vector
    Avik Sengupta
    Rabindranath Ghosh
    Journal of The Institution of Engineers (India): Series B, 2012, 93 (1) : 51 - 56