A Speaker Identification System using MFCC Features with VQ Technique

被引：22

作者：

Zulfiqar, Ali ^{[1
]}

Muhammad, Aslam ^{[2
]}

Enriquez A M, Martinez ^{[3
]}

机构：

[1] UoG, Dept CS & IT, Gujrat, Pakistan

[2] UET, Dept CS & E, Lahore, Pakistan

[3] CINVESTAV, IPN, Dept CS, Ciudad De Mexico, DF, Mexico

来源：

2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS | 2009年

关键词：

RECOGNITION;

D O I：

10.1109/IITA.2009.420

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The performance of speaker identification systems has improved due to recent advances in speech processing techniques but there is still need of improvement in term of text-independent speaker identification and suitable modelling techniques for voice feature vectors. It becomes difficult for person to recognize a voice when an uncontrollable noise adds in to it. In this paper, feature vectors from speech are extracted by using Mel-Frequency Cepstral Coefficients and Vector Quantization technique is implemented through Linde-Buzo-Gray algorithm. Two purposeful speech databases with added noise, recorded at sampling frequencies 8000 Hz and 11025 Hz, are used to check the accuracy of the developed speaker identification system in non-ideal conditions. An analysis is also provided by performing different experiments on the databases that number of vectors in VQ codebook and sampling frequency influence the identification accuracy significantly.

引用

页码：115 / +

页数：2

共 50 条

[21] Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
Visalakshi, R.
Dhanalakshmi, P.
Palanivel, S.
COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 35 - 41
[22] Speaker identification using the VQ-based discriminative kernels
Lei, ZC
Yang, YC
Wu, ZH
AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 797 - 803
[23] Efficient Window for Monolingual and Crosslingual Speaker Identification using MFCC
Nagaraja, B. G.
Jayanna, H. S.
PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
[24] An ASR System using MFCC and VQ/GMM with Emphasis on Environmental Dependency
Barai, Bidhan
Das, Debayan
Das, Nibaran
Basu, Subhadip
Nasipuri, Mita
2017 IEEE CALCUTTA CONFERENCE (CALCON), 2017, : 362 - 366
[25] ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score
Bharath K P
Rajesh Kumar M
Multimedia Tools and Applications, 2020, 79 : 28859 - 28883
[26] Music Identification Using Pitch Histogram and MFCC-VQ Dynamic Pattern
Park, Chuleui
Park, Mansoo
Kim, Sungtak
Kim, Hoirin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (03): : 178 - 185
[27] ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score
Bharath, K. P.
Kumar, Rajesh M.
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 28859 - 28883
[28] Usoidal model based speaker identification using VQ and DHMM
Senthil, RG
Dandapat, S
PROCEEDINGS OF THE IEEE INDICON 2004, 2004, : 338 - 343
[29] An FPGA based VQ for speaker identification
Elmisery, FA
Khaleil, AH
Salama, AE
El-Geldawi, F
17th ICM 2005: 2005 International Conference on Microelectronics, Proceedings, 2005, : 130 - 132
[30] Modelling a Voice Activated Speaker Identification System using MFCC-Pitch-Formant Vector
Avik Sengupta
Rabindranath Ghosh
Journal of The Institution of Engineers (India): Series B, 2012, 93 (1) : 51 - 56

← 1 2 3 4 5 →