A Cutting-Edge Feature Extraction Approach for Speaker Recognition Leveraging Optimized Variance Spectral Flux and Daubechies Wavelet

被引:0
|
作者
Prabha, Chander [1 ]
Kaur, Sukhvinder [2 ]
Malik, Meena [3 ]
Uddin, Mueen [4 ]
Nandan, Durgesh [5 ]
机构
[1] Chitkara Univ, Inst Engn & Technol, Rajpura 140417, Punjab, India
[2] Swami Devi Dyal Inst Engn & Technol, Panchkula 134009, India
[3] Chandigarh Univ, Dept Comp Sci & Engn, Mohali 140413, Punjab, India
[4] Univ Doha Sci & Technol, Coll Comp & Informat Technol, Doha 24449, Qatar
[5] Symbiosis Int Deemed Univ, Symbiosis Inst Technol, Dept Elect & Telecommun Engn, Pune 412115, India
关键词
Daubechies wavelet; Bayesian information criterion; optimized variance spectral flux; mel-frequency cepstral coefficients;
D O I
10.18280/ts.400645
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Speaker Recognition (ASR) is a crucial application in the realm of speech processing, with Artificial Intelligence (AI) being extensively employed in areas such as authentication, surveillance, forensics, and security. The cornerstone processes of these applications encompass feature matching, feature extraction, and performance evaluation. However, the present speaker identification and verification techniques are not without their flaws, including vulnerability to distortion resulting from noise and the ability to mimic signals via voice recording devices. Given these challenges, there's a pressing need for a fresh feature extraction technique that offers robust speaker identification using an enhanced spectrogram. This paper addresses this need by proposing an innovative and efficient feature extraction methodology, christened "Optimized Variance Spectral Flux (OVSF)". This potent technique, based on the Daubechies 40 wavelet and power spectrum of a signal, facilitates the extraction of unique features of the speaker. For the feature matching phase in speaker recognition, the characteristics of different speakers are compared by applying the time-honored Bayesian information criterion distance metric. The proposed system's effectiveness is assessed through a series of metrics including Receiver Operating Characteristics (ROC), detection error trade-off curves, the Equal Error Rate (EER), and the Area under the Curve (AUC). The experimental results yield an AUC and EER for the proposed method of 94.38 and 10.3564, respectively, indicating a higher accuracy than the mel-frequency cepstral coefficient technique.
引用
收藏
页码:2845 / 2852
页数:8
相关论文
empty
未找到相关数据