Stationary wavelet Filtering Cepstral coefficients (SWFCC) for robust speaker identification

被引:0
|
作者
Missaoui, Ibrahim [1 ,2 ]
Lachiri, Zied [1 ]
机构
[1] Univ Tunis El Manar, Natl Engn Sch Tunis ENIT, Signal Images & Informat Technol Lab, LR-11-ES17,BP 37, Tunis 1002, Tunisia
[2] Univ Gabes, Higher Inst Comp Sci & Multimedia Gabes, Gabes, Tunisia
关键词
Stationary wavelet filtering cepstral; coefficients; SWFCC; SWT; Stationary wavelet packet transform; Implicit wiener filtering; Feature extraction; GMM-UBM; Robust speaker recognition; SPEECH WAVE; PACKET;
D O I
10.1016/j.apacoust.2024.110435
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Extracting robust effective speech features is one of the challenging topics in the speaker recognition field, especially in noisy conditions. It can substantially improve the robustness recognition accuracy of persons from their voice signals against such conditions. This paper proposes a new feature extraction approach called Stationary Wavelet Filtering Cepstral Coefficients (SWFCC) for noisy speaker recognition. The proposed approach incorporates a Stationary Wavelet Filterbank (SWF) and an Implicit Wiener Filtering (IWF) technique. The SWF is based on the stationary wavelet packet transform, which is a shift-invariant transform. The performance of the proposed SWFCC approach is evaluated on the TIMIT dataset in the presence of different types of environmental noise, which are taken from the Aurora dataset. Our experimental results using the Gaussian Mixture ModelUniversal Background Model (GMM-UBM) as a classifier show that SWFCC outperforms various feature extraction techniques like MFCC, PNCC, and GFCC in terms of recognition accuracy.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] A robust wavelet-based text-independent speaker identification
    Phung Trung Nghia
    Pham Viet Binh
    Nguyen Huu Thai
    Nguyen Thanh Ha
    Kumsawat, Prayoth
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 219 - 223
  • [32] Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification
    Sadjadi, Seyed Omid
    Hansen, John H. L.
    SPEECH COMMUNICATION, 2015, 72 : 138 - 148
  • [33] Speaker Identification using Warped MVDR Cepstral Features
    Woelfel, Matthias
    Yang, Qian
    Jin, Qin
    Schultz, Tanja
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 904 - +
  • [34] Perceptual MVDR-Based Cepstral Coefficients (PMCCs) for Speaker Recognition
    Liang, Chunyan
    Zhang, Xiang
    Yang, Lin
    Zhang, Jianping
    Yan, Yonghong
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 1386 - 1389
  • [35] Constant Q cepstral coefficients: A spoofing countermeasure for automatic speaker verification
    Todisco, Massimiliano
    Delgado, Hector
    Evans, Nicholas
    COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 516 - 535
  • [36] Wavelet Packet Sub-band Cepstral Coefficient for Speaker Verification
    Min, Hang
    Wei, Guangcun
    Xu, Yunfei
    Zhang, Yanna
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 1713 - 1717
  • [37] Perceptual MVDR-based cepstral coefficients(PMCCs)for speaker recognition
    LIANG Chunyan ZHANG Xiang YANG Lin ZHANG Jianping YAN Yonghong (Key Laboratory of Speech Acoustics and Content Understanding
    ChineseJournalofAcoustics, 2012, 31 (04) : 489 - 498
  • [38] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
  • [39] Damped Oscillator Cepstral Coefficients for Robust Speech Recognition
    Mitra, Vikramjit
    Franco, Horacio
    Graciarena, Martin
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 886 - 890
  • [40] Daubechies Wavelet Cepstral Coefficients for Parkinson's Disease Detection
    Zayrit, Soumaya
    Belhoussine Drissi, Taoufiq
    Ammoumou, Abdelkrim
    Nsiri, Benayad
    COMPLEX SYSTEMS, 2020, 29 (03): : 729 - 739