NOISE AND SPEAKER COMPENSATION IN THE LOG FILTER BANK DOMAIN

Cited by: 0
Authors
Joshi, Vikas [1]
Bilgi, Raghavendra [1]
Umesh, S. [1]
Garcia, L. [2]
Benitez, C. [2]
Affiliations
[1] Indian Inst Technol, Dept Elect Engn, Madras 600036, Tamil Nadu, India
[2] Univ Granada, Dept Signal Theory Telemat & Commun, E-18071 Granada, Spain
Keywords
Speaker Normalization; Noise Compensation; VTS; TVTLN; Noise and Speaker compensation;
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
In this paper, we propose a method to compensate for noise and speaker variability directly in the log filter-bank (FB) domain, so that the resulting MFCC features are robust to noise and speaker variations. For noise compensation, we use the Vector Taylor Series (VTS) approach in the log FB domain, and speaker normalization is also done in the log FB domain using linear vocal tract length normalization (VTLN) matrices. For VTLN, the optimal warp factor is selected in the log FB domain using a canonical GMM, avoiding the two-pass approach needed with an HMM. Further, this can be implemented efficiently using sufficient statistics obtained from the GMM and the FB-VTLN matrices. Warp-factor selection using the GMM can also be done in the cepstral domain by applying DCT matrices, without the usual approximations associated with conventional linear VTLN. The elegance of the proposed approach is that, given the speech data, we directly obtain MFCC features that are robust to noise and speaker variations. The proposed approach shows a significant relative improvement of 31% over the baseline on the Aurora-4 task.
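The warp-factor selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a diagonal-covariance GMM, a hypothetical dictionary `warp_matrices` mapping each candidate warp factor to a square linear transform applied to log FB frames, and a Jacobian term `T * log|det A|` as is common for linear feature transforms. It scores every frame by brute force rather than using the sufficient statistics mentioned in the abstract.

```python
import numpy as np

def gmm_loglik(X, weights, means, variances):
    """Total log-likelihood of frames X (T x D) under a diagonal-covariance GMM.

    weights: (M,), means: (M, D), variances: (M, D). All names are illustrative.
    """
    diff = X[:, None, :] - means[None, :, :]                      # (T, M, D)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (M,)
    log_comp = (np.log(weights) + log_norm
                - 0.5 * np.sum(diff ** 2 / variances, axis=2))    # (T, M)
    # Log-sum-exp over mixture components for numerical stability.
    peak = log_comp.max(axis=1, keepdims=True)
    per_frame = peak[:, 0] + np.log(np.sum(np.exp(log_comp - peak), axis=1))
    return float(np.sum(per_frame))

def select_warp_factor(X, warp_matrices, weights, means, variances):
    """Return the candidate warp factor with the highest GMM likelihood.

    warp_matrices is a hypothetical dict {alpha: A_alpha}; each A_alpha is a
    (D, D) matrix applied to every log FB frame. The Jacobian term is an
    assumption of this sketch, not something stated in the abstract.
    """
    num_frames = X.shape[0]
    best_alpha, best_score = None, -np.inf
    for alpha, A in warp_matrices.items():
        _, logdet = np.linalg.slogdet(A)
        score = gmm_loglik(X @ A.T, weights, means, variances) + num_frames * logdet
        if score > best_score:
            best_alpha, best_score = alpha, score
    return best_alpha
```

Because only a single GMM pass per candidate is needed, no transcription or HMM alignment enters the selection, which is the property the abstract contrasts with the two-pass HMM approach.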
Pages: 4709-4712
Page count: 4