Overview of speech enhancement techniques for automatic speaker recognition

被引:0
|
作者
OrtegaGarcia, J
GonzalezRodriguez, J
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Real world conditions differ from ideal or laboratory conditions, causing mismatch between training and testing phases, and consequently, inducing performance degradation in automatic speaker recognition systems [1]. Many strategies have been adopted to cope with acoustical degradation; in some applications of speaker identification systems a clean sample of speech, prior to the recognition stage, is needed. This has justified the use of procedures that may reduce the impact of acoustical noise on the desired signal, giving rise to techniques involved in the enhancement of noisy speech [2, 9]. In this paper, a comparative performance analysis of single-channel (based in classical spectral subtraction and some derived alternatives), dual-channel (based in adaptive noise cancelling) and multi-channel (using microphone arrays) speech enhancement techniques, with different types of noise at different SNRs, as a pre-processing stage to an ergodic HMM-based speaker recognizer, is presented.
引用
收藏
页码:929 / 932
页数:4
相关论文
共 50 条
  • [31] Methodologies for the evaluation of Speaker Diarization and Automatic Speech Recognition in the presence of overlapping speech
    Galibert, Olivier
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1130 - 1133
  • [32] SPEAKER-ADAPTABLE CLASSIFICATION PROCEDURE FOR AUTOMATIC SPEECH RECOGNITION
    KATTERFELDT, H
    THON, W
    NACHRICHTENTECHNISCHE ZEITSCHRIFT, 1974, 27 (06): : 230 - 232
  • [33] Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System
    Metzger, Richard A.
    Doherty, John F.
    Jenkins, David M.
    2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2015,
  • [34] Evaluating Automatic Speaker Recognition systems: An overview of the NIST Speaker Recognition Evaluations (1996-2014)
    Gonzalez-Rodriguez, Joaquin
    LOQUENS, 2014, 1 (01):
  • [35] DYNAMIC FREQUENCY WARPING FOR SPEAKER ADAPTATION IN AUTOMATIC SPEECH RECOGNITION
    PALIWAL, KK
    AINSWORTH, WA
    JOURNAL OF PHONETICS, 1985, 13 (02) : 123 - 134
  • [36] Spectral Analysis for Automatic Speech Recognition and Enhancement
    Oruh, Jane
    Viriri, Serestina
    MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 245 - 254
  • [37] TEnet: target speaker extraction network with accumulated speaker embedding for automatic speech recognition
    Li, Wenjie
    Zhang, Pengyuan
    Yan, Yonghong
    ELECTRONICS LETTERS, 2019, 55 (14) : 816 - 818
  • [38] Multi-Stage Speech Enhancement for Automatic Speech Recognition
    Lee, Seungyeol
    Lee, Youngwoo
    Cho, Namgook
    2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2016,
  • [39] An Improved Switch Speech Enhancement Algorithm for Automatic Speech Recognition
    Ma, Yongbao
    Zhou, Yi
    Liu, Jingang
    Xia, Jie
    Liu, Hongqing
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2015, : 430 - 435
  • [40] Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition
    Novotny, Ondrej
    Plchot, Oldrich
    Glembek, Ondrej
    Cernocky, Jan ''Honza''
    Burget, Lukas
    COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 403 - 421