Overview of speech enhancement techniques for automatic speaker recognition

被引：0

作者：

OrtegaGarcia, J

GonzalezRodriguez, J

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Real world conditions differ from ideal or laboratory conditions, causing mismatch between training and testing phases, and consequently, inducing performance degradation in automatic speaker recognition systems [1]. Many strategies have been adopted to cope with acoustical degradation; in some applications of speaker identification systems a clean sample of speech, prior to the recognition stage, is needed. This has justified the use of procedures that may reduce the impact of acoustical noise on the desired signal, giving rise to techniques involved in the enhancement of noisy speech [2, 9]. In this paper, a comparative performance analysis of single-channel (based in classical spectral subtraction and some derived alternatives), dual-channel (based in adaptive noise cancelling) and multi-channel (using microphone arrays) speech enhancement techniques, with different types of noise at different SNRs, as a pre-processing stage to an ergodic HMM-based speaker recognizer, is presented.

引用

页码：929 / 932

页数：4

共 50 条

[31] Methodologies for the evaluation of Speaker Diarization and Automatic Speech Recognition in the presence of overlapping speech
Galibert, Olivier
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1130 - 1133
[32] SPEAKER-ADAPTABLE CLASSIFICATION PROCEDURE FOR AUTOMATIC SPEECH RECOGNITION
KATTERFELDT, H
THON, W
NACHRICHTENTECHNISCHE ZEITSCHRIFT, 1974, 27 (06): : 230 - 232
[33] Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System
Metzger, Richard A.
Doherty, John F.
Jenkins, David M.
2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2015,
[34] Evaluating Automatic Speaker Recognition systems: An overview of the NIST Speaker Recognition Evaluations (1996-2014)
Gonzalez-Rodriguez, Joaquin
LOQUENS, 2014, 1 (01):
[35] DYNAMIC FREQUENCY WARPING FOR SPEAKER ADAPTATION IN AUTOMATIC SPEECH RECOGNITION
PALIWAL, KK
AINSWORTH, WA
JOURNAL OF PHONETICS, 1985, 13 (02) : 123 - 134
[36] Spectral Analysis for Automatic Speech Recognition and Enhancement
Oruh, Jane
Viriri, Serestina
MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 245 - 254
[37] TEnet: target speaker extraction network with accumulated speaker embedding for automatic speech recognition
Li, Wenjie
Zhang, Pengyuan
Yan, Yonghong
ELECTRONICS LETTERS, 2019, 55 (14) : 816 - 818
[38] Multi-Stage Speech Enhancement for Automatic Speech Recognition
Lee, Seungyeol
Lee, Youngwoo
Cho, Namgook
2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2016,
[39] An Improved Switch Speech Enhancement Algorithm for Automatic Speech Recognition
Ma, Yongbao
Zhou, Yi
Liu, Jingang
Xia, Jie
Liu, Hongqing
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2015, : 430 - 435
[40] Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition
Novotny, Ondrej
Plchot, Oldrich
Glembek, Ondrej
Cernocky, Jan ''Honza''
Burget, Lukas
COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 403 - 421

← 1 2 3 4 5 →