DEREVERBERATION AND BEAMFORMING IN FAR-FIELD SPEAKER RECOGNITION

Cited by: 0
Authors
Mosner, Ladislav [1]
Matejka, Pavel
Novotny, Ondrej
Cernocky, Jan
Affiliations
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
Keywords
Speaker recognition; microphone array; beamforming; dereverberation; audio retransmission; FRONT-END; SPEECH;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first demonstrate how room acoustics cause significant degradation of a state-of-the-art i-vector-based speaker recognition system. We then investigate several techniques to improve performance, ranging from probabilistic linear discriminant analysis (PLDA) re-training, through dereverberation, to beamforming. We found that weighted prediction error (WPE) based dereverberation combined with a generalized eigenvalue beamformer with power-spectral density (PSD) weighting masks generated by neural networks (NN) provides results approaching the clean close-microphone setup. Further improvement was obtained by re-training PLDA or the mask-generating NNs on simulated target data. The work shows that a speaker recognition system working robustly in the far-field scenario can be developed.
Pages: 5254 - 5258
Number of pages: 5
Related papers
50 in total
  • [1] Dereverberation and Beamforming in Robust Far-Field Speaker Recognition
    Mosner, Ladislav
    Plchot, Oldrich
    Matejka, Pavel
    Novotny, Ondrej
    Cernocky, Jan Honza
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1334 - 1338
  • [2] End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
    Zhang, Wangyou
    Subramanian, Aswin Shanmugam
    Chang, Xuankai
    Watanabe, Shinji
    Qian, Yanmin
    INTERSPEECH 2020, 2020, : 324 - 328
  • [3] Far-field speaker recognition
    Jin, Qin
    Schultz, Tanja
    Waibel, Alex
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2023 - 2032
  • [4] Far-field speaker recognition
    Jin, Qin
    Pan, Yue
    Schultz, Tanja
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 937 - 940
  • [5] Dereverberation of autoregressive envelopes for far-field speech recognition
    Purushothaman, Anurenjan
    Sreeram, Anirudh
    Kumar, Rohit
    Ganapathy, Sriram
    COMPUTER SPEECH AND LANGUAGE, 2022, 72
  • [6] Far-field continuous speech recognition system based on speaker localization and sub-band beamforming
    Asaei, Afsaneh
    Taghizadeh, Mohammad Javad
    Sameti, Hossein
    2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 495 - +
  • [7] STRUCTURAL SPARSIFICATION FOR FAR-FIELD SPEAKER RECOGNITION WITH INTEL® GNA
    Zhang, Jingchi
    Huang, Jonathan
    Deisher, Michael
    Li, Hai
    Chen, Yiran
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3037 - 3041
  • [8] Far-Field Speaker Recognition Benchmark Derived From The DiPCo Corpus
    Rouvier, Mickael
    Mohammadamini, Mohammad
    LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1955 - 1959
  • [9] Intel Far-field Speaker Recognition System for VOiCES Challenge 2019
    Huang, Jonathan
    Bocklet, Tobias
    INTERSPEECH 2019, 2019, : 2473 - 2477
  • [10] STC-innovation Speaker Recognition Systems for Far-Field Speaker Verification Challenge 2020
    Gusev, Aleksei
    Volokhov, Vladimir
    Vinogradova, Alisa
    Andzhukaev, Tseren
    Shulipa, Andrey
    Novoselov, Sergey
    Pekhovsky, Timur
    Kozlov, Alexander
    INTERSPEECH 2020, 2020, : 3466 - 3470