DEREVERBERATION AND BEAMFORMING IN FAR-FIELD SPEAKER RECOGNITION

Authors
Mosner, Ladislav [1]
Matejka, Pavel
Novotny, Ondrej
Cernocky, Jan
Affiliations
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
Keywords
Speaker recognition; microphone array; beamforming; dereverberation; audio retransmission; FRONT-END; SPEECH;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first demonstrate how room acoustics cause significant degradation of a state-of-the-art i-vector-based speaker recognition system. We then investigate several techniques to improve performance, ranging from probabilistic linear discriminant analysis (PLDA) re-training, through dereverberation, to beamforming. We found that weighted prediction error (WPE) based dereverberation, combined with a generalized eigenvalue beamformer using power-spectral density (PSD) weighting masks generated by neural networks (NN), provides results approaching the clean close-microphone setup. Further improvement was obtained by re-training the PLDA or the mask-generating NNs on simulated target data. The work shows that a speaker recognition system working robustly in the far-field scenario can be developed.
Pages: 5254 - 5258
Page count: 5
Related papers
50 items in total
  • [31] HoloBeam: Learning Optimal Beamforming in Far-Field Holographic Metasurface Transceivers
    Ghosh, Debamita
    Hanawal, Manjesh K.
    Zlatanov, Nikola
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 301 - 310
  • [32] The continuous refractive beamforming surface for far-field with robustness and high efficiency
    Yasui, Toshifumi
    Tahara, Hiroyuki
    Kono, Yusuke
    Yamamoto, Kohei
    Iwane, Tetsuaki
    Kashihara, Yoshiki
    Inada, Masanobu
    LASER BEAM SHAPING XXII, 2022, 12218
  • [33] Far-Field Localization for RIS Empowered Wireless Systems Leveraging Beamforming
    Alhafid, Abdulrahaman Kh.
    Ali, Y. E. Mohammed
    Younis, Sedki
    JORDAN JOURNAL OF ELECTRICAL ENGINEERING, 2024, 10 (03): : 484 - 499
  • [34] Adaptive and hybrid Kronecker product beamforming for far-field speech signals
    Sharma, Rajib
    Cohen, Israel
    Benesty, Jacob
    SPEECH COMMUNICATION, 2020, 120 (120) : 42 - 52
  • [35] Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification
    Yang, Joon-Young
    Chang, Joon-Hyuk
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 3144 - 3159
  • [36] Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification
    Yang, Joon-Young
    Chang, Joon-Hyuk
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 3144 - 3159
  • [37] Detecting Replay Attacks from Far-Field Recordings on Speaker Verification Systems
    Villalba, Jesus
    Lleida, Eduardo
    BIOMETRICS AND ID MANAGEMENT, 2011, 6583 : 274 - 285
  • [38] DEVELOPING FAR-FIELD SPEAKER SYSTEM VIA TEACHER-STUDENT LEARNING
    Li, Jinyu
    Zhao, Rui
    Chen, Zhuo
    Liu, Changliang
    Xiao, Xiong
    Ye, Guoli
    Gong, Yifan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5699 - 5703
  • [39] Spatially Robust Far-field Beamforming Using the von Mises(-Fisher) Distribution
    Anderson, Craig A.
    Teal, Paul D.
    Poletti, Mark A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (12) : 2189 - 2197
  • [40] AN END-TO-END FAR-FIELD KEYWORD SPOTTING SYSTEM WITH NEURAL BEAMFORMING
    Ji, Xuan
    Lu, Lu
    Fang, Fuming
    Ma, Jianbo
    Zhu, Lei
    Li, Jinke
    Zhao, Dongdi
    Liu, Ming
    Jiang, Feijun
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 892 - 899