ROBUST SPEAKER DOA ESTIMATION WITH SINGLE AVS IN BISPECTRUM DOMAIN

被引:0
|
作者
Jin, Y. H. [1 ]
Zou, Y. X. [1 ]
机构
[1] Peking Univ, ADSPLAB, Sch ECE, Shenzhen 518055, Peoples R China
关键词
Direction of arrival estimation; acoustic vector sensor; bispectrum inter-sensor data ratio; interference; VECTOR SENSOR ARRAY; SOURCE LOCALIZATION; MULTIPATH;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For mobile speech application, speaker DOA estimation accuracy, interference robustness and compact physical size are three key factors. Considering the size, we utilized acoustic vector sensor (AVS) and proposed a DOA estimation algorithm previously [1], offering high accuracy with larger-than-15dB SNR but is deteriorated by non-speech interferences (NSI). This paper develops a robust speaker DOA estimation algorithm. It is achieved by deriving the inter-sensor data ratio model of an AVS in bispectrum domain (BISDR) and exploring the favorable properties of bispectrum, such as zero value of Gaussian process and different distribution of speech and NSI. Specifically, a reliable bispectrum mask is generated to guarantee that the speaker DOA cues, derived from BISDR, are robust to NSI in terms of speech sparsity and large bispectrum amplitude of the captured signals. Intensive experiments demonstrate an improved performance of our proposed algorithm under various NSI conditions even when SIR is smaller than 0dB.
引用
收藏
页码:3196 / 3200
页数:5
相关论文
共 50 条
  • [1] ROBUST SPEAKER DOA ESTIMATION BASED ON THE INTER-SENSOR DATA RATIO MODEL AND BINARY MASK ESTIMATION IN THE BISPECTRUM DOMAIN
    Jin, Yanhan
    Zou, Yuexian
    Ritz, C. H.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 3266 - 3270
  • [2] Bispectrum features for robust speaker identification
    Wenndt, S
    Shamsunder, S
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1095 - 1098
  • [3] A Robust High Resolution Speaker DOA Estimation under Reverberant Environment
    Guo, Yifan
    Zou, Y. X.
    Wang, Yongqing
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 400 - 400
  • [4] BEARING ESTIMATION IN THE BISPECTRUM DOMAIN
    FORSTER, P
    NIKIAS, CL
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (09) : 1994 - 2006
  • [5] A Robust hierarchical motion estimation algorithm in noisy image sequences in the bispectrum domain
    Alaoui, El Mehdi Ismaili
    Ibn-Elhaj, Elhassane
    SIGNAL IMAGE AND VIDEO PROCESSING, 2009, 3 (03) : 291 - 302
  • [6] A Robust hierarchical motion estimation algorithm in noisy image sequences in the bispectrum domain
    El Mehdi Ismaili Alaoui
    Elhassane Ibn-Elhaj
    Signal, Image and Video Processing, 2009, 3 : 291 - 302
  • [7] Joint Noise and Reverberation Adaptive Learning for Robust Speaker DOA Estimation with An Acoustic Vector Sensor
    Wang, Disong
    Zou, Yuexian
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 821 - 825
  • [8] Robust wideband DOA estimation
    Sellone, Fabrizio
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 249 - 254
  • [9] Robust text-independent speaker identification using bispectrum slice
    Özkurt, TE
    Akgül, T
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 418 - 421
  • [10] Fourth-Order Cumulants based Underdetermined 2-D DOA Estimation using Single AVS
    Sharma, Umesh
    Agrawal, Monika
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,