Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices

被引:1
|
作者
Corey, Ryan M. [1 ]
Mittal, Manan [1 ]
Sarkar, Kanad [1 ]
Singer, Andrew C. [1 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
来源
INTERSPEECH 2022 | 2022年
基金
美国国家科学基金会;
关键词
speech separation; distributed microphone array; asynchronous microphone array; wearable devices; SAMPLING FREQUENCY MISMATCH;
D O I
10.21437/Interspeech.2022-11025
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We consider the problem of separating speech from several talkers in background noise using a fixed microphone array and a set of wearable devices. Wearable devices can provide reliable information about speech from their wearers, but they typically cannot be used directly for multichannel source separation due to network delay, sample rate offsets, and relative motion. Instead, the wearable microphone signals are used to compute the speech presence probability for each talker at each time-frequency index. Those parameters, which are robust against small sample rate offsets and relative motion, are used to track the second-order statistics of the speech sources and background noise. The fixed array then separates the speech signals using an adaptive linear time-varying multichannel Wiener filter. The proposed method is demonstrated using real-room recordings from three human talkers with binaural earbud microphones and an eight-microphone tabletop array.
引用
收藏
页码:5398 / 5402
页数:5
相关论文
共 50 条
  • [1] COOPERATIVE AUDIO SOURCE SEPARATION AND ENHANCEMENT USING DISTRIBUTED MICROPHONE ARRAYS AND WEARABLE DEVICES
    Corey, Ryan M.
    Skarha, Matthew D.
    Singer, Andrew C.
    2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 296 - 300
  • [2] Microphone Array Speech Separation Algorithm based on DNN
    Wu, Chaoyan
    Zhou, Lin
    Chen, Xijin
    Chen, Liyuan
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1305 - 1310
  • [3] Microphone array beamforming approach to blind speech separation
    Himawan, Ivan
    McCowan, Iain
    Lincoln, Mike
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2008, 4892 : 295 - +
  • [4] SPEECH SEPARATION USING PARTIALLY ASYNCHRONOUS MICROPHONE ARRAYS WITHOUT RESAMPLING
    Corey, Ryan M.
    Singer, Andrew C.
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 111 - 115
  • [5] Speech enhancement using square microphone array for mobile devices
    Takada, Shintaro
    Ogawa, Tetsuji
    Akagiri, Kenzo
    Kobayashi, Tetsunori
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 313 - 316
  • [6] Wearable Speech Enhancement System based on MEMS Microphone Array for Disabled People
    Palla, Alessandro
    Fanucci, Luca
    Sannino, Roberto
    Settin, Mattia
    2015 10TH IEEE INTERNATIONAL CONFERENCE ON DESIGN & TECHNOLOGY OF INTEGRATED SYSTEMS IN NANOSCALE ERA (DTIS), 2015,
  • [7] Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array
    Asano, Futoshi
    Yamamoto, Kiyoshi
    Ogata, Jun
    Yamada, Miichi
    Nakamura, Andmasami
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [8] Microphone Array Speech Separation Algorithm Based on TC-ResNet
    Zhou, Lin
    Xu, Yue
    Wang, Tianyi
    Feng, Kun
    Shi, Jingang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (02): : 2705 - 2716
  • [9] DISTRIBUTED MICROPHONE ARRAY PROCESSING FOR SPEECH SOURCE SEPARATION WITH CLASSIFIER FUSION
    Souden, Mehrez
    Kinoshita, Keisuke
    Delcroix, Marc
    Nakatani, Tomohiro
    2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,
  • [10] Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array
    Futoshi Asano
    Kiyoshi Yamamoto
    Jun Ogata
    Miichi Yamada
    Masami Nakamura
    EURASIP Journal on Audio, Speech, and Music Processing, 2007