Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices

被引：1

作者：

Corey, Ryan M. ^{[1
]}

Mittal, Manan ^{[1
]}

Sarkar, Kanad ^{[1
]}

Singer, Andrew C. ^{[1
]}

机构：

[1] Univ Illinois, Urbana, IL 61801 USA

来源：

INTERSPEECH 2022 | 2022年

基金：

美国国家科学基金会;

关键词：

speech separation; distributed microphone array; asynchronous microphone array; wearable devices; SAMPLING FREQUENCY MISMATCH;

D O I：

10.21437/Interspeech.2022-11025

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We consider the problem of separating speech from several talkers in background noise using a fixed microphone array and a set of wearable devices. Wearable devices can provide reliable information about speech from their wearers, but they typically cannot be used directly for multichannel source separation due to network delay, sample rate offsets, and relative motion. Instead, the wearable microphone signals are used to compute the speech presence probability for each talker at each time-frequency index. Those parameters, which are robust against small sample rate offsets and relative motion, are used to track the second-order statistics of the speech sources and background noise. The fixed array then separates the speech signals using an adaptive linear time-varying multichannel Wiener filter. The proposed method is demonstrated using real-room recordings from three human talkers with binaural earbud microphones and an eight-microphone tabletop array.

引用

页码：5398 / 5402

页数：5

共 50 条

[1] COOPERATIVE AUDIO SOURCE SEPARATION AND ENHANCEMENT USING DISTRIBUTED MICROPHONE ARRAYS AND WEARABLE DEVICES
Corey, Ryan M.
Skarha, Matthew D.
Singer, Andrew C.
2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 296 - 300
[2] Microphone Array Speech Separation Algorithm based on DNN
Wu, Chaoyan
Zhou, Lin
Chen, Xijin
Chen, Liyuan
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1305 - 1310
[3] Microphone array beamforming approach to blind speech separation
Himawan, Ivan
McCowan, Iain
Lincoln, Mike
MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2008, 4892 : 295 - +
[4] SPEECH SEPARATION USING PARTIALLY ASYNCHRONOUS MICROPHONE ARRAYS WITHOUT RESAMPLING
Corey, Ryan M.
Singer, Andrew C.
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 111 - 115
[5] Speech enhancement using square microphone array for mobile devices
Takada, Shintaro
Ogawa, Tetsuji
Akagiri, Kenzo
Kobayashi, Tetsunori
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 313 - 316
[6] Wearable Speech Enhancement System based on MEMS Microphone Array for Disabled People
Palla, Alessandro
Fanucci, Luca
Sannino, Roberto
Settin, Mattia
2015 10TH IEEE INTERNATIONAL CONFERENCE ON DESIGN & TECHNOLOGY OF INTEGRATED SYSTEMS IN NANOSCALE ERA (DTIS), 2015,
[7] Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array
Asano, Futoshi
Yamamoto, Kiyoshi
Ogata, Jun
Yamada, Miichi
Nakamura, Andmasami
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
[8] Microphone Array Speech Separation Algorithm Based on TC-ResNet
Zhou, Lin
Xu, Yue
Wang, Tianyi
Feng, Kun
Shi, Jingang
CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 69 (02): : 2705 - 2716
[9] DISTRIBUTED MICROPHONE ARRAY PROCESSING FOR SPEECH SOURCE SEPARATION WITH CLASSIFIER FUSION
Souden, Mehrez
Kinoshita, Keisuke
Delcroix, Marc
Nakatani, Tomohiro
2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,
[10] Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array
Futoshi Asano
Kiyoshi Yamamoto
Jun Ogata
Miichi Yamada
Masami Nakamura
EURASIP Journal on Audio, Speech, and Music Processing, 2007

← 1 2 3 4 5 →