Semi-Supervised Source Localization in Reverberant Environments with Deep Generative Modeling

被引：0

作者：

Bianco, Michael J. ^{[1
]}

Gannot, Sharon ^{[2
]}

Fernandez-Grande, Efren ^{[3
]}

Gerstoft, Peter ^{[1
]}

机构：

[1] Marine Physical Laboratory, University of California San Diego, San Diego,CA,92093, United States

[2] Faculty of Engineering, Bar-Ilan University, Ramat-Gan,5290002, Israel

[3] Department of Electrical Engineering, Technical University of Denmark, Kongens Lyngby,2800, Denmark

来源：

IEEE Access | 2021年 / 9卷

基金：

欧盟地平线“2020”;

关键词：

Multiple signal classification - Learning systems - Music - Signal processing - Reverberation - Supervised learning - Computer music;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Localization in reverberant environments remains an open challenge. Recently, supervised learning approaches have demonstrated very promising results in addressing reverberation. However, even with large data volumes, the number of labels available for supervised learning in such environments is usually small. We propose to address this issue with a semi-supervised learning (SSL) approach, based on deep generative modeling. Our chosen deep generative model, the variational autoencoder (VAE), is trained to generate the phase of relative transfer functions (RTFs) between microphones. In parallel, a direction of arrival (DOA) classifier network based on RTF-phase is also trained. The joint generative and discriminative model, deemed VAE-SSL, is trained using labeled and unlabeled RTF-phase sequences. In learning to generate and classify the sequences, the VAE-SSL extracts the physical causes of the RTF-phase (i.e., source location) from distracting signal characteristics such as noise and speech activity. This facilitates effective end-to-end operation of the VAE-SSL, which requires minimal preprocessing of RTF-phase. VAE-SSL is compared with two signal processing-based approaches, steered response power with phase transform (SRP-PHAT) and MUltiple SIgnal Classification (MUSIC), as well as fully supervised CNNs. The approaches are compared using data from two real acoustic environments - one of which was recently obtained at Technical University of Denmark specifically for our study. We find that VAE-SSL can outperform the conventional approaches and the CNN in label-limited scenarios. Further, the trained VAE-SSL system can generate new RTF-phase samples which capture the physics of the acoustic environment. Thus, the generative modeling in VAE-SSL provides a means of interpreting the learned representations. To the best of our knowledge, this paper presents the first approach to modeling the physics of acoustic propagation using deep generative modeling. © 2013 IEEE.

引用

页码：84956 / 84970

共 50 条

[41] Semi-supervised protein subcellular localization
Qian Xu
Derek Hao Hu
Hong Xue
Weichuan Yu
Qiang Yang
BMC Bioinformatics, 10
[42] BL-GAN: Semi-Supervised Bug Localization via Generative Adversarial Network
Zhu, Ziye
Tong, Hanghang
Wang, Yu
Li, Yun
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11112 - 11125
[43] Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network
Yoo, Jaehyun
SENSORS, 2024, 24 (17)
[44] Deep Graph-Convolutional Generative Adversarial Network for Semi-Supervised Learning on Graphs
Jia, Nan
Tian, Xiaolin
Gao, Wenxing
Jiao, Licheng
REMOTE SENSING, 2023, 15 (12)
[45] SEMI-SUPERVISED DEEP GENERATIVE MODELS FOR CHANGE DETECTION IN VERY HIGH RESOLUTION IMAGERY
Connors, Clayton
Vatsavai, Ranga Raju
2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 1063 - 1066
[46] Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data
Du, Changde
Du, Changying
Wang, Hao
Li, Jinpeng
Zheng, Wei-Long
Lu, Bao-Liang
He, Huiguang
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 108 - 116
[47] Semi-Supervised Semantic Image Segmentation by Deep Diffusion Models and Generative Adversarial Networks
Diaz-Frances, Jose Angel
Fernandez-Rodriguez, Jose David
Thurnhofer-Hemsi, Karl
Lopez-Rubio, Ezequiel
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2024, 34 (11)
[48] SEMI-SUPERVISED DEEP LEARNING SEISMIC IMPEDANCE INVERSION USING GENERATIVE ADVERSARIAL NETWORKS
Meng, Delin
Wu, Bangyu
Liu, Naihao
Chen, Wenchao
IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1393 - 1396
[49] FMixCutMatch for semi-supervised deep learning
Wei, Xiang
Wei, Xiaotao
Kong, Xiangyuan
Lu, Siyang
Xing, Weiwei
Lu, Wei
Neural Networks, 2021, 133 : 166 - 176
[50] Close Sound Source Localization incorporating Semi-Supervised Variational Bayesian NMF
Kumon, Makoto
Washizaki, Kai
Nakadai, Kazuhiro
2019 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2019, : 313 - 318

← 1 2 3 4 5 →