Semi-Supervised Source Localization in Reverberant Environments with Deep Generative Modeling

Cited by: 0
Authors
Bianco, Michael J. [1]
Gannot, Sharon [2]
Fernandez-Grande, Efren [3]
Gerstoft, Peter [1]
Affiliations
[1] Marine Physical Laboratory, University of California San Diego, San Diego, CA, 92093, United States
[2] Faculty of Engineering, Bar-Ilan University, Ramat-Gan, 5290002, Israel
[3] Department of Electrical Engineering, Technical University of Denmark, Kongens Lyngby, 2800, Denmark
Funding
European Union Horizon 2020
Keywords
Multiple signal classification - Learning systems - Music - Signal processing - Reverberation - Supervised learning - Computer music
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Localization in reverberant environments remains an open challenge. Recently, supervised learning approaches have demonstrated promising results in addressing reverberation. However, even with large data volumes, the number of labels available for supervised learning in such environments is usually small. We propose to address this issue with a semi-supervised learning (SSL) approach based on deep generative modeling. Our chosen deep generative model, the variational autoencoder (VAE), is trained to generate the phase of relative transfer functions (RTFs) between microphones. In parallel, a direction of arrival (DOA) classifier network based on RTF-phase is also trained. The joint generative and discriminative model, termed VAE-SSL, is trained using labeled and unlabeled RTF-phase sequences. In learning to generate and classify the sequences, the VAE-SSL extracts the physical causes of the RTF-phase (i.e., source location) from distracting signal characteristics such as noise and speech activity. This facilitates effective end-to-end operation of the VAE-SSL, which requires minimal preprocessing of RTF-phase. VAE-SSL is compared with two signal processing-based approaches, steered response power with phase transform (SRP-PHAT) and MUltiple SIgnal Classification (MUSIC), as well as fully supervised CNNs. The approaches are compared using data from two real acoustic environments, one of which was recently obtained at the Technical University of Denmark specifically for our study. We find that VAE-SSL can outperform the conventional approaches and the CNN in label-limited scenarios. Further, the trained VAE-SSL system can generate new RTF-phase samples which capture the physics of the acoustic environment. Thus, the generative modeling in VAE-SSL provides a means of interpreting the learned representations. To the best of our knowledge, this paper presents the first approach to modeling the physics of acoustic propagation using deep generative modeling. © 2013 IEEE.
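To make the RTF-phase feature named in the abstract concrete, the following is a minimal sketch of how such a phase could be estimated for one microphone pair. It is not the authors' pipeline: the function name `rtf_phase`, the frame length, the hop size, and the estimator (cross-spectrum divided by the reference auto-spectrum, averaged over frames) are all assumptions chosen for illustration, and the test signal is synthetic white noise with a pure integer delay rather than reverberant speech.

```python
import numpy as np

def rtf_phase(ref, mic, frame_len=256, hop=128):
    """Estimate the phase of the relative transfer function of `mic`
    with respect to `ref`, one value per frequency bin.

    Assumed estimator (illustrative only): frame-averaged cross-spectrum
    divided by the frame-averaged auto-spectrum of the reference channel.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(ref) - frame_len) // hop
    n_bins = frame_len // 2 + 1
    cross = np.zeros(n_bins, dtype=complex)  # accumulates M(k) * conj(R(k))
    auto = np.zeros(n_bins)                  # accumulates |R(k)|^2
    for i in range(n_frames):
        seg = slice(i * hop, i * hop + frame_len)
        R = np.fft.rfft(window * ref[seg])
        M = np.fft.rfft(window * mic[seg])
        cross += M * np.conj(R)
        auto += np.abs(R) ** 2
    rtf = cross / (auto + 1e-12)  # small constant avoids division by zero
    return np.angle(rtf)          # phase wrapped to [-pi, pi]

# Synthetic check: the second channel is the first delayed by 3 samples.
rng = np.random.default_rng(0)
source = rng.standard_normal(16000)
delay = 3
ref = source
mic = np.roll(source, delay)  # circular shift, standing in for a pure delay
phase = rtf_phase(ref, mic)
```

For a pure delay of `delay` samples, the phase at bin `k` is expected to follow `-2*pi*k*delay/frame_len` up to wrapping, which is the kind of location-dependent structure a classifier or generative model could learn from. In a reverberant room the phase deviates from this linear trend, which is the difficulty the abstract describes.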
Pages: 84956-84970
Related papers
50 in total
  • [41] Semi-supervised protein subcellular localization
    Qian Xu
    Derek Hao Hu
    Hong Xue
    Weichuan Yu
    Qiang Yang
    BMC Bioinformatics, 10
  • [42] BL-GAN: Semi-Supervised Bug Localization via Generative Adversarial Network
    Zhu, Ziye
    Tong, Hanghang
    Wang, Yu
    Li, Yun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (11) : 11112 - 11125
  • [43] Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network
    Yoo, Jaehyun
    SENSORS, 2024, 24 (17)
  • [44] Deep Graph-Convolutional Generative Adversarial Network for Semi-Supervised Learning on Graphs
    Jia, Nan
    Tian, Xiaolin
    Gao, Wenxing
    Jiao, Licheng
    REMOTE SENSING, 2023, 15 (12)
  • [45] SEMI-SUPERVISED DEEP GENERATIVE MODELS FOR CHANGE DETECTION IN VERY HIGH RESOLUTION IMAGERY
    Connors, Clayton
    Vatsavai, Ranga Raju
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 1063 - 1066
  • [46] Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data
    Du, Changde
    Du, Changying
    Wang, Hao
    Li, Jinpeng
    Zheng, Wei-Long
    Lu, Bao-Liang
    He, Huiguang
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 108 - 116
  • [47] Semi-Supervised Semantic Image Segmentation by Deep Diffusion Models and Generative Adversarial Networks
    Diaz-Frances, Jose Angel
    Fernandez-Rodriguez, Jose David
    Thurnhofer-Hemsi, Karl
    Lopez-Rubio, Ezequiel
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2024, 34 (11)
  • [48] SEMI-SUPERVISED DEEP LEARNING SEISMIC IMPEDANCE INVERSION USING GENERATIVE ADVERSARIAL NETWORKS
    Meng, Delin
    Wu, Bangyu
    Liu, Naihao
    Chen, Wenchao
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1393 - 1396
  • [49] FMixCutMatch for semi-supervised deep learning
    Wei, Xiang
    Wei, Xiaotao
    Kong, Xiangyuan
    Lu, Siyang
    Xing, Weiwei
    Lu, Wei
    Neural Networks, 2021, 133 : 166 - 176
  • [50] Close Sound Source Localization incorporating Semi-Supervised Variational Bayesian NMF
    Kumon, Makoto
    Washizaki, Kai
    Nakadai, Kazuhiro
    2019 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2019, : 313 - 318