Blind source separation by long-term monitoring: A variational autoencoder to validate the clustering analysis

被引:3
|
作者
De Salvio, Domenico [1 ,2 ]
Bianco, Michael J. J. [2 ]
Gerstoft, Peter [2 ]
D'Orazio, Dario [1 ]
Garai, Massimo [1 ]
机构
[1] Univ Bologna, Dept Ind Engn DIN, Viale Risorgimento 2, I-40136 Bologna, Italy
[2] Univ Calif San Diego, Scripps Inst Oceanog, NoiseLab, La Jolla, CA 92037 USA
来源
关键词
NOISE-LEVELS; SPEECH ENHANCEMENT;
D O I
10.1121/10.0016887
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Noise exposure influences the comfort and well-being of people in several contexts, such as work or learning environments. For instance, in offices, different kind of noises can increase or drop the employees' productivity. Thus, the ability of separating sound sources in real contexts plays a key role in assessing sound environments. Long-term monitoring provide large amounts of data that can be analyzed through machine and deep learning algorithms. Based on previous works, an entire working day was recorded through a sound level meter. Both sound pressure levels and the digital audio recording were collected. Then, a dual clustering analysis was carried out to separate the two main sound sources experienced by workers: traffic and speech noises. The first method exploited the occurrences of sound pressure levels via Gaussian mixture model and K-means clustering. The second analysis performed a semi-supervised deep clustering analyzing the latent space of a variational autoencoder. Results show that both approaches were able to separate the sound sources. Spectral matching and the latent space of the variational autoencoder validated the assumptions underlying the proposed clustering methods.
引用
收藏
页码:738 / 750
页数:13
相关论文
共 50 条
  • [1] Parameter-adaptive variational autoencoder for linear/nonlinear blind source separation
    Wei, Yuan-Hao
    Ni, Yi-Qing
    JOURNAL OF CIVIL STRUCTURAL HEALTH MONITORING, 2024, : 1161 - 1184
  • [2] Supervised Determined Source Separation with Multichannel Variational Autoencoder
    Kameoka, Hirokazu
    Li, Li
    Inoue, Shota
    Makino, Shoji
    NEURAL COMPUTATION, 2019, 31 (09) : 1891 - 1914
  • [3] Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation
    Seki, Shogo
    Kameoka, Hirokazu
    Li, Li
    Toda, Tomoki
    Takeda, Kazuya
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [4] VARIATIONAL EM FOR CLUSTERING INTERAURAL PHASE CUES IN MESSL FOR BLIND SOURCE SEPARATION OF SPEECH
    Zohny, Zeinab
    Naqvi, Syed Mohsen
    Chambers, Jonathon A.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3966 - 3970
  • [5] Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder
    Seki, Shogo
    Kameoka, Hirokazu
    Li, Li
    Toda, Tomoki
    Takeda, Kazuya
    IEEE ACCESS, 2019, 7 : 168104 - 168115
  • [6] Speech Source Separation Using Variational Autoencoder and Bandpass Filter
    Do, Hao Duc
    Tran, Son Thai
    Chau, Duc Thanh
    IEEE ACCESS, 2020, 8 : 156219 - 156231
  • [7] Blind Source Separation Based on Variational Bayesian Independent Component Analysis
    Wang, Chunli
    Xu, Yan
    Tang, Minan
    Wang, Lei
    PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 1614 - 1618
  • [8] Autoencoder based blind source separation for photoacoustic resolution enhancement
    Benyamin, Matan
    Genish, Hadar
    Califa, Ran
    Wolbromsky, Lauren
    Ganani, Michal
    Wang, Zhen
    Zhou, Shuyun
    Xie, Zheng
    Zalevsky, Zeev
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [9] Autoencoder based blind source separation for photoacoustic resolution enhancement
    Matan Benyamin
    Hadar Genish
    Ran Califa
    Lauren Wolbromsky
    Michal Ganani
    Zhen Wang
    Shuyun Zhou
    Zheng Xie
    Zeev Zalevsky
    Scientific Reports, 10
  • [10] Blind Source Separation in Machine Monitoring
    Popescu, Theodor D.
    2011 IEEE 54TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2011,