Blind source separation by long-term monitoring: A variational autoencoder to validate the clustering analysis

被引:3
|
作者
De Salvio, Domenico [1 ,2 ]
Bianco, Michael J. J. [2 ]
Gerstoft, Peter [2 ]
D'Orazio, Dario [1 ]
Garai, Massimo [1 ]
机构
[1] Univ Bologna, Dept Ind Engn DIN, Viale Risorgimento 2, I-40136 Bologna, Italy
[2] Univ Calif San Diego, Scripps Inst Oceanog, NoiseLab, La Jolla, CA 92037 USA
来源
关键词
NOISE-LEVELS; SPEECH ENHANCEMENT;
D O I
10.1121/10.0016887
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Noise exposure influences the comfort and well-being of people in several contexts, such as work or learning environments. For instance, in offices, different kind of noises can increase or drop the employees' productivity. Thus, the ability of separating sound sources in real contexts plays a key role in assessing sound environments. Long-term monitoring provide large amounts of data that can be analyzed through machine and deep learning algorithms. Based on previous works, an entire working day was recorded through a sound level meter. Both sound pressure levels and the digital audio recording were collected. Then, a dual clustering analysis was carried out to separate the two main sound sources experienced by workers: traffic and speech noises. The first method exploited the occurrences of sound pressure levels via Gaussian mixture model and K-means clustering. The second analysis performed a semi-supervised deep clustering analyzing the latent space of a variational autoencoder. Results show that both approaches were able to separate the sound sources. Spectral matching and the latent space of the variational autoencoder validated the assumptions underlying the proposed clustering methods.
引用
收藏
页码:738 / 750
页数:13
相关论文
共 50 条
  • [31] Deep clustering analysis via variational autoencoder with Gamma mixture latent embeddings
    Guo, Jiaxun
    Fan, Wentao
    Amayri, Manar
    Bouguila, Nizar
    NEURAL NETWORKS, 2025, 183
  • [32] Deep Clustering Analysis via Dual Variational Autoencoder With Spherical Latent Embeddings
    Yang, Lin
    Fan, Wentao
    Bouguila, Nizar
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6303 - 6312
  • [33] Long-Term Monitoring of the Effectiveness of a Ground Source Heat Pump Borehole
    Rubinova, Olga
    Ambrozova, Iva
    Horak, Petr
    ENVIBUILD 2014, 2014, 1041 : 125 - 128
  • [34] BLIND AUDIO SOURCE SEPARATION USING SHORT plus LONG TERM AR SOURCE MODELS AND SPECTRUM MATCHING
    Schutz, Antony
    Slock, Dirk
    2011 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP AND IEEE SIGNAL PROCESSING EDUCATION WORKSHOP (DSP/SPE), 2011, : 112 - 115
  • [35] Long-Term Monitoring of Infiltration Trench for Nonpoint Source Pollution Control
    Marla C. Maniquiz
    So-Young Lee
    Lee-Hyung Kim
    Water, Air, & Soil Pollution, 2010, 212 : 13 - 26
  • [36] Long-Term Monitoring of Infiltration Trench for Nonpoint Source Pollution Control
    Maniquiz, Marla C.
    Lee, So-Young
    Kim, Lee-Hyung
    WATER AIR AND SOIL POLLUTION, 2010, 212 (1-4): : 13 - 26
  • [37] Blind Source Separation of Different Retinal Pulsatile Patterns from Simultaneous Long-term Binocular Ophthalmoscopic Video-records
    Labounkova, Ivana
    Labounek, Rene
    Odstrcilik, Jan
    Hracho, Michal
    Nestrasil, Igor
    Tornow, Ralf P.
    Kolar, Radim
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 4729 - 4732
  • [38] Long-term temperature prediction with hybrid autoencoder algorithms
    Perez-Aracil, J.
    Fister, D.
    Marina, C. M.
    Pelaez-Rodriguez, C.
    Cornejo-Bueno, L.
    Gutierrez, P. A.
    Giuliani, M.
    Castelleti, A.
    Salcedo-Sanz, S.
    APPLIED COMPUTING AND GEOSCIENCES, 2024, 23
  • [39] An Underdetermined Blind Source Separation Algorithm based on Clustering Analysis and Time-frequency Representation
    Yu Lu
    Qu Jian-ling
    Gao Feng
    Tian Yan-ping
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 1951 - 1956
  • [40] Long-term Monitoring and Data Analysis of the Tamar Bridge
    Cross, E. J.
    Koo, K. Y.
    Brownjohn, J. M. W.
    Worden, K.
    PROCEEDINGS OF ISMA2010 - INTERNATIONAL CONFERENCE ON NOISE AND VIBRATION ENGINEERING INCLUDING USD2010, 2010, : 1345 - 1357