Blind source separation by long-term monitoring: A variational autoencoder to validate the clustering analysis

Cited by: 3
Authors
De Salvio, Domenico [1, 2]
Bianco, Michael J. [2]
Gerstoft, Peter [2]
D'Orazio, Dario [1]
Garai, Massimo [1]
Affiliations
[1] Univ Bologna, Dept Ind Engn DIN, Viale Risorgimento 2, I-40136 Bologna, Italy
[2] Univ Calif San Diego, Scripps Inst Oceanog, NoiseLab, La Jolla, CA 92037 USA
Source
Keywords
NOISE-LEVELS; SPEECH ENHANCEMENT;
DOI
10.1121/10.0016887
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Noise exposure influences the comfort and well-being of people in several contexts, such as work or learning environments. In offices, for instance, different kinds of noise can raise or lower employees' productivity. Thus, the ability to separate sound sources in real contexts plays a key role in assessing sound environments. Long-term monitoring provides large amounts of data that can be analyzed through machine learning and deep learning algorithms. Building on previous work, an entire working day was recorded with a sound level meter, collecting both the sound pressure levels and the digital audio recording. A dual clustering analysis was then carried out to separate the two main sound sources experienced by workers: traffic noise and speech. The first method exploited the occurrences of sound pressure levels via a Gaussian mixture model and K-means clustering. The second performed semi-supervised deep clustering by analyzing the latent space of a variational autoencoder. Results show that both approaches were able to separate the sound sources. Spectral matching and the latent space of the variational autoencoder validated the assumptions underlying the proposed clustering methods.
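The first clustering step described in the abstract (a Gaussian mixture model and K-means applied to the occurrences of sound pressure levels) can be illustrated with a minimal sketch. This is not the authors' implementation: the synthetic levels, the two assumed sources at 45 dB and 60 dB, their spreads, the sample counts, and the helper names `kmeans_1d` and `gmm_1d` are all hypothetical choices made only to show the idea on scalar data.

```python
import math
import random


def kmeans_1d(x, k=2, iters=50):
    """Lloyd's algorithm on scalar data; returns the cluster centres, sorted."""
    xs = sorted(x)
    n = len(xs)
    # Deterministic initialisation: centres spread across the sorted data.
    centres = [xs[(2 * j + 1) * n // (2 * k)] for j in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in x:
            nearest = min(range(k), key=lambda j: abs(v - centres[j]))
            groups[nearest].append(v)
        # Keep the old centre if a cluster happens to be empty.
        centres = [sum(g) / len(g) if g else centres[j]
                   for j, g in enumerate(groups)]
    return sorted(centres)


def normal_pdf(v, mu, sigma):
    z = (v - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def gmm_1d(x, init_means, iters=200):
    """EM for a 1-D Gaussian mixture, initialised from the given means."""
    k = len(init_means)
    n = len(x)
    means = list(init_means)
    mean_x = sum(x) / n
    spread = math.sqrt(sum((v - mean_x) ** 2 for v in x) / n)
    sigmas = [max(spread, 1e-3)] * k
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: posterior probability of each component for each sample.
        resp = []
        for v in x:
            p = [weights[j] * normal_pdf(v, means[j], sigmas[j])
                 for j in range(k)]
            total = sum(p) or 1e-300
            resp.append([pj / total for pj in p])
        # M-step: re-estimate weight, mean and spread of each component.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)
            means[j] = sum(r[j] * v for r, v in zip(resp, x)) / nj
            var = sum(r[j] * (v - means[j]) ** 2 for r, v in zip(resp, x)) / nj
            sigmas[j] = max(math.sqrt(var), 1e-3)
            weights[j] = nj / n
    return weights, means, sigmas


# Synthetic stand-in for a day of measured levels (values are assumptions).
random.seed(0)
levels = ([random.gauss(45.0, 2.0) for _ in range(600)] +
          [random.gauss(60.0, 3.0) for _ in range(400)])

centres = kmeans_1d(levels)
weights, means, sigmas = gmm_1d(levels, centres)
for w, m, s in zip(weights, means, sigmas):
    print(f"weight={w:.2f}  mean={m:.1f} dB  sigma={s:.1f} dB")
```

Here K-means supplies the starting means for the mixture fit, which is one common way to combine the two methods; the record does not state how the paper combines them. Each fitted component then stands for one assumed source, and a sample can be attributed to the component with the higher posterior probability.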
Pages: 738 - 750
Number of pages: 13
Related papers
50 in total
  • [41] Long-term disturbance monitoring for improved system analysis
    Balser, Steven J.
    Clark, Harrison K.
    IEEE Computer Applications in Power, 1992, v (0n): : 33 - 36
  • [42] Long-term monitoring and data analysis of the Tamar Bridge
    Cross, E. J.
    Koo, K. Y.
    Brownjohn, J. M. W.
    Worden, K.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2013, 35 (1-2) : 16 - 34
  • [43] Efficient Blind Source Separation Method for fMRI Using Autoencoder and Spatiotemporal Sparsity Constraints
    Khalid, Muhammad Usman
    Khawaja, Bilal A.
    Nauman, Malik Muhammad
    IEEE ACCESS, 2023, 11 : 50364 - 50381
  • [44] Analysis of traffic-induced vibrations by blind source separation with application in building monitoring
    Popescu, Th. D.
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2010, 80 (12) : 2374 - 2385
  • [45] Blind Source Separation for MT-InSAR Analysis With Structural Health Monitoring Applications
    Martin, Gabriel
    Hooper, Andrew
    Wright, Tim J.
    Selvakumaran, Sivasakthy
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 7605 - 7618
  • [46] INVESTIGATION AND COMPARISON OF OPTIMIZATION METHODS FOR VARIATIONAL AUTOENCODER-BASED UNDERDETERMINED MULTICHANNEL SOURCE SEPARATION
    Seki, Shogo
    Kameoka, Hirokazu
    Li, Li
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 511 - 515
  • [47] Short-term tests validate long-term estimates of climate change
    Palmer, Tim
    NATURE, 2020, 582 (7811) : 185 - 186
  • [48] A Special FCL Clustering and Its Application to Sparse Blind Source Separation
    Tong, Yu
    Zhang, Yunjie
    CEIS 2011, 2011, 15
  • [49] Mixing matrix estimation using discriminative clustering for blind source separation
    Thiagarajan, Jayaraman J.
    Ramamurthy, Karthikeyan Natesan
    Spanias, Andreas
    DIGITAL SIGNAL PROCESSING, 2013, 23 (01) : 9 - 18
  • [50] Underdetermined Blind Source Separation with Fuzzy Clustering for Arbitrarily Arranged Sensors
    Jafari, Ingrid
    Haque, Serajul
    Togneri, Roberto
    Nordholm, Sven
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1764 - +