Blind source separation by long-term monitoring: A variational autoencoder to validate the clustering analysis

被引:3
|
作者
De Salvio, Domenico [1 ,2 ]
Bianco, Michael J. J. [2 ]
Gerstoft, Peter [2 ]
D'Orazio, Dario [1 ]
Garai, Massimo [1 ]
机构
[1] Univ Bologna, Dept Ind Engn DIN, Viale Risorgimento 2, I-40136 Bologna, Italy
[2] Univ Calif San Diego, Scripps Inst Oceanog, NoiseLab, La Jolla, CA 92037 USA
来源
关键词
NOISE-LEVELS; SPEECH ENHANCEMENT;
D O I
10.1121/10.0016887
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Noise exposure influences the comfort and well-being of people in several contexts, such as work or learning environments. For instance, in offices, different kind of noises can increase or drop the employees' productivity. Thus, the ability of separating sound sources in real contexts plays a key role in assessing sound environments. Long-term monitoring provide large amounts of data that can be analyzed through machine and deep learning algorithms. Based on previous works, an entire working day was recorded through a sound level meter. Both sound pressure levels and the digital audio recording were collected. Then, a dual clustering analysis was carried out to separate the two main sound sources experienced by workers: traffic and speech noises. The first method exploited the occurrences of sound pressure levels via Gaussian mixture model and K-means clustering. The second analysis performed a semi-supervised deep clustering analyzing the latent space of a variational autoencoder. Results show that both approaches were able to separate the sound sources. Spectral matching and the latent space of the variational autoencoder validated the assumptions underlying the proposed clustering methods.
引用
收藏
页码:738 / 750
页数:13
相关论文
共 50 条
  • [21] LONG-TERM EARTHQUAKE CLUSTERING
    KAGAN, YY
    JACKSON, DD
    GEOPHYSICAL JOURNAL INTERNATIONAL, 1991, 104 (01) : 117 - 133
  • [22] LONG-TERM POST ACCIDENT CHEMISTRY IN SOURCE TERM ANALYSIS
    CLOUGH, PN
    MULLINS, JR
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1988, 195 : 104 - NUCL
  • [23] Blind source separation by weighted K-means clustering
    Yi Qingming Dept. of Electronic Engineering
    JournalofSystemsEngineeringandElectronics, 2008, (05) : 882 - 887
  • [24] Blind source separation by weighted K-means clustering
    Yi Qingming
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2008, 19 (05) : 882 - 887
  • [25] INTEGRATION OF VARIATIONAL AUTOENCODER AND SPATIAL CLUSTERING FOR ADAPTIVE MULTI-CHANNEL NEURAL SPEECH SEPARATION
    Zmolikova, Katerina
    Delcroix, Marc
    Burget, Lukas
    Nakatani, Tomohiro
    Cernocky, Jan Honza
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 889 - 896
  • [26] Exploitation of source nonstationarity in underdetermined blind source separation with advanced clustering techniques
    Luo, Yuhui
    Wang, Wenwu
    Chambers, Jonathon A.
    Lambotharan, Sangarapillai
    Proudler, Ian
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) : 2198 - 2212
  • [27] Blind Source Separation: A Review and Analysis
    Pal, Madhab
    Roy, Rajib
    Basu, Joyanta
    Bepari, Milton S.
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [28] Underdetermined Blind Source Separation Based Condition Monitoring
    Vinaya, Anindita Adikaputri
    Arifianto, Dhany
    2015 INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH), 2015, : 47 - 52
  • [29] Blind Source Separation: A Preprocessing Tool for Monitoring of Structures
    Popescu, Theodor D.
    Alexandru, Adriana.
    2018 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR), 2018,
  • [30] Blind source separation using variational expectation-maximization algorithm
    Nasios, N
    Bors, AG
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2003, 2756 : 442 - 450