Rapid BRIR generation approach using Variational Auto-Encoders and LSTM neural networks

被引:0
|
作者
Sanaguano-Moreno, D. [1 ]
Lucio-Naranjo, J. F. [1 ]
Tenenbaum, R. A. [2 ,3 ]
Sampaio-Regattieri, G. B. [2 ]
机构
[1] Escuela Politec Nacl, Dept Informat & Ciencias Comp, Quito, Pichincha, Ecuador
[2] Univ Fed Santa Maria, Curso Engn Acust, Santa Maria, RS, Brazil
[3] Univ Fed Santa Maria, Programa Posgrad Engn Civil, Santa Maria, RS, Brazil
关键词
Auralization; Acoustic Virtual Reality; Variational Auto-encoders; Binaural room impulse responses; Generative models; Long short-term memory; ROOM ACOUSTICS; AURALIZATION; PREDICTION;
D O I
10.1016/j.apacoust.2023.109721
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article proposes a rapid binaural room impulse response (BRIR) generation for real-time auralization with a moving receiver inside a room and a fixed sound source. The proposed method uses generative models, such as variational auto-encoders and long short-term memory neural networks. First, a number of static receivers are modeled within a given room, covering a representative area where listeners can typically be found. Then, a BRIR is simulated in each receiver, and from them, more BRIRs uniformly distributed in the scenario are synthesized through a data augmentation procedure. In the sequel, by applying an unsupervised model to cluster the receivers positionally, the BRIRs are organized into groups. Furthermore, variational autoencoder training is performed to reduce computational complexity. The results when comparing the reconstructed BRIRs with the original ones show that the error is within the just noticeable difference threshold. A computational cost analysis in terms of the total number of operations is carried out, achieving a computational gain of over 75%, and a storage reduction of more than 99% with respect to classic BRIR generation techniques. This opens the possibility of obtaining a reliable real-time auralization with a moving receiver inside the room.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Description Generation Using Variational Auto-Encoders for Precursor microRNA
    Petkovic, Marko
    Menkovski, Vlado
    ENTROPY, 2024, 26 (11)
  • [2] Correlated Variational Auto-Encoders
    Tang, Da
    Liang, Dawen
    Jebara, Tony
    Ruozzi, Nicholas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [3] Hyperspherical Variational Auto-Encoders
    Davidson, Tim R.
    Falorsi, Luca
    De Cao, Nicola
    Kipf, Thomas
    Tomczak, Jakub M.
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 856 - 865
  • [4] Generation and Extraction of Color Palettes with Adversarial Variational Auto-Encoders
    Moussa, Ahmad
    Watanabe, Hiroshi
    PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 889 - 897
  • [5] Unsupervised Anomaly Localization Using Variational Auto-Encoders
    Zimmerer, David
    Isensee, Fabian
    Petersen, Jens
    Kohl, Simon
    Maier-Hein, Klaus
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV, 2019, 11767 : 289 - 297
  • [6] Monte Carlo Variational Auto-Encoders
    Thin, Achille
    Kotelevskii, Nikita
    Durmus, Alain
    Panov, Maxim
    Moulines, Eric
    Doucet, Arnaud
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7258 - 7267
  • [7] Consistency Regularization for Variational Auto-Encoders
    Sinha, Samarth
    Dieng, Adji B.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Adversarial Training of Variational Auto-encoders for High Fidelity Image Generation
    Khan, Salman H.
    Hayat, Munawar
    Barnes, Nick
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1312 - 1320
  • [9] Radon-Sobolev Variational Auto-Encoders
    Turinici, Gabriel
    NEURAL NETWORKS, 2021, 141 : 294 - 305
  • [10] Self-Supervised Variational Auto-Encoders
    Gatopoulos, Ioannis
    Tomczak, Jakub M.
    ENTROPY, 2021, 23 (06)