Rapid BRIR generation approach using Variational Auto-Encoders and LSTM neural networks

被引:0
|
作者
Sanaguano-Moreno, D. [1 ]
Lucio-Naranjo, J. F. [1 ]
Tenenbaum, R. A. [2 ,3 ]
Sampaio-Regattieri, G. B. [2 ]
机构
[1] Escuela Politec Nacl, Dept Informat & Ciencias Comp, Quito, Pichincha, Ecuador
[2] Univ Fed Santa Maria, Curso Engn Acust, Santa Maria, RS, Brazil
[3] Univ Fed Santa Maria, Programa Posgrad Engn Civil, Santa Maria, RS, Brazil
关键词
Auralization; Acoustic Virtual Reality; Variational Auto-encoders; Binaural room impulse responses; Generative models; Long short-term memory; ROOM ACOUSTICS; AURALIZATION; PREDICTION;
D O I
10.1016/j.apacoust.2023.109721
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article proposes a rapid binaural room impulse response (BRIR) generation for real-time auralization with a moving receiver inside a room and a fixed sound source. The proposed method uses generative models, such as variational auto-encoders and long short-term memory neural networks. First, a number of static receivers are modeled within a given room, covering a representative area where listeners can typically be found. Then, a BRIR is simulated in each receiver, and from them, more BRIRs uniformly distributed in the scenario are synthesized through a data augmentation procedure. In the sequel, by applying an unsupervised model to cluster the receivers positionally, the BRIRs are organized into groups. Furthermore, variational autoencoder training is performed to reduce computational complexity. The results when comparing the reconstructed BRIRs with the original ones show that the error is within the just noticeable difference threshold. A computational cost analysis in terms of the total number of operations is carried out, achieving a computational gain of over 75%, and a storage reduction of more than 99% with respect to classic BRIR generation techniques. This opens the possibility of obtaining a reliable real-time auralization with a moving receiver inside the room.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Semi-Implicit Graph Variational Auto-Encoders
    Hasanzadeh, Arman
    Hajiramezanali, Ehsan
    Duffield, Nick
    Narayanan, Krishna
    Zhou, Mingyuan
    Qian, Xiaoning
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [22] Deep variational auto-encoders for unsupervised glomerular classification
    Lutnick, Brendon
    Yacoub, Rabi
    Jen, Kuang-Yu
    Tomaszewski, John E.
    Jain, Sanjay
    Sarder, Pinaki
    MEDICAL IMAGING 2018: DIGITAL PATHOLOGY, 2018, 10581
  • [23] Continuous Hierarchical Representations with Poincare Variational Auto-Encoders
    Mathieu, Emile
    Le Lan, Charline
    Maddison, Chris J.
    Tomioka, Ryota
    Teh, Yee Whye
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [24] Nonparametric Variational Auto-encoders for Hierarchical Representation Learning
    Goyal, Prasoon
    Hu, Zhiting
    Liang, Xiaodan
    Wang, Chenyu
    Xing, Eric P.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5104 - 5112
  • [25] A comprehensive investigation of variational auto-encoders for population synthesis
    Sane, Abdoul Razac
    Vandanjon, Pierre-Olivier
    Belaroussi, Rachid
    Hankach, Pierre
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2025, 8 (01):
  • [26] Improving explainability of deep neural network-based electrocardiogram interpretation using variational auto-encoders
    van de Leur, Rutger R.
    Bos, Max N.
    Taha, Karim
    Sammani, Arjan
    Yeung, Ming Wai
    van Duijvenboden, Stefan
    Lambiase, Pier D.
    Hassink, Rutger J.
    van der Harst, Pim
    Doevendans, Pieter A.
    Gupta, Deepak K.
    van Es, Rene
    EUROPEAN HEART JOURNAL - DIGITAL HEALTH, 2022, 3 (03): : 390 - 404
  • [27] Data-driven Dimensional Expression Generation via Encapsulated Variational Auto-Encoders
    Wenjun Bai
    Changqin Quan
    Zhi-Wei Luo
    Cognitive Computation, 2023, 15 : 1342 - 1354
  • [28] Fault detection Neural Differential Auto-encoders
    Goswami, Umang
    Kodamana, Hariprasad
    Ramteke, Manojkumar
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 189
  • [29] Data-driven Dimensional Expression Generation via Encapsulated Variational Auto-Encoders
    Bai, Wenjun
    Quan, Changqin
    Luo, Zhi-Wei
    COGNITIVE COMPUTATION, 2023, 15 (04) : 1342 - 1354
  • [30] Anomaly Node Detection Method Based on Variational Graph Auto-Encoders in Attribute Networks
    Li Z.
    Jin X.
    Wang Y.
    Meng L.
    Zhuang C.
    Sun Z.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (01): : 17 - 25