Latent Timbre Synthesis: Audio-based variational auto-encoders for music composition and sound design applications

Authors
Kıvanç Tatar
Daniel Bisig
Philippe Pasquier
Affiliations
[1] Simon Fraser University
[2] Zurich University of the Arts
Keywords
Audio synthesis; Neural networks; Signal processing; Computer assisted music composition
Abstract
We present the Latent Timbre Synthesis, a new audio synthesis method using deep learning. The synthesis method allows composers and sound designers to interpolate and extrapolate between the timbre of multiple sounds using the latent space of audio frames. We provide the details of two Variational Autoencoder architectures for the Latent Timbre Synthesis and compare their advantages and drawbacks. The implementation includes a fully working application with a graphical user interface, called interpolate_two, which enables practitioners to generate timbres between two audio excerpts of their selection using interpolation and extrapolation in the latent space of audio frames. Our implementation is open source, and we aim to improve the accessibility of this technology by providing a guide for users with any technical background. Our study includes a qualitative analysis where nine composers evaluated the Latent Timbre Synthesis and the interpolate_two application within their practices.
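The interpolation and extrapolation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes an encoder has already mapped each audio frame of two excerpts to a latent vector, and that a decoder (not shown) would turn the blended vectors back into audio frames. The function name `blend_latents`, the parameter `alpha`, and the toy latent values are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def blend_latents(
    z_a: Sequence[Sequence[float]],
    z_b: Sequence[Sequence[float]],
    alpha: float,
) -> List[List[float]]:
    """Linearly blend two equal-length sequences of latent vectors, frame by frame.

    alpha = 0 returns z_a and alpha = 1 returns z_b. Values between 0 and 1
    interpolate; values below 0 or above 1 extrapolate past either excerpt.
    """
    if len(z_a) != len(z_b):
        raise ValueError("both excerpts must have the same number of frames")
    blended = []
    for frame_a, frame_b in zip(z_a, z_b):
        if len(frame_a) != len(frame_b):
            raise ValueError("latent vectors must have the same dimension")
        blended.append(
            [(1.0 - alpha) * a + alpha * b for a, b in zip(frame_a, frame_b)]
        )
    return blended


# Toy latent sequences: two frames, two latent dimensions (assumed values).
z_a = [[0.0, 1.0], [2.0, 3.0]]
z_b = [[4.0, 5.0], [6.0, 7.0]]

midpoint = blend_latents(z_a, z_b, 0.5)  # interpolation halfway between A and B
beyond_b = blend_latents(z_a, z_b, 1.5)  # extrapolation past B, away from A
```

In this sketch a single `alpha` is applied to every frame. A time-varying blend would pass a different value per frame, which is a design choice the abstract does not specify.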
Pages: 67-84
Page count: 17