Latent Timbre Synthesis: Audio-based variational auto-encoders for music composition and sound design applications

Cited by: 0

Authors:

Kıvanç Tatar

Daniel Bisig

Philippe Pasquier

Affiliations:

[1] Simon Fraser University

[2] Zurich University of the Arts

Source:

Neural Computing and Applications | 2021 / Volume 33

Keywords:

Audio synthesis; Neural networks; Signal processing; Computer assisted music composition

DOI:

Not available

Abstract:

We present the Latent Timbre Synthesis, a new audio synthesis method using deep learning. The synthesis method allows composers and sound designers to interpolate and extrapolate between the timbre of multiple sounds using the latent space of audio frames. We provide the details of two Variational Autoencoder architectures for the Latent Timbre Synthesis and compare their advantages and drawbacks. The implementation includes a fully working application with a graphical user interface, called interpolate_two, which enables practitioners to generate timbres between two audio excerpts of their selection using interpolation and extrapolation in the latent space of audio frames. Our implementation is open source, and we aim to improve the accessibility of this technology by providing a guide for users with any technical background. Our study includes a qualitative analysis where nine composers evaluated the Latent Timbre Synthesis and the interpolate_two application within their practices.
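The interpolation and extrapolation mentioned in the abstract can be illustrated with a minimal sketch. It assumes a simple linear blend between two latent vectors, which this record does not confirm as the authors' exact procedure. The helper name `blend_latents` and the example latent values are hypothetical, and the Variational Autoencoder that would encode audio frames into such vectors and decode them back is not shown.

```python
from typing import List, Sequence


def blend_latents(z_a: Sequence[float], z_b: Sequence[float], alpha: float) -> List[float]:
    """Linearly blend two latent vectors (hypothetical helper, not from the paper).

    alpha in [0, 1] interpolates between z_a and z_b;
    alpha < 0 or alpha > 1 extrapolates beyond one of them.
    """
    if len(z_a) != len(z_b):
        raise ValueError("latent vectors must have the same dimensionality")
    return [(1.0 - alpha) * a + alpha * b for a, b in zip(z_a, z_b)]


# Made-up latent codes standing in for one encoded frame of each sound.
z_a = [0.0, 1.0, -2.0]
z_b = [2.0, 1.0, 2.0]

midpoint = blend_latents(z_a, z_b, 0.5)  # interpolation: halfway between the two
beyond_b = blend_latents(z_a, z_b, 1.5)  # extrapolation: past z_b, away from z_a

print(midpoint)
print(beyond_b)
```

In a full pipeline each blended vector would be passed to a trained decoder to obtain an audio frame; that step depends on the model and is omitted here.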

Pages: 67-84

Number of pages: 17
