Skip to content

david-hogberg/latent-vector-synthesis

Repository files navigation

Latent Vector Synthesis

Latent Vector Synthesis is a sound synthesis framework combining latent audio spaces and vector synthesis techniques.

This prototype of a Latent Vector Synthesizer incorporates a Variational Autoencoder (VAE) model trained on short single cycle waveforms that enables interpolations and explorations of sonic textures. The generated waveforms are used as part of a vector- and wavetable synthesis engine developed in Pure Data.

The project builds on the work of Tatar et al. [1, 2].

Installation

Python

1 - Download and install Anaconda for your operating system: https://docs.anaconda.com/free/anaconda/install/index.html

2 - Open a terminal and create a new Python environment (here named 'lvs'):

conda create --name lvs python=3.10

3 - Activate your environment:

conda activate lvs

4 - Install PyTorch using conda for your operating system: https://pytorch.org

5 - Install the following Python libraries:

  • Librosa
pip install librosa
  • Python-osc
pip install python-osc

Pure Data

Install Pure Data (Pd-vanilla): https://puredata.info

Run

1 - Open a terminal and navigate to the cloned repository.

2 - Run the Python script:

python main.py

Wait until the osc-infoformation appears (for sending/receiving osc-messages).

3 - Run latent-vector-synth.pd in Pure Data.

  • Make sure the right audio output device is selected (Go to Media —> Audio Settings…)
  • Press RESET and then RANDOMIZE ALL. Make sure the DSP toggle is on.
  • Make sure to toggle AUDIO OUT and raise the gain.

4 - Happy droning!

References

[1] Kıvanç Tatar, Daniel Bisig, and Philippe Pasquier. Latent Timbre Synthesis: Audio-based variational auto-encoders for music composition and sound design applications. Neural Computing and Applications, 33(1):67–84, 2021. URL: https://link.springer.com/10.1007/s00521-020-05424-2, doi:10.1007/s00521-020-05424-2

[2] Kıvanç Tatar, Kelsey Cotton, and Daniel Bisig. Sound design strategies for latent audio space explorations using deep learning architectures. In Proceedings of Sound and Music Computing 2023, 2023.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages