Autolume is a no-coding generative AI system allowing artists to train, craft, and explore their own models.

Metacreation-Lab/autolume

Autolume

Autolume-Live is a tool for real-time visual performances using StyleGAN.

For detailed documentation and demos, see the Metacreation Lab website.

[Screenshot: Autolume-Live screen]

Downloads

Latest release:

Examples

For examples of artworks created with Autolume, see: https://www.metacreation.net/artworks

Development instructions

Dependencies

Windows only

  • Microsoft C++ Build Tools (download link)
    • Minimum components: Desktop development with C++

Linux only (Ubuntu 24.04)

sudo apt install portaudio19-dev ffmpeg avahi-daemon
sudo systemctl enable --now avahi-daemon

Install the NDI SDK for Linux:

curl -L https://downloads.ndi.tv/SDK/NDI_SDK_Linux/Install_NDI_SDK_v6_Linux.tar.gz -o /tmp/ndi_sdk.tar.gz
tar -xzf /tmp/ndi_sdk.tar.gz -C /tmp
cd /tmp && sh Install_NDI_SDK_v6_Linux.sh

The installer creates a directory with spaces in its name. Create a symlink for easier use:

ln -s "/tmp/NDI SDK for Linux" /tmp/ndi-sdk

Initial setup

Create the Python environment:

conda create -n autolume python=3.10
conda activate autolume
pip install -r requirements.txt
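A quick way to confirm that the right environment is active is to compare the interpreter version against the one pinned in the `conda create` command above. The following is a minimal sketch, not part of Autolume; the helper name `matches_expected` is hypothetical.

```python
import sys

# Version pinned by the `conda create -n autolume python=3.10` command above.
EXPECTED = (3, 10)


def matches_expected(version_info=sys.version_info, expected=EXPECTED):
    """Return True when the major.minor part of `version_info` equals `expected`."""
    return tuple(version_info[:2]) == tuple(expected)


if __name__ == "__main__":
    found = ".".join(str(part) for part in sys.version_info[:3])
    if matches_expected():
        print(f"Python {found}: OK")
    else:
        print(f"Python {found}: expected {EXPECTED[0]}.{EXPECTED[1]}; "
              "is the autolume environment active?")
```

Running the script inside the activated environment prints a one-line status; it only inspects the interpreter and does not check the packages from requirements.txt.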

Linux: install ndi-python from source

The ndi-python pip package does not ship pre-built Linux wheels. Build it from the git repo using the NDI SDK symlink:

git clone --recursive https://github.com/buresu/ndi-python.git /tmp/ndi-python
export CMAKE_ARGS="-DNDI_SDK_DIR=/tmp/ndi-sdk"
pip install /tmp/ndi-python -v
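After the build finishes, it can help to confirm that the module is visible to the interpreter before launching the program. The sketch below assumes the package installs a top-level module named `NDIlib` (the name used in the ndi-python examples); the helper `is_importable` is hypothetical and only looks the module up without loading the NDI runtime.

```python
import importlib.util


def is_importable(module_name):
    """Return True if a top-level module with this name can be located."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # "NDIlib" is an assumption about the module name installed by ndi-python.
    name = "NDIlib"
    status = "found" if is_importable(name) else "not found"
    print(f"{name}: {status}")
```

A "not found" result usually means the build ran in a different environment than the one currently active.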

Download pre-trained models

Download pre-trained models from Real-ESRGAN and face-parsing.PyTorch:

curl -L 'https://github.com/xinntao/Real-ESRGAN/releases/download/v0.1.0/RealESRGAN_x4plus.pth' -o 'sr_models/Quality.pth'
curl -L 'https://github.com/xinntao/Real-ESRGAN/releases/download/v0.2.5.0/realesr-general-x4v3.pth' -o 'sr_models/Balance.pth'
mkdir -p training/distillation/Util/face_parsing/pretrained_model
curl -L 'https://drive.google.com/uc?export=download&id=154JgKpzCPW82qINcVieuPH3fZ2e0P812' -o 'training/distillation/Util/face_parsing/pretrained_model/79999_iter.pth'

Download FFmpeg binaries (Windows only)

mkdir bin
ffdl download 7.1.1@full -d bin -y
tar -xf bin\ffmpeg-7.1.1-full_build.zip -C bin

On Linux, FFmpeg is installed via apt (see Linux dependencies above).
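To check that an FFmpeg executable can actually be found after either installation route, a small lookup such as the following may be useful. It is a sketch under assumptions: the helper `find_executable` is hypothetical, and how Autolume itself locates FFmpeg is not described here, so any extra directories passed in are the caller's guess.

```python
import shutil


def find_executable(name, extra_dirs=()):
    """Return the full path to `name`, searching PATH first and then `extra_dirs`.

    Returns None when the executable is not found anywhere.
    """
    found = shutil.which(name)
    if found:
        return found
    for directory in extra_dirs:
        # shutil.which accepts a custom search path as a string.
        found = shutil.which(name, path=str(directory))
        if found:
            return found
    return None


if __name__ == "__main__":
    location = find_executable("ffmpeg")
    print(location if location else "ffmpeg not found on PATH")
```

On Windows, the directory that contains the extracted `ffmpeg.exe` can be passed through `extra_dirs` if it has not been added to PATH.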

Download default FFHQ model

mkdir -p models
curl -L 'https://api.ngc.nvidia.com/v2/models/org/nvidia/team/research/stylegan2/1/files?redirect=true&path=stylegan2-ffhq-512x512.pkl' -o 'models/stylegan2-ffhq-512x512.pkl'
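Because `curl` will happily save an error page under the requested file name, it can be worth checking the downloaded files before starting the program. The sketch below uses the output paths from the `curl` commands in this README; the helper `check_downloads` is hypothetical, and the HTML test is only a heuristic for spotting a saved web page in place of a model file.

```python
from pathlib import Path

# Output paths taken from the curl commands in this README.
EXPECTED_FILES = (
    "sr_models/Quality.pth",
    "sr_models/Balance.pth",
    "training/distillation/Util/face_parsing/pretrained_model/79999_iter.pth",
    "models/stylegan2-ffhq-512x512.pkl",
)


def check_downloads(root=".", files=EXPECTED_FILES):
    """Return a dict mapping each problematic relative path to a short reason."""
    problems = {}
    for rel in files:
        path = Path(root) / rel
        if not path.is_file():
            problems[rel] = "missing"
            continue
        # Read only the first bytes; model files can be very large.
        with path.open("rb") as handle:
            head = handle.read(64)
        if not head:
            problems[rel] = "empty"
        elif head.lstrip().lower().startswith((b"<!doctype", b"<html")):
            # Heuristic: a web page was saved instead of the binary file.
            problems[rel] = "looks like HTML"
    return problems


if __name__ == "__main__":
    for rel, reason in check_downloads().items():
        print(f"{rel}: {reason}")
```

Run from the repository root, the script prints nothing when all four files are present and look like binary data.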

Running the program

Activate the environment (if not already activated)

conda activate autolume

Start the program

python main.py

Create release for Windows

Run the release script

release.bat

The release will be output to dist/Autolume.

Building documentation

mkdocs build

The documentation will be output to site.

To serve the documentation locally:

mkdocs serve

The documentation will be served at http://127.0.0.1:8000/.

Credits

  • Philippe Pasquier: Director of Metacreation Lab for Creative AI, Principal Investigator
  • Arshia Sobhan: Project management
  • Lionel Ringenbach: Software development
  • Michael Tjokrowardojo: Software development
  • Jonas Kraasch: Project first-phase development
  • Arthur Deleu: Project development
  • Mahshid Jabari: DevOps
  • Peter Chen: Contributions to the super-resolution module
  • Manuel Bolduc: User testing
  • Ahmed Abuzuraiq: Research associate

Acknowledgment

StyleGAN3: https://github.com/NVlabs/stylegan3

GAN compression: https://github.com/lychenyoko/content-aware-gan-compression

GANSpace: https://github.com/harskish/ganspace

License

Some of the dependencies fall under the NVIDIA Source Code License.

Project status

This code base is still subject to change, and bugs may still appear. Please open issues to let us know so we can polish it for the final release!

As the research team behind Autolume, we emphasize that the system is a tool for exploration and innovation, and that it is provided without direct support.
