Advanced RVC Inference

A modular Retrieval-based Voice Conversion framework with Gradio UI, training capabilities, and audio processing tools

Features

Voice Conversion: High-quality voice conversion with multiple pitch extraction methods
Model Training: Complete training pipeline for creating custom RVC models
Real-time Processing: Low-latency real-time voice conversion support
Web UI: Intuitive Gradio-based web interface
CLI Support: Command-line interface for scripting and automation
API Access: Python API for programmatic access
Audio Separation: Built-in tools for vocal/instrument separation
Text-to-Speech: Integration with edge-tts for TTS-based voice conversion

Installation

pip install git+https://github.com/ArkanDash/Advanced-RVC-Inference.git

With GPU Support

For CUDA-enabled GPUs:

pip install git+https://github.com/ArkanDash/Advanced-RVC-Inference.git#egg=advanced-rvc-inference[gpu]

From Source

git clone https://github.com/ArkanDash/Advanced-RVC-Inference.git
cd Advanced-RVC-Inference
pip install -e .

Development Installation

pip install git+https://github.com/ArkanDash/Advanced-RVC-Inference.git#egg=advanced-rvc-inference[dev]

Quick Start

Web Interface

Launch the Gradio web UI:

rvc-gui
# or
python -m advanced_rvc_inference.gui

The web interface will be available at http://localhost:7860

Command Line Interface

Run voice conversion from the command line:

rvc-cli infer --model path/to/model.pth --input audio.wav --output converted.wav --pitch 0

View help:

rvc-cli --help
rvc-cli infer --help

Python API

from advanced_rvc_inference import RVCInference

# Initialize the inference engine
rvc = RVCInference(device="cuda:0")

# Load a model
rvc.load_model("path/to/model.pth")

# Run inference
audio = rvc.infer("input.wav", pitch_change=0, output_path="output.wav")

# Or use batch processing
audio_files = rvc.infer_batch(
    input_dir="input_folder",
    output_dir="output_folder",
    pitch_change=2,
    format="wav"
)

# Cleanup
rvc.unload_model()

Command Reference

CLI Commands

Command	Description
`rvc-cli infer`	Run voice conversion inference
`rvc-cli train`	Train RVC models (use web UI)
`rvc-cli serve`	Launch the web interface
`rvc-cli version`	Show version information
`rvc-cli info`	Show system information

Inference Options

rvc-cli infer \
    --model MODEL.pth \
    --input input.wav \
    --output output.wav \
    --pitch 0 \
    --format wav \
    --index INDEX.index

Configuration

Environment Variables

Variable	Description	Default
`ARVC_ASSETS_PATH`	Path to asset directory	Package assets folder
`ARVC_CONFIGS_PATH`	Path to configs directory	Package configs folder
`ARVC_WEIGHTS_PATH`	Path to model weights	assets/weights
`ARVC_LOGS_PATH`	Path to logs directory	assets/logs

Configuration File

Configuration is managed through advanced_rvc_inference/configs/config.json:

{
    "device": "cuda:0",
    "fp16": true,
    "app_port": 7860,
    "language": "vi-VN",
    "theme": "NoCrypt/miku"
}

Dependencies

Core Dependencies

Python 3.10+
PyTorch 2.3.1+
torchaudio 2.3.1+
NumPy, SciPy
librosa (audio processing)
Gradio (web UI)

Optional Dependencies

onnxruntime-gpu (GPU inference acceleration)
faiss-gpu (vector similarity search)
tensorboard (training visualization)

See pyproject.toml for the complete dependency list.

Documentation

Troubleshooting

GPU Not Detected

Ensure you have CUDA installed and PyTorch with CUDA support:

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

Memory Issues

Reduce batch size or use CPU mode:

rvc = RVCInference(device="cpu")

Contributing

Contributions are welcome! Please read our Contributing Guide for details.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Terms of Use

The use of the converted voice for the following purposes is prohibited:

Criticizing or attacking individuals
Advocating for or opposing specific political positions, religions, or ideologies
Publicly displaying strongly stimulating expressions without proper zoning
Selling of voice models and generated voice clips
Impersonation of the original owner of the voice with malicious intentions
Fraudulent purposes that lead to identity theft or fraudulent phone calls

Credits

Repository	Owner
Vietnamese-RVC	PhamHuynhAnh16
Applio	IAHispano

Support

For issues and feature requests, please use the GitHub Issues page.

Made with by ArkanDash

Name		Name	Last commit message	Last commit date
Latest commit History 330 Commits
advanced_rvc_inference		advanced_rvc_inference
tests		tests
.gitignore		.gitignore
Advanced-RVC-NoUI.ipynb		Advanced-RVC-NoUI.ipynb
Advanced-RVC.ipynb		Advanced-RVC.ipynb
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
installer.bat		installer.bat
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
tmp		tmp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Advanced RVC Inference

Features

Installation

With GPU Support

From Source

Development Installation

Quick Start

Web Interface

Command Line Interface

Python API

Command Reference

CLI Commands

Inference Options

Configuration

Environment Variables

Configuration File

Dependencies

Core Dependencies

Optional Dependencies

Documentation

Troubleshooting

GPU Not Detected

Memory Issues

Contributing

License

Terms of Use

Credits

Support

About

Uh oh!

Releases

Uh oh!

Contributors 7

Uh oh!

Languages

License

ArkanDash/Advanced-RVC-Inference

Folders and files

Latest commit

History

Repository files navigation

Advanced RVC Inference

Features

Installation

With GPU Support

From Source

Development Installation

Quick Start

Web Interface

Command Line Interface

Python API

Command Reference

CLI Commands

Inference Options

Configuration

Environment Variables

Configuration File

Dependencies

Core Dependencies

Optional Dependencies

Documentation

Troubleshooting

GPU Not Detected

Memory Issues

Contributing

License

Terms of Use

Credits

Support

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Uh oh!

Contributors 7

Uh oh!

Languages