Add Support For VibeVoice-Realtime-0.5B?

It appears to load the model and run but the audio output is garbled.

```
[VibeVoice] Model loaded in 50.59 seconds
[VibeVoice] Loading VibeVoice processor...
[VibeVoice] Found Qwen tokenizer in: ...\ComfyUI\models\vibevoice\tokenizer
[VibeVoice] Found complete tokenizer at: ...\ComfyUI\models\vibevoice\tokenizer
[VibeVoice] Standard from_pretrained failed: expected str, bytes or os.PathLike object, not NoneType
[VibeVoice] Trying with allow remote files...
[VibeVoice] Processing text segment 1 (10 words)
[VibeVoice] Starting audio generation with 20 diffusion steps...
[VibeVoice] Generating audio with 20 diffusion steps...
[VibeVoice] Note: Progress bar shows max possible tokens, not actual needed (~30 estimated)
[VibeVoice] The generation will stop automatically when audio is complete
[VibeVoice] Concatenating 1 audio segments (including pauses)...
[VibeVoice] Successfully generated audio with 1 segments
[VibeVoice] Model and processor memory freed successfully
Prompt executed in 69.61 seconds
```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Support For VibeVoice-Realtime-0.5B? #192

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Add Support For VibeVoice-Realtime-0.5B? #192

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions