-
Notifications
You must be signed in to change notification settings - Fork 202
Open
Description
It appears to load the model and run but the audio output is garbled.
[VibeVoice] Model loaded in 50.59 seconds
[VibeVoice] Loading VibeVoice processor...
[VibeVoice] Found Qwen tokenizer in: ...\ComfyUI\models\vibevoice\tokenizer
[VibeVoice] Found complete tokenizer at: ...\ComfyUI\models\vibevoice\tokenizer
[VibeVoice] Standard from_pretrained failed: expected str, bytes or os.PathLike object, not NoneType
[VibeVoice] Trying with allow remote files...
[VibeVoice] Processing text segment 1 (10 words)
[VibeVoice] Starting audio generation with 20 diffusion steps...
[VibeVoice] Generating audio with 20 diffusion steps...
[VibeVoice] Note: Progress bar shows max possible tokens, not actual needed (~30 estimated)
[VibeVoice] The generation will stop automatically when audio is complete
[VibeVoice] Concatenating 1 audio segments (including pauses)...
[VibeVoice] Successfully generated audio with 1 segments
[VibeVoice] Model and processor memory freed successfully
Prompt executed in 69.61 seconds
Jasonzzt, povgeek, Winnougan and FR-Mister-T
Metadata
Metadata
Assignees
Labels
No labels