
@kabachuha

This pull request adds "manual" first-last frame support for Hunyuan1.5 video generation via latent concatenation.

The code is very simple and is based on the existing first-last-frame implementation for Wan.
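For readers unfamiliar with the approach, here is a minimal sketch of the general idea, not the code in this PR. It uses NumPy instead of torch, skips the VAE encode step entirely, and all names (`build_first_last_conditioning`, `concat_for_model`, the 0.5 gray fill, the channel-first latent layout) are assumptions made for illustration.

```python
import numpy as np


def build_first_last_conditioning(first, last, length, fill=0.5):
    """Build a conditioning clip with only the first and last frames known.

    first, last: float arrays of shape (H, W, C) with values in [0, 1].
    Returns (frames, mask): frames has shape (length, H, W, C); mask has
    shape (length,), where 0 marks a provided frame and 1 marks a frame
    the model is expected to generate.
    """
    if length < 2:
        raise ValueError("need at least two frames for first-last conditioning")
    if first.shape != last.shape:
        raise ValueError("first and last frames must have the same shape")

    # Unknown frames are filled with neutral gray (an assumption of this sketch).
    frames = np.full((length, *first.shape), fill, dtype=np.float32)
    mask = np.ones(length, dtype=np.float32)

    frames[0], mask[0] = first, 0.0
    frames[-1], mask[-1] = last, 0.0
    return frames, mask


def concat_for_model(noisy_latent, cond_latent, mask_latent):
    """Concatenate along the channel axis, assuming a (C, T, H, W) layout.

    In a real pipeline cond_latent would be the VAE encoding of the
    conditioning clip and mask_latent a mask resized to latent resolution.
    """
    return np.concatenate([noisy_latent, cond_latent, mask_latent], axis=0)
```

The mask is what tells the model which frames are given and which are filler, so the same scheme covers first-frame-only input by leaving the last entry at 1.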

Hunyuan takes CLIP vision embeddings as input. Since there is no dedicated first-last-frame model, only one of the two embeddings is used, as provided.

One thing I would like help with: the last frame often "flickers" in abruptly instead of being reached through a smooth transition. I also observed this problem with Wan2.2/VACE, but it was less noticeable there.

Otherwise, the output looks good, and the model does take the last frame's subject into consideration.

hunyuan_video_1.5_00010.mp4
ComfyUI_temp_rqlky_00011_ ComfyUI_temp_rqlky_00002_

Closes #11020.


Development

Successfully merging this pull request may close these issues.

Last frame input for Hunyuan1.5 Image2Video
