Hi, If my model is multimodal and the geneates actually defines different like this:
generation_output = model_tp.generate(
pixel_values=pixel_values,
input_ids=input_ids,
attention_mask=attention_mask,
**generation_config,
)
it is don't work.
How to make it work anyway