fix: remove FlashInfer due to version mismatch with vLLM 0.11.0 #246
FlashInfer was installed for cu121/torch2.3, but vLLM 0.11.0 pulls in torch 2.8.0, causing binary incompatibility and import errors. With FlashInfer removed, vLLM automatically falls back to FlashAttention or another supported attention backend.

This fixes the unhealthy workers caused by FlashInfer import errors.
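A minimal post-deploy sanity check, as a sketch: it assumes the worker image should no longer contain the `flashinfer` package and that vLLM must import cleanly without it (the check itself is not part of this change):

```python
# Illustrative sanity check for the worker image after this change.
import importlib.util

# FlashInfer should no longer be installed; its cu121/torch2.3 wheels are
# binary-incompatible with the torch 2.8.0 that vLLM 0.11.0 pulls in.
assert importlib.util.find_spec("flashinfer") is None, "flashinfer is still installed"

# vLLM should import cleanly and select a different attention backend
# (e.g. FlashAttention) on its own; an ImportError here would indicate
# a leftover incompatible FlashInfer build is still being picked up.
from vllm import LLM  # noqa: F401

print("vLLM import OK; FlashInfer not present")
```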