Skip to content

Commit 8d3975f

Browse files
fix CPU backend
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
1 parent 62c4476 commit 8d3975f

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/v1/cudagraph_dispatcher.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -161,9 +161,9 @@ def dispatch(
161161
A new batch descriptor is returned as we might dispatch a uniform batch
162162
to a graph that supports a more general batch (uniform to non-uniform).
163163
"""
164-
# if not initialized, just skip dispatching.
165164
if (
166165
not self.keys_initialized
166+
or self.cudagraph_mode == CUDAGraphMode.NONE
167167
or num_tokens > self.compilation_config.max_cudagraph_capture_size
168168
):
169169
return CUDAGraphMode.NONE, BatchDescriptor(num_tokens)

0 commit comments

Comments
 (0)