1 parent 62c4476 commit 8d3975f
vllm/v1/cudagraph_dispatcher.py
@@ -161,9 +161,9 @@ def dispatch(
         A new batch descriptor is returned as we might dispatch a uniform batch
         to a graph that supports a more general batch (uniform to non-uniform).
         """
-        # if not initialized, just skip dispatching.
         if (
             not self.keys_initialized
+            or self.cudagraph_mode == CUDAGraphMode.NONE
             or num_tokens > self.compilation_config.max_cudagraph_capture_size
         ):
             return CUDAGraphMode.NONE, BatchDescriptor(num_tokens)
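
The early-exit condition in this hunk can be exercised in isolation. The following is a minimal sketch, not vLLM's implementation: `CUDAGraphMode` and `BatchDescriptor` here are simplified stand-ins with only the fields the guard needs, `should_skip_dispatch` and the free function `dispatch` are hypothetical helpers, and the fall-through branch is a placeholder for the real key lookup, which this diff does not show.

```python
from dataclasses import dataclass
from enum import Enum


class CUDAGraphMode(Enum):
    # Stand-in for vLLM's enum; only the members this sketch needs.
    NONE = 0
    PIECEWISE = 1
    FULL = 2


@dataclass(frozen=True)
class BatchDescriptor:
    # Stand-in: the real descriptor carries more fields than num_tokens.
    num_tokens: int


def should_skip_dispatch(
    keys_initialized: bool,
    cudagraph_mode: CUDAGraphMode,
    num_tokens: int,
    max_capture_size: int,
) -> bool:
    """Mirror the three-way guard from the diff (hypothetical helper)."""
    return (
        not keys_initialized
        or cudagraph_mode == CUDAGraphMode.NONE  # condition added by this commit
        or num_tokens > max_capture_size
    )


def dispatch(
    keys_initialized: bool,
    cudagraph_mode: CUDAGraphMode,
    num_tokens: int,
    max_capture_size: int,
) -> tuple[CUDAGraphMode, BatchDescriptor]:
    """Return (mode, descriptor); fall back to NONE when the guard fires."""
    if should_skip_dispatch(
        keys_initialized, cudagraph_mode, num_tokens, max_capture_size
    ):
        return CUDAGraphMode.NONE, BatchDescriptor(num_tokens)
    # Placeholder only: the real dispatcher looks up a captured graph key here.
    return cudagraph_mode, BatchDescriptor(num_tokens)
```

With this sketch, a dispatcher whose mode is `CUDAGraphMode.NONE` returns the fallback pair even when its keys are initialized and the batch fits within the capture size, which is the case the added condition covers.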