Commit a4cc292

Remove deprecated arg in vllm serve command (#1230)
Signed-off-by: dennis yeh <dennis.yeh@cienet.com>
1 parent 64693b0 commit a4cc292

1 file changed: +1 -1 lines changed

tests/e2e/benchmarking/mm_bench_recipe.sh

Lines changed: 1 addition & 1 deletion
@@ -78,7 +78,7 @@ checkThroughput() {
 }
 
 echo "Spinning up the vLLM server..."
-(vllm serve "$model_name" --tensor-parallel-size 1 --pipeline-parallel-size 1 --dtype bfloat16 --gpu-memory-utilization 0.98 --max-model-len "$max_model_len" --limit-mm-per-prompt '{"image": 10, "video": 0}' --mm-processor-kwargs '{"max_pixels": 1003520}' --guided-decoding-backend "xgrammar" --disable-chunked-mm-input 2>&1 | tee -a "$LOG_FILE") &
+(vllm serve "$model_name" --tensor-parallel-size 1 --pipeline-parallel-size 1 --dtype bfloat16 --gpu-memory-utilization 0.98 --max-model-len "$max_model_len" --limit-mm-per-prompt '{"image": 10, "video": 0}' --mm-processor-kwargs '{"max_pixels": 1003520}' --disable-chunked-mm-input 2>&1 | tee -a "$LOG_FILE") &
 
 
 # Run a busy loop to block until the server is ready to receive requests
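
The hunk ends at the comment introducing the readiness wait, and the loop itself is not shown. For readers unfamiliar with the pattern, the following is a minimal sketch of such a wait with a timeout. It is not the code in mm_bench_recipe.sh: the function name wait_until_ready, its arguments, and the probe command in the usage comment are all hypothetical, and the /health URL on port 8000 is an assumption about how the server is exposed in this setup.

```shell
# Hypothetical helper, not taken from mm_bench_recipe.sh.
# Usage: wait_until_ready PROBE_CMD TIMEOUT_SECONDS [INTERVAL_SECONDS]
# Re-runs PROBE_CMD until it exits 0 (ready) or the timeout elapses.
wait_until_ready() {
  _probe=$1
  _timeout=$2
  _interval=${3:-1}
  _start=$(date +%s)

  # Keep probing while the probe command fails.
  until sh -c "$_probe" >/dev/null 2>&1; do
    _now=$(date +%s)
    if [ $(( _now - _start )) -ge "$_timeout" ]; then
      return 1  # gave up: the server never became ready in time
    fi
    sleep "$_interval"
  done
  return 0
}

# Example call (assumed endpoint and port, adjust to the actual deployment):
# wait_until_ready 'curl -sf http://localhost:8000/health' 600 5 || exit 1
```

Taking the probe as a command string keeps the loop independent of how readiness is detected, so the same helper could check a log line with grep instead of an HTTP endpoint. Bounding the wait means a server that fails at startup, for example because it rejects an unrecognized argument, fails the benchmark quickly instead of hanging it.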
