Commit 894c747

fix imports in kv_cache tests
Signed-off-by: Juncheng Gu <jcgu@google.com>
1 parent ace918a commit 894c747

11 files changed, +6 -4 lines changed
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.

examples/gke/pod_tpu_commons_cpu_offload.yaml renamed to examples/offload/gke/pod_tpu_commons_cpu_offload.yaml

Lines changed: 1 addition & 1 deletion
@@ -13,7 +13,7 @@ spec:
  imagePullPolicy: Always # Uncomment to always pull the latest image for any dev work
  command:
  - python
- - /workspace/tpu_inference/examples/offline_inference_kv_cache.py
+ - /workspace/tpu_inference/examples/offload/offline_inference_kv_cache.py
  - --model=meta-llama/Llama-3.1-8B
  - --tensor_parallel_size=8
  - --max_model_len=1024

examples/gke/pod_tpu_commons_cpu_offload_verification.yaml renamed to examples/offload/gke/pod_tpu_commons_cpu_offload_verification.yaml

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@ spec:
  imagePullPolicy: Always
  command:
  - python
- - /workspace/tpu_inference/examples/offline_inference_kv_cache_verification.py
+ - /workspace/tpu_inference/examples/offload/offline_inference_kv_cache_verification.py
  - --model=meta-llama/Llama-3.1-8B
  - --tensor_parallel_size=8
  - --max_model_len=1024
File renamed without changes.

examples/offline_inference_kv_cache.py renamed to examples/offload/offline_inference_kv_cache.py

Lines changed: 1 addition & 1 deletion
@@ -5,7 +5,7 @@

  import vllm.envs as envs
  from vllm import LLM, EngineArgs
- from vllm.utils import FlexibleArgumentParser
+ from vllm.utils.argparse_utils import FlexibleArgumentParser


  def create_parser():
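
The hunk above moves the import of `FlexibleArgumentParser` from `vllm.utils` to `vllm.utils.argparse_utils`. If a script has to run against installations exposing either location, one possible approach is a small fallback import. This is only a sketch: the helper name `import_first_available` is hypothetical and is not part of vLLM or of this commit; the two module paths are the ones shown in the diff.

```python
import importlib


def import_first_available(name, module_paths):
    """Return attribute `name` from the first module in `module_paths` that provides it.

    Hypothetical helper for illustration; not part of vLLM or this commit.
    """
    errors = []
    for path in module_paths:
        try:
            module = importlib.import_module(path)
            return getattr(module, name)
        except (ImportError, AttributeError) as exc:
            # Record why this candidate failed, then try the next one.
            errors.append(f"{path}: {exc}")
    raise ImportError(f"could not import {name!r}; tried: " + "; ".join(errors))


# Usage (requires vLLM to be installed, so it is left commented out here):
# FlexibleArgumentParser = import_first_available(
#     "FlexibleArgumentParser",
#     ["vllm.utils.argparse_utils", "vllm.utils"],  # new location first, then old
# )
```

The commit itself simply switches to the new path; a fallback like this only makes sense if both layouts must be supported at once.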
