Skip to content

Commit 12c4885

Browse files
committed
update ci tests
Signed-off-by: Juncheng Gu <jcgu@google.com>
1 parent 560caf8 commit 12c4885

File tree

5 files changed

+5
-49
lines changed

5 files changed

+5
-49
lines changed

.buildkite/features/KV_Cache_Host_Offloading.yml

Lines changed: 0 additions & 45 deletions
This file was deleted.

.buildkite/features/KV_Cache_Offload.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ steps:
99
commands:
1010
- |
1111
.buildkite/scripts/run_in_docker.sh \
12-
python3 -m pytest -s -v /workspace/tpu_inference/tests/distributed/offload/
12+
python3 -m pytest -s -v /workspace/tpu_inference/tests/distributed/offload/tpu_offload_accuracy_test.py
1313
- label: "Record correctness test result for KV Cache Offload"
1414
key: "record_KV_Cache_Offload_CorrectnessTest"
1515
depends_on: "KV_Cache_Offload_CorrectnessTest"

.buildkite/pipeline_jax.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -122,6 +122,7 @@ steps:
122122
--ignore=/workspace/tpu_inference/tests/e2e \
123123
--ignore=/workspace/tpu_inference/tpu_inference/mock \
124124
--ignore=/workspace/tpu_inference/tests/layers/vllm/test_compressed_tensors_moe.py \
125+
--ignore=/workspace/tpu_inference/tests/distributed/offload/test_offload_accuracy_test.py \
125126
--cov-config=/workspace/tpu_inference/.coveragerc --cov tpu_inference --cov-report term-missing --cov-fail-under=69
126127
127128
- label: "JAX unit tests - kernels"

tests/distributed/offload/tpu_offload_accuracy_test.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -60,7 +60,6 @@ def _test_kv_cache_cpu_offloading_accuracy(
6060
os.environ['TPU_OFFLOAD_DECODE_SAVE'] = decode_save
6161
llm = LLM(model="meta-llama/Llama-3.2-3B",
6262
max_model_len=1024,
63-
tensor_parallel_size=8,
6463
task="generate",
6564
kv_transfer_config=kv_transfer_config)
6665

tests/distributed/offload/tpu_offload_connector_worker_test.py

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -72,9 +72,10 @@ def setUp(self):
7272
self.num_blocks = 128
7373
self.num_cpu_chunks = 128
7474
self.block_size = self.vllm_config.cache_config.block_size
75-
self.num_heads = 8
75+
num_devices = len(list(jax.devices()))
76+
self.num_heads = num_devices
7677
self.head_size = 128
77-
self.mesh = self.create_mesh((1, 8), ("data", "model"))
78+
self.mesh = self.create_mesh((1, num_devices), ("data", "model"))
7879
if self.mesh is None:
7980
self.skipTest("Cannot create mesh. Must be run on a TPU node.")
8081
return

0 commit comments

Comments
 (0)