Skip to content

Commit a3ff52b

Browse files
committed
config pre-mapped buffer of tpu
Signed-off-by: Juncheng Gu <jcgu@google.com>
1 parent 8b79f68 commit a3ff52b

File tree

1 file changed

+6
-0
lines changed

1 file changed

+6
-0
lines changed

examples/offload/gke/benchmarks/deploy-cpu-offload.yaml

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,12 @@ spec:
3434
value: "4096"
3535
- name: TPU_OFFLOAD_NUM_STAGING_BLOCKS
3636
value: "256"
37+
# config the pre-mapped CPU buffer for TPUs
38+
# https://docs.cloud.google.com/tpu/docs/performance-guide#tpu_model_performance
39+
- name: TPU_PREMAPPED_BUFFER_SIZE
40+
value: "68719476736" # 64 GB
41+
- name: TPU_PREMAPPED_BUFFER_TRANSFER_THRESHOLD_BYTES
42+
value: "68719476736" # 64 GB
3743
ports:
3844
- containerPort: 8000
3945
resources:

0 commit comments

Comments
 (0)