Pull requests: vllm-project/tpu-inference

Pull requests list

Add disagg test to v6e-8 queue
#1259 opened Dec 6, 2025 by sixiang-google
[Kernel][FusedMoE] Fix MoE crash and hang issues (label: ready; ONLY add when PR is ready to merge/full CI is needed)
#1252 opened Dec 5, 2025 by bythew3i
docs: update support matrices and improve visuals
#1250 opened Dec 5, 2025 by RobMulla
Avoid installing CUDA related stuff
#1246 opened Dec 4, 2025 by wdhongtw
Reduce image size and enhance caching
#1245 opened Dec 4, 2025 by wdhongtw
[Bug fix] KV cache quantization type casting (label: ready)
#1244 opened Dec 4, 2025 by wenxindongwork
update run_in_docker script for running on local env (label: ready)
#1243 opened Dec 4, 2025 by ernie-chang Draft
Verify vllm-tpu python package (draft) (label: ready)
#1241 opened Dec 4, 2025 by ylangtsou Draft
Remove a branch with pl.when in fetching bkv (label: ready)
#1239 opened Dec 4, 2025 by rupengliu-meta
[CI] Fix awq dtype (label: ready)
#1220 opened Dec 2, 2025 by kyuyeunk
[Oncall] update the SchedulerConfig interface
#1219 opened Dec 2, 2025 by bzgoogle
Add a SP e2e test.
#1209 opened Dec 2, 2025 by vanbasten23
[RPA] Pipeline flash attention in default kernel (label: ready)
#1203 opened Dec 1, 2025 by jrplatin
Save size in scalar scratch for bo and bq
#1201 opened Dec 1, 2025 by rupengliu-meta
Update README.md
#1197 opened Nov 27, 2025 by bvrockwell
[Qwix/Flax] Upgrade to Flax 0.12.0 + Qwix 0.1.4
#1170 opened Nov 25, 2025 by jrplatin
[do not merge] test status check POC (label: ready)
#1168 opened Nov 25, 2025 by khluu
[Feat][TPU Offload] KV cache offload to local cpu buffer (label: ready)
#1163 opened Nov 24, 2025 by juncgu-google
DP support for GPT OSS
#1096 opened Nov 13, 2025 by wenxindongwork Draft
Enable Pipeline Parallelism on Jax models (label: ready)
#1077 opened Nov 12, 2025 by Chenyaaang
1 of 8 tasks
Exposes graphdef for flax models.
#1059 opened Nov 10, 2025 by wang2yn84