Commit 16722b9
committed
remove files
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
cleanup
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
cleanup
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
review comment
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
remove dead code
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
cleanup
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
fix doc error
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
cleanup
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
wip
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
clean-up
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
cleanup
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
cleanup
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
wip
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
pad ubatches
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
test fixes
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
fix CPU backend
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
fix typo
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Update vllm/v1/worker/gpu_model_runner.py
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
format
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>1 parent 02ccc8d commit 16722b9
File tree
12 files changed
+257
-5792
lines changed- docs/design
- gsm8k-results-pr
- llama3-8b-pad-before-metadata-flashinfer/meta-llama__Meta-Llama-3-8B-Instruct
- llama3-8b-pad-before-metadata/meta-llama__Meta-Llama-3-8B-Instruct
- tests/v1/cudagraph
- vllm
- v1
- attention/backends
- worker
12 files changed
+257
-5792
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
89 | 89 | | |
90 | 90 | | |
91 | 91 | | |
92 | | - | |
| 92 | + | |
93 | 93 | | |
94 | 94 | | |
95 | 95 | | |
| |||
Lines changed: 0 additions & 160 deletions
This file was deleted.
0 commit comments