|
1 | 1 | Feature,CorrectnessTest,PerformanceTest |
2 | | -"Collective Communication Matmul",✅,N/A |
3 | | -"Prefix Caching",✅,✅ |
4 | | -"Multimodal Inputs",✅,✅ |
5 | | -"Quantized Matmul Attention and KV Cache",✅,✅ |
6 | 2 | "Chunked Prefill",✅,✅ |
7 | | -"JAX-Path Qxix Quantization",✅,✅ |
| 3 | +"DCN-based P/D disaggregation",to be added,to be added |
| 4 | +"KV cache host offloading",to be added,to be added |
| 5 | +"Llama 4 Maverick",to be added,to be added |
| 6 | +"LoRA_Torch",✅,to be added |
| 7 | +"Multimodal Inputs",✅,✅ |
| 8 | +"Out-of-tree model support",✅,✅ |
| 9 | +"Prefix Caching",✅,✅ |
8 | 10 | "Single Program Multi Data",✅,✅ |
| 11 | +"Speculative Decoding: Eagle3",✅,✅ |
9 | 12 | "Speculative Decoding: Ngram",✅,✅ |
10 | | -"Structured Decoding",✅,N/A |
11 | | -"Ragged Paged Attention V3",✅,✅ |
| 13 | +"async scheduler",✅,✅ |
| 14 | +"runai_model_streamer_loader",✅,N/A |
| 15 | +"sampling_params",✅,N/A |
| 16 | +"structured_decoding",✅,N/A |
0 commit comments