
[CUDA] Fuse MoE router bias into MatMulNBits GEMV #29170

Merged
kunal-vaishnavi merged 12 commits into main from tlwu/20260619/qmoe_router_bias_gemv on Jun 24, 2026

docs+test: align router GEMV scope with code; cover block_size 64

9b23007
Azure Pipelines / Linux Android Emulator QNN CI Pipeline succeeded Jun 23, 2026 in 13m 51s

Build #20260623.1 succeeded