-
Notifications
You must be signed in to change notification settings - Fork 91
Pull requests: jd-opensource/xllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
bugfix: fix the issue of ineffective input embedding transmission.
#490
opened Dec 5, 2025 by
magicheng0816
Loading…
refactor: separate the weight loading in the npu layer class.
#489
opened Dec 5, 2025 by
Clement-Wang26
Loading…
feat: support iluvatar backend qwen3 0.6b run through
ilu
#481
opened Dec 4, 2025 by
laneeeee
Loading…
feat: add wrappers for ATB and ACLNN fused operators.
#474
opened Dec 2, 2025 by
yingxudeng
Loading…
refactor: separate mlu and cuda version Qwen model implementation.
cuda
#468
opened Dec 1, 2025 by
XuZhang99
Loading…
refactor: optimize unique token count preparation of batch input builder.
#449
opened Nov 27, 2025 by
RobbieLeung
Loading…
[WIP] feat: support loading model weights and forward overlap.
#441
opened Nov 26, 2025 by
Clement-Wang26
Loading…
feat: support Qwen2-VL & GME-Qwen2-VL model on npu device.
#399
opened Nov 18, 2025 by
xanecdotex
Loading…
feat: enable torch_npu graph mode for Qwen-3 dense with TP support.
#325
opened Nov 6, 2025 by
yingxudeng
Loading…
ProTip!
Follow long discussions with comments:>50.