jd-opensource / xllm Public

Notifications You must be signed in to change notification settings
Fork 91
Star 784

Code
Issues 39
Pull requests 18
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: jd-opensource/xllm

Labels 14 Milestones 0

New pull request New

18 Open 387 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

bugfix: fix the issue of ineffective input embedding transmission.

#490 opened Dec 5, 2025 by magicheng0816

Loading…

refactor: separate the weight loading in the npu layer class.

#489 opened Dec 5, 2025 by Clement-Wang26

Loading…

bugfix: fix core dump of large beam width.

#488 opened Dec 5, 2025 by RobbieLeung

Loading…

feat: optimize prefetch from kv cache store.

#486 opened Dec 4, 2025 by Kang-Meng

Loading…

feat: set same random seed for all worker.

#483 opened Dec 4, 2025 by RobbieLeung

Loading…

feat: support iluvatar backend qwen3 0.6b run through ilu

#481 opened Dec 4, 2025 by laneeeee

Loading…

Add constrained decoding

#480 opened Dec 3, 2025 by magicheng0816

Loading…

feat: support new model glm4.

#477 opened Dec 3, 2025 by DongheJin

Loading…

feat: add wrappers for ATB and ACLNN fused operators.

#474 opened Dec 2, 2025 by yingxudeng

Loading…

feat: add mm embedding model and its factory.

#471 opened Dec 2, 2025 by dongxianzhe

Loading…

feat: support prefix cache for deepseek-v3/r1 models.

#470 opened Dec 1, 2025 by DongheJin

Loading…

refactor: separate mlu and cuda version Qwen model implementation. cuda

#468 opened Dec 1, 2025 by XuZhang99

Loading…

feat: support deepseek mtp on mlu. mlu

#454 opened Nov 28, 2025 by a120092009

Loading…

refactor: optimize unique token count preparation of batch input builder.

#449 opened Nov 27, 2025 by RobbieLeung

Loading…

[WIP] feat: support loading model weights and forward overlap.

#441 opened Nov 26, 2025 by Clement-Wang26

Loading…

feat: support Qwen2-VL & GME-Qwen2-VL model on npu device.

#399 opened Nov 18, 2025 by xanecdotex

Loading…

feat: enable torch_npu graph mode for Qwen-3 dense with TP support.

#325 opened Nov 6, 2025 by yingxudeng

Loading…

【WIP】feat: add rec framwork.

#305 opened Oct 31, 2025 by DragonFive

Loading…

3 of 5 tasks

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!