Skip to content

Pull requests: deepspeedai/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Reduce blockDim in fake_quantize_kernel for improved SM occupancy
#8115 opened Jul 2, 2026 by flutist Contributor Loading…
Add Ulysses SP support for FLA gated-delta context parallelism
#8114 opened Jul 2, 2026 by xylian86 Collaborator Loading…
Fix ZeRO-3 autocast gather with mixed parameter dtypes
#8113 opened Jul 2, 2026 by tohtana Collaborator Loading…
Make DCO workflow Probot compatible
#8110 opened Jul 1, 2026 by tohtana Collaborator Loading…
fix: use local ev_values and wrap dict.values() in list()
#8087 opened Jun 23, 2026 by hashwnath Loading…
3 tasks done
Add AutoEP + AutoTP parallel folding
#8064 opened Jun 13, 2026 by tohtana Collaborator Loading…
feat(zenflow): run the overlapped CPU optimizer in a native process
#8058 opened Jun 10, 2026 by Antlera Collaborator Loading…
Add On-Policy Distillation (OPSD) Trainer backend in DeepSpeed
#8027 opened May 26, 2026 by PKUWZP Collaborator Loading…
4 of 5 tasks
Add Qwen 3.5 preset to AutoTP
#7978 opened Apr 16, 2026 by tohtana Collaborator Draft
Refactor/torch autocast encapsulate global state
#7946 opened Apr 2, 2026 by nathon-lee Contributor Loading…
Add torch_xla TPU support for ZeRO-1/2
#7917 opened Mar 21, 2026 by PKUWZP Collaborator Loading…
doc: Remove suggestion to build extensions in parallel
#7899 opened Mar 12, 2026 by Flamefire Contributor Loading…
ProTip! Add no:assignee to see everything that’s not assigned.