Skip to content

Pull requests: deepspeedai/DeepSpeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Normalize ZeRO-3 DeepCompile grad dtype before reduction
#8038 opened May 30, 2026 by tohtana Collaborator Loading…
Fix DeepCompile ZeRO-1 grad target lifetime
#8036 opened May 29, 2026 by tohtana Collaborator Draft
Enable bf16 check_grad_overflow by default (matching fp16)
#8035 opened May 29, 2026 by yongzhe-wang Loading…
2 tasks done
Stop obsolete CI jobs on workflow cancellation
#8034 opened May 28, 2026 by tohtana Collaborator Loading…
Fix DeepCompile ZeRO-3 release parameter lifetime
#8032 opened May 28, 2026 by tohtana Collaborator Loading…
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027 opened May 26, 2026 by PKUWZP Collaborator Loading…
3 of 5 tasks
Add Qwen 3.5 preset to AutoTP
#7978 opened Apr 16, 2026 by tohtana Collaborator Draft
Fix/warnings stacklevel mvapich runner
#7949 opened Apr 2, 2026 by nathon-lee Contributor Draft
Refactor/torch autocast encapsulate global state
#7946 opened Apr 2, 2026 by nathon-lee Contributor Loading…
Add AutoEP
#7938 opened Mar 31, 2026 by tohtana Collaborator Loading…
Add torch_xla TPU support for ZeRO-1/2
#7917 opened Mar 21, 2026 by PKUWZP Collaborator Loading…
doc: Remove suggestion to build extensions in parallel
#7899 opened Mar 12, 2026 by Flamefire Contributor Loading…
ProTip! Updated in the last three days: updated:>2026-05-27.