forked from NVIDIA/Megatron-LM
-
Notifications
You must be signed in to change notification settings - Fork 36
Pull requests: ROCm/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable Qwen3 and Deepseekv3 mxfp8 Training
#134
opened May 25, 2026 by
sudhu2k
Collaborator
Loading…
MXFP8 training fixes for Megatron-FSDP, Torch-FSDP, MoE, FP8 Parameter initialization
#130
opened May 15, 2026 by
sudhu2k
Collaborator
Loading…
ProTip!
What’s not been updated in a month: updated:<2026-05-01.