Pull requests: NVIDIA/Megatron-LM
docs: improve documentation organization and add additional guides
docs-only
documentation only (docs or docstrings)
Expert Review
Apply this label to indicate that your PR is ready for expert review.
#2671
opened Dec 16, 2025 by
sbhavani
1 of 6 tasks
[dev] feat(moe): Support MLA CP exchanging latent KV
#2670
opened Dec 16, 2025 by
yuzhongw-nvidia
Draft
6 tasks
[docs] Add ability to disable autodoc2 for local builds
docs-only
#2669
opened Dec 15, 2025 by
Phlip79
6 tasks
Prep work for migrating to types from ModuleSpec
community-request
#2668
opened Dec 15, 2025 by
nschank
2 of 6 tasks
fix hang issue for pp>1 vpp>1 and variable_seq_lengths==True
Expert Review
#2664
opened Dec 15, 2025 by
shifangx
6 tasks
add all_gather process-group for overlapping in fsdp distributed training
#2663
opened Dec 15, 2025 by
jeffnvidia
Draft
6 tasks
feat: Add VPP training simulation framework for performance profiling
community-request
#2659
opened Dec 14, 2025 by
lsy643
6 tasks
[MAIN][NVFP4][MOE] 128 Zero Padding for Grouped Quantization kernels and Cuda Graph Support
#2655
opened Dec 13, 2025 by
zhongbozhu
6 tasks
Move full model init to cuda stream to avoid race condition leading to empty parameters in DDP
#2652
opened Dec 12, 2025 by
jstjohn
6 tasks
[main] feat(moe): Support packed sequence for gated delta net (GDN)
#2645
opened Dec 12, 2025 by
yuzhongw-nvidia
Draft
6 tasks
[dev] feat(moe): Support packed sequence for gated delta net (GDN)
#2644
opened Dec 12, 2025 by
yuzhongw-nvidia
6 tasks
[Main] Feat(moe): Gated delta net context parallel (CP)
#2642
opened Dec 12, 2025 by
yuzhongw-nvidia
Draft
6 tasks
[Dev] Fix CUDA RNG Tracker
core_dev_r0.15.0
dev branch
Dev branch related issues and development
Expert Review
Final Review
Apply this label to indicate that your PR is ready for final review.
Print more verbose error message about incorrect model_parallel_size.
community-request
#2639
opened Dec 12, 2025 by
rj42
6 tasks