-
Notifications
You must be signed in to change notification settings - Fork 635
Pull requests: pytorch/torchtune
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Gemma2] Use nn.SDPA via MultiHeadAttention
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Applies self.act_fn (vs. hardcoded F.silu) in GroupedExperts.forward
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2838
opened Jun 20, 2025 by
athms
Loading…
2 of 13 tasks
Add DCP async checkpointing info to docs
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2837
opened Jun 18, 2025 by
ankitageorge
Loading…
11 tasks
Add DCP async checkpointing for lora dpo recipe
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2835
opened Jun 18, 2025 by
ankitageorge
Loading…
2 of 10 tasks
[DO NOT MERGE] fix main
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2832
opened Jun 16, 2025 by
joecummings
Loading…
[DONT MERGE] Debug branch for the Qwen3 + full model compile
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2831
opened Jun 16, 2025 by
anijain2305
Loading…
skip compiling opt step instead of erroring if opt_in_bwd=True
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2827
opened Jun 13, 2025 by
felipemello1
Loading…
1 of 4 tasks
raise error if fsdp_cpu_offload + opt_in_bwd
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2826
opened Jun 13, 2025 by
felipemello1
Loading…
1 of 4 tasks
Fix #2809, modify attention
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2822
opened Jun 12, 2025 by
krammnic
Loading…
2 of 13 tasks
[WIP] Qwen3 MoE support
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2820
opened Jun 12, 2025 by
intervitens
•
Draft
6 tasks
[RFC] on-the-fly packing
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2819
opened Jun 12, 2025 by
felipemello1
Loading…
[WIP] Integrate OptimizerInBackward into SFT distributed recipe
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2818
opened Jun 11, 2025 by
joecummings
•
Draft
Test alignment of shared methods in recipes
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2807
opened Jun 9, 2025 by
Andrei-Aksionov
•
Draft
Integrate Muon optimizer (2725)
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2803
opened Jun 8, 2025 by
Saurabh750
Loading…
1 of 13 tasks
Fix command in config
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2796
opened Jun 6, 2025 by
krammnic
Loading…
1 of 13 tasks
[WIP] Proper tool calling support in the torchtune
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2794
opened Jun 6, 2025 by
krammnic
Loading…
2 of 13 tasks
[RFC] Reward modeling
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2788
opened Jun 5, 2025 by
krammnic
Loading…
[RFC] Iterable Dataset
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2785
opened Jun 4, 2025 by
felipemello1
Loading…
Enable loss parallel, Ungate FP8
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2782
opened Jun 3, 2025 by
nathan-az
Loading…
1 of 13 tasks
Ungate FP8 + TP
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
[NOT FOR REVIEW] Full knowledge distillation recipe TP + FP8
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
[WIP] DSV3
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2764
opened May 27, 2025 by
SalmanMohammadi
•
Draft
2 of 7 tasks
Add This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
LRScheduler.state_dict()
to checkpoints
CLA Signed
#2762
opened May 23, 2025 by
omkar-334
Loading…
2 of 13 tasks
[WIP][DEBUG] llama4 debugging
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2756
opened May 21, 2025 by
IvanKobzarev
Loading…
Fixing counting number of batches for accumulation through epoch
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2745
opened May 17, 2025 by
wesbz
Loading…
7 of 13 tasks
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.