[tx] Make sharding explicit in LoRA constructors #997
pcmoritz merged 4 commits into NovaSky-AI:main
Conversation
Code Review
This pull request refactors the LoRA layer constructors (LoRAEmbed, LoRALinear, LoRAExpert) to explicitly accept a sharding parameter. This change streamlines the sharding configuration by removing the need for nnx.with_partitioning calls at the instantiation sites within the model definitions (deepseekv3.py, llama3.py, qwen3.py) and eliminates the associated runtime assertions. The refactoring is applied consistently across all affected files, improving clarity and control over sharding for LoRA layers. All changes are well implemented and align with the stated objective of making sharding explicit.
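For context, here is a minimal sketch of the before/after shape of the change. The class, parameter names, shapes, and axis names are illustrative assumptions, not the repository's actual code:

```python
import jax
import jax.numpy as jnp
from flax import nnx


class LoRALinear(nnx.Module):
    """Illustrative LoRA adapter; not the repository's implementation."""

    def __init__(self, in_features: int, out_features: int, rank: int,
                 sharding: tuple, *, rngs: nnx.Rngs):
        init = jax.nn.initializers.lecun_normal()
        # New style: the sharding tuple is attached directly to the LoRA
        # parameters here, so call sites in the model files no longer wrap
        # initializers in nnx.with_partitioning.
        self.lora_a = nnx.Param(
            init(rngs.params(), (in_features, rank)),
            sharding=(sharding[0], None),
        )
        self.lora_b = nnx.Param(
            init(rngs.params(), (rank, out_features)),
            sharding=(None, sharding[1]),
        )

    def __call__(self, x: jax.Array) -> jax.Array:
        return x @ self.lora_a.value @ self.lora_b.value


# Instantiation site inside a model file: the sharding is now passed
# explicitly instead of being configured via nnx.with_partitioning.
layer = LoRALinear(1024, 4096, rank=8,
                   sharding=("fsdp", "tp"), rngs=nnx.Rngs(0))
y = layer(jnp.ones((2, 1024)))
```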
raulchen
left a comment
we can define constants for the shardings, as they are repeated.
Do you mean across models? Within models there is generally not a lot of repetition, only a little bit, and it is also nice that each tensor has its sharding explicitly right next to it. I'm going to merge this for now so we can unblock the other PR, but we can think about whether there is a better way to structure this to get rid of repetition (and avoid forgetting certain shardings) across models. Let me know if you have some ideas :)
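One possible shape of that suggestion, as a hypothetical sketch; the constant names and axis names are assumptions, not anything in the repository:

```python
# Hypothetical shared constants for the recurring sharding tuples; each model
# file would import and pass these instead of repeating the literals.
EMBED_SHARDING = (None, "tp")        # vocab replicated, hidden dim split over tensor parallel
ATTN_PROJ_SHARDING = ("fsdp", "tp")  # weights split over the FSDP and tensor-parallel axes
MLP_DOWN_SHARDING = ("tp", "fsdp")   # transposed layout for the down projection

# A call site would then read e.g.
#   LoRALinear(..., sharding=ATTN_PROJ_SHARDING, rngs=rngs)
# at the cost of the sharding no longer sitting literally next to each tensor,
# which is the trade-off discussed above.
```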
This is in preparation for merging #996, so we don't need to depend on the jax tracer. It is also slightly cleaner this way and the assert is not needed any more, since the error is "defined away".
It also adds the FSDP sharding for llama3.
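As a rough illustration of what the FSDP sharding involves, the mesh shape and axis names below are assumptions for illustration, not the repository's actual configuration:

```python
import jax
from jax.experimental import mesh_utils
from jax.sharding import Mesh

# A device mesh with an "fsdp" axis alongside the tensor-parallel axis; the
# explicit sharding tuples passed to the LoRA constructors can then name it.
devices = mesh_utils.create_device_mesh((jax.device_count(), 1))
mesh = Mesh(devices, axis_names=("fsdp", "tp"))

# With such a mesh in scope, a llama3 LoRA projection might be constructed as
#   LoRALinear(hidden, hidden, rank, sharding=("fsdp", "tp"), rngs=rngs)
# so its weights are sharded over the FSDP axis in addition to tensor parallel.
```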