Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[P0] Enabling FSDP with pyvene backbone change #152

Merged
merged 2 commits into from
Feb 3, 2025
Merged

[P0] Enabling FSDP with pyvene backbone change #152

merged 2 commits into from
Feb 3, 2025

Conversation

frankaging
Copy link
Collaborator

Descriptions:

Ref: stanfordnlp/pyvene#204

The base intervenable class does not support DPP or FSDP. One reason is that the interventions are saved in a python-native dict, rather than a torch-native dict which makes DP wrappers unaware of params of interventions.

This is the first change to implement and validate pyvene backbone is compatible with DPP and FSDP. This change is crucial for the pyreft library to work with larger models, especially when inference-time # token is being scaled.

@frankaging frankaging merged commit da4570e into main Feb 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant