Only load Verifier model if attachment_mode is 'full' #154

fynnsu · 2025-10-23T21:47:26Z

Currently, if verifier_attachment_mode is "full" or "train_only" we load the full verifier model on init.

However, for the "train_only" case, we want to only load the parts of the model that are needed. The logic to do so, is meant to reside in the subclasses of SpeculatorModel.

This change, makes it so that we don't auto-load the full verifier model if "train_only" is selected.
Additional changes:

SpeculatorModel.attach_verifier() no longer returns the loaded verifier (since it might not exist)
EagleSpeculator.attach_verifier() directly calls resolve_verifier() to load the verifier model and extract its components.
test_speculator_model_attach_verifier_invalid needed to be updated to reset the model between subtests because otherwise the verifier_attachment_mode gets set and kept by the previous try-excepts blocks.

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

github-actions · 2025-10-23T21:50:03Z

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/vllm-project/speculators/actions/runs/18918018372/artifacts/4409360027.
They will be retained for up to 30 days.
Commit: 2b17856

shanjiaz

cool

brian-dellabetta

bummer that we have to have all those # type: ignore[assignment,attr-defined] comments

fynnsu · 2025-10-24T19:59:56Z

bummer that we have to have all those # type: ignore[assignment,attr-defined] comments

Yeah I'm honestly not too sure what the deal with those is. We might want to try removing them at some point, but it's also possible this eagle3 file will be removed in favor of the new training one

Only load Verifier model if attachment_mode is 'full'

1adfa90

Signed-off-by: Fynn Schmitt-Ulms <[email protected]>

shanjiaz approved these changes Oct 24, 2025

View reviewed changes

brian-dellabetta approved these changes Oct 24, 2025

View reviewed changes

fynnsu added 2 commits October 24, 2025 16:02

Merge branch 'main' into fynnsu/reduce_verifier_load

74da521

Merge branch 'main' into fynnsu/reduce_verifier_load

2b17856

fynnsu enabled auto-merge (squash) October 29, 2025 18:18

fynnsu merged commit 5bb69ec into main Oct 29, 2025
13 checks passed

fynnsu deleted the fynnsu/reduce_verifier_load branch October 29, 2025 18:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Only load Verifier model if attachment_mode is 'full' #154

Only load Verifier model if attachment_mode is 'full' #154

Uh oh!

fynnsu commented Oct 23, 2025

Uh oh!

github-actions bot commented Oct 23, 2025 •

edited

Loading

Uh oh!

shanjiaz left a comment

Uh oh!

brian-dellabetta left a comment

Uh oh!

fynnsu commented Oct 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Only load Verifier model if attachment_mode is 'full' #154

Only load Verifier model if attachment_mode is 'full' #154

Uh oh!

Conversation

fynnsu commented Oct 23, 2025

Uh oh!

github-actions bot commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shanjiaz left a comment

Choose a reason for hiding this comment

Uh oh!

brian-dellabetta left a comment

Choose a reason for hiding this comment

Uh oh!

fynnsu commented Oct 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions bot commented Oct 23, 2025 •

edited

Loading