[WIP] [Transform] q_attn and k_cache locations #334

kylesayrs · 2025-05-31T05:49:00Z

Because attention is very standardized in transformers via the AttentionInterface, this provides a convenient way to hook into attention and apply transforms

Signed-off-by: Kyle Sayers <[email protected]>

kylesayrs changed the base branch from main to kylesayrs/transform_factory May 31, 2025 05:49

Base automatically changed from kylesayrs/transform_factory to main June 10, 2025 15:24

wip concept

da36ca6

Signed-off-by: Kyle Sayers <[email protected]>

kylesayrs force-pushed the kylesayrs/transform_attn_locations branch from 96dee4e to da36ca6 Compare June 12, 2025 04:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] [Transform] q_attn and k_cache locations #334

[WIP] [Transform] q_attn and k_cache locations #334

Uh oh!

kylesayrs commented May 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

[WIP] [Transform] q_attn and k_cache locations #334

Are you sure you want to change the base?

[WIP] [Transform] q_attn and k_cache locations #334

Uh oh!

Conversation

kylesayrs commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

kylesayrs commented May 31, 2025 •

edited

Loading