Skip to content

2D/3D Mask Attention issue #2257

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

2D/3D Mask Attention issue #2257

wants to merge 1 commit into from

Conversation

Cemberk
Copy link

@Cemberk Cemberk commented Jun 9, 2025

Issue seen during Huggingface Transformers, Cohere2 model tests during the test test_generation_beyond_sliding_window_0_flash_attention_2

Minimal Reproducer:

import torch
import torch.nn.functional as F

query = torch.randn(2, 32, 128, 64, device='cuda', dtype=torch.float16)
key = torch.randn(2, 32, 128, 64, device='cuda', dtype=torch.float16) 
value = torch.randn(2, 32, 128, 64, device='cuda', dtype=torch.float16)

mask = torch.ones(2, 128, device='cuda', dtype=torch.bool)

output = F.scaled_dot_product_attention(query, key, value, attn_mask=mask, is_causal=True)

@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Jun 9, 2025

Jenkins build for 70f58b228561dd635e742485616d0665d5e0f403 commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant