Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Simplify GPU reset handling #17646

Merged
merged 5 commits into from
Mar 28, 2025
Merged

[CI] Simplify GPU reset handling #17646

merged 5 commits into from
Mar 28, 2025

Conversation

sarnex
Copy link
Contributor

@sarnex sarnex commented Mar 25, 2025

Just reset all i915 and xe devices if they exist instead of having to specify it in every workflow call.

Also I think the existing code to reset i915 devices wasn't actually working because you need sudo to access the /sys/kernel/debug directory in any way, it's fixed now.

@sarnex sarnex temporarily deployed to WindowsCILock March 25, 2025 20:34 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 25, 2025 20:35 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 25, 2025 20:51 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 25, 2025 20:51 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 25, 2025 21:01 — with GitHub Actions Inactive
@sarnex sarnex requested a deployment to WindowsCILock March 26, 2025 14:37 — with GitHub Actions In progress
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 14:47 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 15:56 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 19:56 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 20:32 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 20:32 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 20:48 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 20:48 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 26, 2025 21:01 — with GitHub Actions Inactive
Signed-off-by: Sarnie, Nick <[email protected]>
Signed-off-by: Sarnie, Nick <[email protected]>
Signed-off-by: Sarnie, Nick <[email protected]>
@sarnex sarnex temporarily deployed to WindowsCILock March 28, 2025 14:22 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 28, 2025 14:22 — with GitHub Actions Inactive
@sarnex sarnex temporarily deployed to WindowsCILock March 28, 2025 15:28 — with GitHub Actions Inactive
@sarnex sarnex marked this pull request as ready for review March 28, 2025 15:31
@sarnex sarnex requested a review from a team as a code owner March 28, 2025 15:31
@sarnex sarnex requested a deployment to WindowsCILock March 28, 2025 15:34 — with GitHub Actions In progress
Copy link
Contributor

@aelovikov-intel aelovikov-intel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot!

@sarnex sarnex merged commit 8d34f2a into intel:sycl Mar 28, 2025
28 of 32 checks passed
mateuszpn pushed a commit to mateuszpn/llvm that referenced this pull request Mar 31, 2025
Just reset all i915 and xe devices if they exist instead of having to
specify it in every workflow call.

Also I think the existing code to reset i915 devices wasn't actually
working because you need sudo to access the `/sys/kernel/debug`
directory in any way, it's fixed now.

---------

Signed-off-by: Sarnie, Nick <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants