start merging early thread exit handling #1584

jcosborn · 2025-06-20T23:19:28Z

This covers all files except the dslashes. Those will be merged later to keep the complexity of this PR down.

jcosborn · 2025-07-03T17:53:46Z

@maddyscientist As for simplifying the calls to dslash.template operator() in dslash_helper.cuh, actually only the dslashes that needed the allthreads handling have been updated to take the extra parameters, the others can't currently be called that way. We could simplify the calling code if we made all dslashes have the same interface, but I haven't done that mainly to reduce the number of changes. I can easily implement that if you prefer it.

maddyscientist · 2025-07-07T21:20:03Z

@jcosborn for those QUDA developers not on the portability calls (who are reviewing this PR) can you describe why these changes are needed?

jcosborn · 2025-07-08T16:55:44Z

The early thread exit handling changes are to support programming models (like SYCL) which only support block collectives when all threads in a block are active (non-exited). The changes allow targets to have all threads enter a kernel functor to participate in the block collectives when block collectives are used in a kernel. For these kernels, instead of exiting when a thread is determined to be out-of-bounds for the kernel, all threads can enter the kernel, and the out-of-bounds ones will be marked inactive with an extra argument. Only kernels that need this handling need any changes. The kernel functors that need this handling have an extra template parameter allthreads which is true when it is being called with all threads entering (with some possibly out-of-bounds), and when false the functor should behave exactly as before. There is also an extra functor argument active which specifies if the thread is in-bounds (active) or not. When allthreads is true, the modified kernels then need to ensure that no out-of-bounds memory accesses occur from threads that aren't active, and also ensure that all threads (active or not) participate in the collectives.

start merging early thread exit handling

aab7d0d

jcosborn requested review from a team as code owners June 20, 2025 23:19

jcosborn added 3 commits June 20, 2025 18:21

revert change

52ad846

revert inadvertent changes

73df3df

remove gridsize setting

7edea3b

jcosborn added 5 commits July 8, 2025 12:58

Merge branch 'develop' into feature/sycl-merge

72976fe

Merge branch 'develop' into feature/sycl-merge

db814b6

remove QUDA_FAST_COMPILE_DSLASH

cb9cd53

Merge branch 'develop' into feature/sycl-merge

0e47403

change active to alive

0c4a208

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

start merging early thread exit handling #1584

start merging early thread exit handling #1584

Uh oh!

jcosborn commented Jun 20, 2025

Uh oh!

jcosborn commented Jul 3, 2025

Uh oh!

maddyscientist commented Jul 7, 2025

Uh oh!

jcosborn commented Jul 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

start merging early thread exit handling #1584

Are you sure you want to change the base?

start merging early thread exit handling #1584

Uh oh!

Conversation

jcosborn commented Jun 20, 2025

Uh oh!

jcosborn commented Jul 3, 2025

Uh oh!

maddyscientist commented Jul 7, 2025

Uh oh!

jcosborn commented Jul 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants