Remove rv_all from GPU generated code #5

Open
PearCoding wants to merge 1 commit into AnyDSL:artic from PearCoding:bug/rv_all

@PearCoding

GPU kernels get polluted by the rv_all instruction. The instruction is filtered out only if Thorin is compiled with RV, which cannot be assumed: RV should not be required when Rodent is used only for GPU targets.
The "fix" is quite simple. It would be great to get rid of the code duplication as well, but that is beyond the scope of this small PR.

@Hugobros3

We should probably have some kind of generic, portable SIMD intrinsics that work regardless of the platform (RV, CUDA, AMDHSA, OpenCL, Shady, ...).
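
As a rough illustration of what such a portable intrinsic could look like, here is a minimal C++ sketch of an "all lanes" vote with a plain fallback. The name `lanes_all` and the mask representation are hypothetical and not part of Thorin, RV, or Rodent; the sketch only assumes that an "all" vote returns true when the predicate holds on every lane.

```cpp
#include <array>
#include <cstddef>

// Hypothetical portable "all lanes" vote (not an existing Thorin/RV API).
// Generic fallback: returns true only if every lane's predicate is true.
template <std::size_t N>
bool lanes_all(const std::array<bool, N>& mask) {
    for (bool lane : mask) {
        if (!lane) return false;  // one failing lane decides the vote
    }
    return true;
}

// Scalar case: with a single lane, the vote is just the predicate itself.
inline bool lanes_all(bool pred) {
    return pred;
}
```

A real implementation would presumably map the same interface to the native vote of each backend and keep a loop like this only as the fallback.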

@PearCoding
Author

Yes, I agree. We should also add the "fma" instruction to the math builtins (maybe with a fallback for non-LLVM backends, e.g., OpenCL). I think there are more general-purpose intrinsics that might be handy on all systems, provided a well-behaved fallback can be defined.
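
A minimal C++ sketch of the fallback idea for "fma", under stated assumptions: the macro `HAS_NATIVE_FMA` and the function `portable_fma` are hypothetical names used only for illustration, and the real builtin would live in the compiler's math builtins rather than in C++.

```cpp
#include <cmath>

// Hypothetical switch: set to 0 for a backend without a fused multiply-add.
#ifndef HAS_NATIVE_FMA
#define HAS_NATIVE_FMA 1
#endif

// Computes a * b + c, using the fused operation where available.
inline float portable_fma(float a, float b, float c) {
#if HAS_NATIVE_FMA
    return std::fma(a, b, c);  // single rounding step
#else
    return a * b + c;          // fallback: rounds after the multiply and after the add
#endif
}
```

Note that the fallback is not bit-identical in general: a fused multiply-add rounds once, while the plain expression rounds twice, so results can differ in the last bit. Whether that counts as "well-behaved" depends on how strictly the builtin's contract is specified.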
