Replies: 2 comments 1 reply
-
In |
-
I tried using |
Greetings -
Can anyone point me to the headers I need in order to start experimenting, by trial and error, with various new cuBLAS build variables? Should I be working with the fattn...cuh and related ggml headers, or are there llama-specific headers I should be looking into?
I appreciate any pointers - novice here, although I'm sure I've made that apparent already.
There are some preexisting variables I want to work with, as well as some new ones that came with the recent changes in CUDA Toolkit 12.9.
I want to set and benchmark:
cublasSetEmulationStrategy(), with cublasEmulationStrategy_t set to
CUBLAS_EMULATION_STRATEGY_PERFORMANT
vs
CUBLAS_EMULATION_STRATEGY_EAGER
Since CUBLAS_TENSOR_OP_MATH is deprecated, I also want to select either CUBLAS_DEFAULT_MATH or CUBLAS_TF32_TENSOR_OP_MATH directly.
It would also be nice to run the occasional bfloat16 matmul, which I imagine would look something like:
cublasLtMatmul(), with cublasComputeType_t set to
CUBLAS_COMPUTE_32F_FAST_16BF
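As a rough, untested sketch of how such a switch might be wired up for benchmarking: the math mode and the emulation strategy are both set on a cuBLAS handle at runtime, so one option is to pick them from a string (an environment variable, say) and apply them right after the handle is created. Everything named below (MathChoice, EmuChoice, parse_math, parse_emu, apply_choices, SKETCH_USE_CUBLAS) is made up for illustration and is not part of llama.cpp or ggml. The cuBLAS part is opt-in and assumes a 12.9+ toolkit in which cublasSetEmulationStrategy() takes a handle plus a cublasEmulationStrategy_t; check that against the cublas_api.h you actually have.

```cpp
#include <string>

#ifdef SKETCH_USE_CUBLAS   // hypothetical opt-in macro; needs CUDA Toolkit 12.9+ and -lcublas
#include <cublas_v2.h>
#endif

// Hypothetical knobs for this sketch; none of these names exist in llama.cpp or ggml.
enum class MathChoice { Default, TF32 };
enum class EmuChoice  { LibraryDefault, Performant, Eager };

// Map a string (e.g. read from an environment variable) to a math-mode choice.
// Anything other than "tf32", including a null pointer, falls back to Default.
MathChoice parse_math(const char * s) {
    return (s != nullptr && std::string(s) == "tf32") ? MathChoice::TF32
                                                      : MathChoice::Default;
}

// Map a string to an emulation-strategy choice.
// Unknown or missing values leave the library default untouched.
EmuChoice parse_emu(const char * s) {
    if (s == nullptr) return EmuChoice::LibraryDefault;
    const std::string v(s);
    if (v == "performant") return EmuChoice::Performant;
    if (v == "eager")      return EmuChoice::Eager;
    return EmuChoice::LibraryDefault;
}

#ifdef SKETCH_USE_CUBLAS
// Apply both choices to an existing handle and return the first failing status.
// Assumption: cublasSetEmulationStrategy(handle, strategy) as in the 12.9 headers.
cublasStatus_t apply_choices(cublasHandle_t handle, MathChoice m, EmuChoice e) {
    cublasStatus_t st = cublasSetMathMode(
        handle,
        m == MathChoice::TF32 ? CUBLAS_TF32_TENSOR_OP_MATH : CUBLAS_DEFAULT_MATH);
    if (st != CUBLAS_STATUS_SUCCESS || e == EmuChoice::LibraryDefault) {
        return st;
    }
    return cublasSetEmulationStrategy(
        handle,
        e == EmuChoice::Eager ? CUBLAS_EMULATION_STRATEGY_EAGER
                              : CUBLAS_EMULATION_STRATEGY_PERFORMANT);
}
#endif
```

With something like this, a benchmark run could call parse_math(std::getenv(...)) and parse_emu(std::getenv(...)) once at startup and pass the results to apply_choices() on whichever handle the backend creates. The compute type is a different kind of knob: CUBLAS_COMPUTE_32F_FAST_16BF is not set on the handle but passed per call, for example as the computeType argument of cublasGemmEx() or when creating the matmul descriptor with cublasLtMatmulDescCreate().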
I don't imagine CMake knows how to handle these build variables without specific corresponding flags, right?
Anyway, if anyone can suggest a starting point, that would be great. Thanks