Pull requests: codeplaysoftware/cutlass-sycl

Pull requests list

A16S8 gemm && tensor-wise quantization release
#441 opened Jun 23, 2025 by jiyang1011
New fp8 decode release
#439 opened Jun 19, 2025 by mehdi-goli
Unify f8 and mixed input pipelines release
#436 opened Jun 19, 2025 by t4c1
[CI] -Werror
#435 opened Jun 16, 2025 by joeatodd
[PoC] Universal 2D copy
#433 opened Jun 16, 2025 by aacostadiaz Draft
[Draft] Add benchmark for GEMM
#432 opened Jun 16, 2025 by aacostadiaz Draft
First version of FP8 scaled_mm.
#428 opened Jun 11, 2025 by cfgfung Draft
Template atoms
#417 opened Jun 10, 2025 by t4c1 Draft
Cutlass 4.0
#385 opened May 20, 2025 by aacostadiaz Draft
Enable prefetch iteration
#382 opened May 19, 2025 by t4c1
enable splitk for mixed precision gemm
#381 opened May 19, 2025 by taozha2
Check the WarpLayout provided to TiledMMAHelper
#377 opened May 15, 2025 by joeatodd
FP8 Grouped GEMM CollectiveMma release
#351 opened May 1, 2025 by sanchitintel
add gemm with rmsnorm
#321 opened Apr 22, 2025 by yuankuns
Pure FP8 (W8A8) GEMM support (draft)
#306 opened Apr 14, 2025 by jiyang1011
Enable SM90 via sycl-cuda-compat
#276 opened Mar 24, 2025 by FMarno