Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Blockscaled Ragged Contiguous Grouped Gemm for MoEs
#2790 opened Nov 21, 2025 by Shreya-gaur Loading…
add cute.union
#2788 opened Nov 21, 2025 by v0i0 Loading…
Fix: print subbyte<T> compilation error
#2783 opened Nov 19, 2025 by chrisHuxi Loading…
WIP: OSS CI Testing
#2776 opened Nov 15, 2025 by zekunf-nv Loading…
add dump_patch.py
#2767 opened Nov 13, 2025 by AlexSJJ Loading…
Remove prints from fmha fwd kernels
#2765 opened Nov 12, 2025 by milesvant Loading…
Fix example in CuTe tutorials
#2752 opened Nov 6, 2025 by StevenYangCC Loading…
63_hopper_gemm_with_weight_prefetch: C++17 compat
#2744 opened Nov 3, 2025 by javacruft Loading…
Minor fix cute dsl example paths
#2741 opened Nov 1, 2025 by Edenzzzz Loading…
Fix register index bug in mma.sync.aligned.m16n8k16
#2740 opened Nov 1, 2025 by neilkichler Loading…
[doc] Update flags for CUDA 13
#2731 opened Oct 28, 2025 by chengscott Loading…
[DOC] Fix broken links
#2729 opened Oct 28, 2025 by Echo-Nie Loading…
fix: use add_help=False in temporary parser
#2721 opened Oct 24, 2025 by dwhswenson Loading…
Support PDL for SM90 Array TMA GEMM
#2719 opened Oct 24, 2025 by HydraQYH Loading…
remove invalid template parameters
#2718 opened Oct 24, 2025 by arnej27959 Loading…
Latex Printing Support
#2704 opened Oct 19, 2025 by depaulmillz Loading…
fix half_t numeric_limits digits inactive-30d
#2690 opened Oct 12, 2025 by Aminsed Loading…
add tf32 cvt ptx for sm80 inactive-30d
#2689 opened Oct 12, 2025 by Aminsed Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.