Pull requests: ROCm/aiter
Pull requests list
Add tuned csv files for Gemm and MoE to accelerate Kimi-K2
#2290
opened Mar 16, 2026 by
wuhuikx
Add 32x128 and 64x128 asm kernels to support qwen3-next TP4
ci:atom
#2285
opened Mar 15, 2026 by
JohnNikolay84
Use arch-aware fp8 dtype in tests for gfx1250 support
#2284
opened Mar 15, 2026 by
sunway513
3 of 4 tasks
Add gfx1250 support: GFX_MAP + default GEMM/MHA configs
#2282
opened Mar 15, 2026 by
sunway513
3 of 5 tasks
[Triton MoE] Add optimized Gluon kernel for AMD CDNA3 with K-dimension unrolling
#2277
opened Mar 14, 2026 by
jwu10003
1 task done
[TRITON] Add Attention support to the bench_models benchmarking script
ci:all
enhancement
triton
#2274
opened Mar 13, 2026 by
lucas-santos-amd
1 task
Add topk implementations: csdn baseline, csdn reduceAtomic, carlus ba…
#2270
opened Mar 13, 2026 by
chuanbowang2026
2 tasks
CI: Switch Triton tests from MI325 to MI355 runners
#2269
opened Mar 13, 2026 by
gyohuangxin
2 tasks
Introduce asm fmoe kernels that do not require bf16->fp8 quantization
ci:atom
ci:sglang
ci:vllm
#2262
opened Mar 12, 2026 by
JohnNikolay84
1 task
Gemm & moe tunning for DeepSeek-R1 in InferenceX FP4 case
#2261
opened Mar 12, 2026 by
inkcherry
1 task
[TRITON] Update config of MHA PE forward kernel on …
triton
gfx950
ci:atom
ci:sglang
ci:vllm
enhancement
#2260
opened Mar 12, 2026 by
brunomazzottiamd
1 task done
Add performance parity tests for AITER kernels
#2258
opened Mar 12, 2026 by
ChuanLi1101
4 tasks