Master: Develop TMA+Warp-Specialization support for all schedulers #5364

@liqiangxl

Description

All current non-matmul schedulers except the inner-outer persistent scheduler still rely on the classical multi-wave approach.
A TMA + Warp-Specialization variant should be developed for each of these schedulers, since this approach has already demonstrated substantial speedups in the inner-outer persistent scheduler.
