Skip to content

[Liger-Kernel][LAYER NORM]Performance Discrepancy of Triton vs. Torch on PVC Compared to Nvidia #3702

@Tarakarevu1

Description

@Tarakarevu1

Describe the issue

###Describe the issue
For [layer_norm] we collected benchmark data in

Triton
Torch
Torch.compile with three different modes: default, reduce-overhead, and max-autotune
Our findings indicate a notable performance discrepancy between Triton and Torch on different hardware platforms, specifically PVC and Nvidia H100 GPUs.

Let me know if you need more details.

Environment details

Triton: 3.2
GPU: Intel GPU Max 1550, H-100

Metadata

Metadata

Type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions