[wave2water] E2E execution of matmul kernel via water middle-end #672

tyb0807 · 2026-01-03T00:11:37Z

Fixes #600. Requires #667.

Signed-off-by: tyb0807 <[email protected]>

ftynse · 2026-01-05T08:49:50Z

lit_tests/kernel/wave/mlir_converter_e2e.py

+
+    @tkw.wave(constraints)
+    def matmul(
+        a: tkl.Memory[M, K, ADDRESS_SPACE, dtype],
+        b: tkl.Memory[N, K, ADDRESS_SPACE, dtype],
+        c: tkl.Memory[M, N, GLOBAL_ADDRESS_SPACE, tkl.f32],
+    ):
+        c_reg = tkl.Register[M, N, tkl.f32](0.0)
+
+        @tkw.iterate(K, init_args=[c_reg])
+        def repeat(acc: tkl.Register[M, N, tkl.f32]) -> tkl.Register[M, N, tkl.f32]:
+            a_reg = tkw.read(a)
+            b_reg = tkw.read(b)
+            acc = tkw.mma(a_reg, b_reg, acc)
+            return acc
+
+        tkw.write(repeat, c)


Can we use one from templates?

ftynse · 2026-01-05T08:50:26Z

lit_tests/kernel/wave/mlir_converter_e2e.py

+    # Apply Water PassManager lowering
+    lowered_mlir = apply_water_middle_end_passes(wave_dialect_mlir)
+
+    print(lowered_mlir)


Let's have FileCheck comments here so it doesn't look like forgotten debug output.

ftynse · 2026-01-05T08:51:05Z

lit_tests/kernel/wave/mlir_converter_e2e.py

+        compile_to_mlir=True,
+        location_capture_config=LocationCaptureConfig(level=LocationCaptureLevel.NONE),
+        enforce_locations=False,
+        print_mlir=True,


Do we need to print mlir?

ftynse · 2026-01-05T08:51:55Z

lit_tests/kernel/wave/mlir_converter_e2e.py

+    b_tensor = device_randn(n, k, dtype=torch.float16)  # Note: transposed in matmul
+    c_tensor = device_zeros(m, n, dtype=torch.float32)
+
+    # Expected result (CPU computation)


The code above creates tensors on device, why is this called CPU computation?

ftynse · 2026-01-05T08:52:20Z

lit_tests/kernel/wave/mlir_converter_e2e.py

+    c_tensor = device_zeros(m, n, dtype=torch.float32)
+
+    # Expected result (CPU computation)
+    expected = torch.matmul(a_tensor.float(), b_tensor.T.float())


It is a bad idea to compute expected values with a higher precision than actual values.

ftynse · 2026-01-05T08:52:53Z

lit_tests/kernel/wave/mlir_converter_e2e.py

+
+    compiled_e2e(a_tensor, b_tensor, c_tensor)
+
+    assert_close(c_tensor, expected, rtol=1e-3, atol=1e-3)


1e-3 looks a bit too lax, do we really need it?

tyb0807 requested review from ftynse and tgymnich January 3, 2026 00:11

[wave2water] E2E execution of matmul kernel via water middle-end

83e9692

Signed-off-by: tyb0807 <[email protected]>

tyb0807 force-pushed the e2e_mm branch from 6630aac to 83e9692 Compare January 3, 2026 19:09

ftynse reviewed Jan 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[wave2water] E2E execution of matmul kernel via water middle-end #672

[wave2water] E2E execution of matmul kernel via water middle-end #672

Uh oh!

tyb0807 commented Jan 3, 2026

Uh oh!

ftynse Jan 5, 2026

Uh oh!

ftynse Jan 5, 2026

Uh oh!

ftynse Jan 5, 2026

Uh oh!

ftynse Jan 5, 2026

Uh oh!

ftynse Jan 5, 2026

Uh oh!

ftynse Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		compiled_e2e(a_tensor, b_tensor, c_tensor)

		assert_close(c_tensor, expected, rtol=1e-3, atol=1e-3)

[wave2water] E2E execution of matmul kernel via water middle-end #672

Are you sure you want to change the base?

[wave2water] E2E execution of matmul kernel via water middle-end #672

Uh oh!

Conversation

tyb0807 commented Jan 3, 2026

Uh oh!

ftynse Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

ftynse Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

ftynse Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

ftynse Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

ftynse Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

ftynse Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants