
feat(benchmark): add MoE FP4 small-batch shapes and enable MoE by default #332

Draft
zhiding512 wants to merge 2 commits into main from zhimding/update_mixed_moe_0401

Conversation

Contributor

zhiding512 commented Apr 1, 2026


feat(benchmark): add MoE FP4 small-batch shapes and enable MoE by default

Signed-off-by: zhimding <zhimding@amd.com>
Copilot AI review requested due to automatic review settings April 1, 2026 14:32
@zhiding512 zhiding512 marked this pull request as draft April 1, 2026 14:34

Copilot AI left a comment


Pull request overview

Updates the benchmark runner to broaden MoE FP4 coverage (especially small-batch/token shapes) and to include MoE benchmarks in the default run set.

Changes:

  • Add additional small-token MoE FP4 benchmark shapes (sketched below).
  • Add a larger-token MoE FP4 shape for the (7168, 2048) configuration.
  • Enable MoE benchmarks by default in run_benchmark.sh.
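As a rough illustration of what such entries could look like, here is a hypothetical sketch; the field names, expert count, and top-k values are assumptions, not the PR's actual table (only the (7168, 2048) dimensions come from the change list):

```python
# Hypothetical sketch of the kind of MoE FP4 benchmark shapes this PR adds.
# Only the (7168, 2048) dims come from the PR description; field names,
# the expert count, and top-k are illustrative assumptions.
from dataclasses import dataclass

@dataclass
class MoeShape:
    tokens: int     # tokens per batch (the new entries target small batches)
    model_dim: int  # hidden size
    inter_dim: int  # expert intermediate size
    experts: int    # number of routed experts (assumed)
    topk: int       # experts selected per token (assumed)

MOE_FP4_SHAPES = [
    # small-token shapes for decode-style batches
    MoeShape(tokens=1,    model_dim=7168, inter_dim=2048, experts=256, topk=8),
    MoeShape(tokens=2,    model_dim=7168, inter_dim=2048, experts=256, topk=8),
    MoeShape(tokens=4,    model_dim=7168, inter_dim=2048, experts=256, topk=8),
    MoeShape(tokens=8,    model_dim=7168, inter_dim=2048, experts=256, topk=8),
    # larger-token shape for the (7168, 2048) configuration
    MoeShape(tokens=4096, model_dim=7168, inter_dim=2048, experts=256, topk=8),
]
```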


Comment on lines 208 to +212
 RUN_SOFTMAX=1
 RUN_LAYERNORM=1
 RUN_RMSNORM=1
 RUN_PRESHUFFLE_GEMM=1
-RUN_MOE=0
+RUN_MOE=1

Copilot AI Apr 1, 2026


The comment describing the default selected ops is now out of date: RUN_MOE is enabled by default, but the comment still says only softmax/norms/GEMM run unless the user enables others. Please update the comment so it matches the current default behavior (and/or adjust the wording around how to include/exclude MoE).

The second commit:

- Replace bytes_moved in stage1/stage2 with corrected formula:
  use aE=min(tokens*topk, experts) for weight bytes, proper scale
  group sizes (32 for FP4 MX, group_size for INT4, K for per-channel)
- Skip MoE FP4 benchmark on non-gfx950 (e.g. MI325/gfx942) with
  'Skipping FP4' message matched by run_benchmark.sh grep filter

Signed-off-by: zhimding <zhimding@amd.com>
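A minimal Python sketch of the corrected bytes_moved accounting and the arch guard described in this commit message, assuming a two-stage MoE GEMM (stage 1: model_dim → inter_dim) with 1-byte scales and 2-byte activations; the function names, byte widths, and stage layout are assumptions, not the PR's actual code:

```python
def scale_group_elems(dtype: str, k: int, group_size: int) -> int:
    """Elements covered by one scale value, per the corrected formula."""
    if dtype == "fp4_mx":
        return 32          # MX FP4: fixed 32-element scale groups
    if dtype == "int4":
        return group_size  # INT4: configured group size
    return k               # per-channel: one scale per K elements

def stage_bytes_moved(tokens: int, k: int, n: int, experts: int, topk: int,
                      dtype: str = "fp4_mx", group_size: int = 128,
                      weight_bits: int = 4, act_bytes: int = 2) -> int:
    # Key fix: only activated experts' weights are read. With tokens*topk
    # routed (token, expert) pairs, at most min(tokens*topk, experts)
    # distinct experts can be touched, so small batches no longer count
    # the full expert weight set.
    aE = min(tokens * topk, experts)
    weight_bytes = aE * k * n * weight_bits // 8
    scales_per_row = k // scale_group_elems(dtype, k, group_size)
    scale_bytes = aE * n * scales_per_row                  # assumes 1-byte scales
    act_bytes_moved = tokens * topk * (k + n) * act_bytes  # read K, write N per pair
    return weight_bytes + scale_bytes + act_bytes_moved

def maybe_skip_fp4(gfx_arch: str) -> bool:
    # Second bullet: FP4 MoE only runs on gfx950. The message contains
    # 'Skipping FP4' so run_benchmark.sh's grep filter can match it.
    if not gfx_arch.startswith("gfx950"):
        print(f"Skipping FP4 MoE benchmark on {gfx_arch} (requires gfx950)")
        return True
    return False
```

For example, with tokens=4, topk=8, experts=256, aE is min(32, 256) = 32 rather than 256, so the assumed weight traffic for that small-batch shape drops by 8x relative to counting every expert.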
