@kaniel-outis (Contributor) commented Nov 6, 2025

Add tests for fused MoE:

  1. Generalization over hidden size.

  2. Test for EPLB. Because the small operator does not support the scenario where topk=-1, the test cases use global_base_prefix_sum to compute the expected EPLB result and compare it with the output of the fused operator. The fused operator returns its EPLB output, ep_recv_count, as a one-dimensional tensor of shape [num_local_experts * num_ranks], allocated as auto ep_recv_count = at::empty({num_local_experts * num_ranks}, at::dtype(at::kInt).device(device)); whereas global_base_prefix_sum has shape [num_local_experts, num_ranks]. The tests therefore preprocess global_base_prefix_sum so that it matches the layout of ep_recv_count.
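
For readers who want to see the shape matching described in item 2, here is a minimal sketch in plain Python. It is not the code from this PR: the helper name flatten_to_recv_layout, the rank_major flag, and the sample values are hypothetical, and the PR description does not say which flattening order ep_recv_count uses or whether any other conversion is applied. The real tests presumably work on tensors rather than lists.

```python
from typing import List


def flatten_to_recv_layout(table: List[List[int]], rank_major: bool = False) -> List[int]:
    """Flatten a [num_local_experts, num_ranks] table into a 1-D list.

    Hypothetical helper: the ordering used by the fused operator's
    ep_recv_count is an assumption here, so both orders are offered.
    """
    num_local_experts = len(table)
    num_ranks = len(table[0]) if num_local_experts else 0
    if any(len(row) != num_ranks for row in table):
        raise ValueError("every expert row must have num_ranks entries")
    if rank_major:
        # Index = rank * num_local_experts + expert
        return [table[e][r] for r in range(num_ranks) for e in range(num_local_experts)]
    # Index = expert * num_ranks + rank (row-major flatten)
    return [value for row in table for value in row]


# Made-up reference values with num_local_experts=2 and num_ranks=3.
global_base_prefix_sum = [[1, 3, 6], [2, 4, 9]]
expert_major = flatten_to_recv_layout(global_base_prefix_sum)
rank_major = flatten_to_recv_layout(global_base_prefix_sum, rank_major=True)
```

Either result has num_local_experts * num_ranks entries and can be compared element by element with a flat output, provided the chosen order matches the one the fused operator actually writes.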

Yael-X previously approved these changes Nov 7, 2025
Yael-X previously approved these changes Nov 18, 2025
@Yael-X Yael-X merged commit 96d5c5a into sgl-project:main Nov 24, 2025
4 checks passed

2 participants