-
Notifications
You must be signed in to change notification settings - Fork 1.5k
[Bug] Run DeepSeek-v3-0324 on 2xH20 encountered cuda illegal memory access #4856
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
this bug is fixed by #4727, output logs:
|
+1 |
When setting |
+1 |
I'm author of #4727 。 |
Thank you for proposing the solution in #4727. I have tried it by adding
after https://github.com/sgl-project/sglang/blob/main/python/sglang/srt/layers/moe/fused_moe_triton/fused_moe.py#L154 |
"but unfortunately, the issue still persists when using the --enable-ep-moe option." |
Checklist
Describe the bug
Run DeepSeek-v3-0324 on 2xH20 encountered "RuntimeError: CUDA error: an illegal memory access was encountered".
output logs:
Reproduction
model: DeepSeek-V3-0324(https://huggingface.co/deepseek-ai/DeepSeek-V3-0324/tree/main)
running params:
node 0:
node 1:
benchmark script:
Environment
python3 -m sglang.check_env
outputs:sglang version: installed from source,
commit id: 52029bd
Date: Tue Mar 25 17:01:21 2025 +0800
The text was updated successfully, but these errors were encountered: