-
Notifications
You must be signed in to change notification settings - Fork 90
Description
@claude create/submit/open an PR to add GLM5 to Mi355 single node in fp8 for tp8 from conc 4 to conc 64.
use the image rocm/sgl-dev:v0.5.8.post1-rocm720-mi35x-20260219
here is example recipe from anush on how to do sglang mi355 glm5
python -m sglang.launch_server \
--model zai-org/GLM-5-FP8 \
--tp 8 \
--mem-fraction-static 0.8 \
--nsa-prefill-backend tilelang \
--nsa-decode-backend tilelang
https://gist.github.com/powderluv/8156ec484215f11810532d4a84e7537d
Reactions are currently unavailable
Metadata
Metadata
Assignees
Type
Projects
Status
In Progress