-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] Docker image v0.4.4.post3-cu125 is labeled as CUDA 12.5 (cu125), but it actually contains CUDA 12.4.
#4952
opened Mar 31, 2025 by
cynial
5 tasks done
NotImplementedError: Updating weights_map with disk offloading is not implemented yet
#4945
opened Mar 31, 2025 by
Gusha-nye
[Feature] use different lib so for fa3 in sgl-kernel
high priority
#4941
opened Mar 31, 2025 by
zhyncs
2 tasks
[Bug] 0.0.0.0 host not supported
bug
Something isn't working
#4935
opened Mar 30, 2025 by
gyin94
2 of 5 tasks
[Bug] Qwen/Qwen2.5-VL-7B-Instruct-AWQ sglang response time longer
#4916
opened Mar 30, 2025 by
bao231
5 tasks
[Bug] Remove stream sync in fast decode plan of flashinfer mla backend
bug
Something isn't working
flashinfer
#4905
opened Mar 29, 2025 by
Fridge003
5 tasks
[Feature] The performance fails to meet expectations.
#4903
opened Mar 29, 2025 by
bebilli
2 tasks done
[Bug] Missing Torch and GGUF dependency in v0.4.4.post3 in Docker image
#4900
opened Mar 29, 2025 by
davidsyoung
5 tasks done
[Bug] Potential Issue with Querying Log Probabilities
high priority
#4892
opened Mar 29, 2025 by
yangky11
5 tasks done
[Bug] zmq.error.ZMQError: Cannot assign requested address (addr='tcp://addr:port')
#4877
opened Mar 29, 2025 by
HoyTiger
5 tasks done
[Bug] Logprobs overflow to -3.4e+38
bug
Something isn't working
#4876
opened Mar 29, 2025 by
zhc7
5 tasks done
Add Docs about attention backend, maybe we need to add a comprehensive one with Support Matrix such as:
#4865
opened Mar 28, 2025 by
hebiao064
[Bug] Executing Qwen2.5-Omni-7B on SGLang 0.4.4 post2: AttributeError: 'Qwen2_5OmniConfig' object has no attribute 'hidden_size'
#4862
opened Mar 28, 2025 by
didier-durand
4 of 5 tasks
[Bug] Run DeepSeek-v3-0324 on 2xH20 encountered cuda illegal memory access
#4856
opened Mar 28, 2025 by
cscyuge
5 tasks done
[Bug]
NameError: name 'CompressedTensorsW8A16Fp8' is not defined
#4851
opened Mar 28, 2025 by
vhain
5 tasks done
[Bug] DP attention with Eagle worker raises AttributeError
#4847
opened Mar 28, 2025 by
knukiban
5 tasks done
[Feature] Mooncake CPP (Chunked Pipeline Parallelism)
#4842
opened Mar 28, 2025 by
kingzevin
2 tasks done
[Bug] Multiple concurrency causes service exceptions - Qwen2.5-VL-3B-Instruct
#4838
opened Mar 28, 2025 by
fz5400
5 tasks done
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-28.