Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q4 2024
#9006 opened Oct 1, 2024 by simon-mo
Open 18
vLLM's V1 Engine Architecture
#8779 opened Sep 24, 2024 by simon-mo
Open 9

Issues list

[Bug]: Not able to run LLama3 LoRA with --fully-sharded-loras
Labels: bug
#10342 opened Nov 14, 2024 by xyang16
1 task done
[Bug]: Out of Memory (OOM) Issues During MMLU Evaluation with lm_eval
Labels: bug
#10325 opened Nov 14, 2024 by wchen61
1 task done
[Usage]: Request to include vllm==0.6.2 for cuda 11.8
Labels: usage
#10319 opened Nov 14, 2024 by amew0
1 task done
[Bug]: FusedMoE kernel performance depends on input prompt length while decoding
Labels: bug
#10313 opened Nov 14, 2024 by taegeonum
1 task done
[Usage]: how to use vllm to output code only
Labels: usage
#10309 opened Nov 14, 2024 by shaoyuyoung
1 task done
[Installation]: Build vllm environment error
Labels: installation
#10303 opened Nov 13, 2024 by Kawai1Ace
1 task done
[Bug]: vllm-openai is outdated
Labels: bug
#10301 opened Nov 13, 2024 by matbee-eth
[Bug]: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12
Labels: bug
#10300 opened Nov 13, 2024 by yananchen1989
1 task done
[Bug]: VLLLm crash when running Qwen/Qwen2.5-Coder-32B-Instruct on two H100 GPUs
Labels: bug
#10296 opened Nov 13, 2024 by noamwies
1 task done
[Bug]: Can't use yarn rope config for long context in Qwen2 model
Labels: bug
#10293 opened Nov 13, 2024 by FlyCarrot
1 task done
[Feature]: 2D TP & EP
Labels: feature request
#10289 opened Nov 13, 2024 by WenhaoHe02
1 task done
[Bug]: Speculative Decoding + TP on Spec Worker + Chunked Prefill does not work.
Labels: bug
#10276 opened Nov 13, 2024 by andoorve
1 task done