Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Misc] Make LayerBlockType a Literal instead of Enum ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs v1
#27658 opened Oct 28, 2025 by DarkLight1337 Loading…
5 tasks
[CI/Build] Move pre-commit only scripts to tools/pre_commit ci/build ready ONLY add when PR is ready to merge/full CI is needed
#27657 opened Oct 28, 2025 by DarkLight1337 Loading…
5 tasks
[FLA] Introduce Kimi Delta Attention(KDA) to VLLM
#27654 opened Oct 28, 2025 by zhiyuan1i Loading…
5 tasks
[ROCm] [AITER] Add block scaled bpreshuffle gemm rocm Related to AMD ROCm
#27652 opened Oct 28, 2025 by tjtanaa Draft
5 tasks
[Core] Support async-scheduling with kvconnector v1
#27647 opened Oct 28, 2025 by KevinCheung2259 Loading…
5 tasks
[MTP] Refactor mtp predictor to avoid d2h operation deepseek Related to DeepSeek models
#27643 opened Oct 28, 2025 by MengqingCao Loading…
5 tasks
[Frontend] Add vllm bench sweep to CLI ci/build documentation Improvements or additions to documentation frontend performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed
#27639 opened Oct 28, 2025 by DarkLight1337 Loading…
5 tasks
feature: support ipv6 in vllm distributed
#27638 opened Oct 28, 2025 by brook-cpp Loading…
5 tasks
[NIXL][XPU] update name of nixl wheel kv-connector ready ONLY add when PR is ready to merge/full CI is needed
#27631 opened Oct 28, 2025 by zhenwei-intel Loading…
[Core][Bookkeeping] Update cu_num_accepted_tokens for all req_index v1
#27629 opened Oct 28, 2025 by Jialin Loading…
3 of 5 tasks
Fix MiniMax-M2 rmsnorm precision and remove useless code ready ONLY add when PR is ready to merge/full CI is needed
#27627 opened Oct 28, 2025 by rogeryoungh Loading…
5 tasks
[ROCm][Platform] Add MI308X device id in _ROCM_DEVICE_ID_NAME_MAP rocm Related to AMD ROCm
#27623 opened Oct 28, 2025 by sammysun0711 Loading…
3 of 5 tasks
[Add] cmdline argument parsing for KV cache offloading modules
#27621 opened Oct 28, 2025 by ApostaC Loading…
6 tasks
[Model] make the inv_freq in Qwen2_5_VisionRotaryEmbedding device-agnostic qwen Related to Qwen models
#27617 opened Oct 28, 2025 by yangulei Loading…
[BUG] Make 'binary' default option for saving torch compile artifacts when using standalone_compile ready ONLY add when PR is ready to merge/full CI is needed
#27616 opened Oct 28, 2025 by ahao-anyscale Draft
2 of 5 tasks
[AsyncScheduling] Make async overlap work with logprobs ready ONLY add when PR is ready to merge/full CI is needed v1
#27615 opened Oct 27, 2025 by njhill Loading…
[WIP] Enable async scheduling by default kv-connector ready ONLY add when PR is ready to merge/full CI is needed structured-output suppress-bc-linter tpu Related to Google TPUs v1
#27614 opened Oct 27, 2025 by njhill Loading…
add a warmup mode in CUDAGraphMode v1
#27613 opened Oct 27, 2025 by bangshengtang Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.