Pull requests: vllm-project/vllm
Pull requests list
[TPU][Test] Add script to run benchmark on TPU for buildkite
ci/build
#19039
opened Jun 2, 2025 by
QiliangCui
Update docker docs with ARM CUDA cross-compile
documentation
Improvements or additions to documentation
#19037
opened Jun 2, 2025 by
mgoin
Update setuptools_scm dependency on setuptools>75
ci/build
#19035
opened Jun 2, 2025 by
pramenku
[Bugfix][EP+DP] Use pplx-kernel internode instead of intranode
#19034
opened Jun 2, 2025 by
tlrmchlsmth
[Bugfix] Fix EAGLE vocab embedding construction for Llama 70B
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#19033
opened Jun 2, 2025 by
benchislett
[Bugfix] [Model] Fixed idefics3 bug after transformers update
#19032
opened Jun 2, 2025 by
cchadowitz
[Bugfix] get_num_blocks_to_allocate with null_block
v1
#19031
opened Jun 2, 2025 by
heheda12345
[Bugfix] Max concurrency estimation and check_enough_kv_cache_memory for models with sliding window layers
v1
#19029
opened Jun 2, 2025 by
heheda12345
chore: add an alternative Ubuntu software source to speedup docker image building
ci/build
#19028
opened Jun 2, 2025 by
acelyc111
Rename eagle cache dir
ready
ONLY add when PR is ready to merge/full CI is needed
#19027
opened Jun 2, 2025 by
zou3519
[Bugfix] Improve JSON extraction in LlamaToolParser
frontend
tool-calling
#19024
opened Jun 2, 2025 by
key4ng
[Doc] Remove duplicate TOCs during MkDocs migration
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#19021
opened Jun 2, 2025 by
Zerohertz
[Bugfix] Use cmake 3.26.1 instead of 3.26 to avoid build failure
ci/build
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#19019
opened Jun 2, 2025 by
houseroad
[v1][KVCacheManager] Rename BlockHashType to BlockHash
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#19015
opened Jun 2, 2025 by
heheda12345
[draft] add some nvtx ranges for vllm for diagnostic
documentation
Improvements or additions to documentation
needs-rebase
speculative-decoding
v1
fix prefix caching logic for running requests without speculative tokens
v1
#19006
opened Jun 1, 2025 by
yuguo68
[bugfix] small fix logic issue
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#18999
opened Jun 1, 2025 by
reidliu41
[DRAFT] Self-Speculative Decoding using LayerSkip
documentation
Improvements or additions to documentation
needs-rebase
speculative-decoding
v1
#18994
opened May 31, 2025 by
aniltolwani
Draft
[ROCm] [AITER] [Bugfix] Patch for AITER commit 648764942e552a8bb5fe16026703716a81f05374
ci/build
#18990
opened May 31, 2025 by
tjtanaa
[Benchmark] Add hf_stream arg to enable or disable datasets streaming loading
#18989
opened May 31, 2025 by
Potabk
Add tarsier model support
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
ready
ONLY add when PR is ready to merge/full CI is needed
#18985
opened May 31, 2025 by
princepride