Skip to content

Pull requests: vllm-project/tpu-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix moe layer from upstream change ready ONLY add when PR is ready to merge/full CI is needed
#1274 opened Dec 10, 2025 by kyuyeunk Loading…
First check-in to add ci/cd test on tpuv7x ready ONLY add when PR is ready to merge/full CI is needed
#1270 opened Dec 9, 2025 by QiliangCui Loading…
add github action for check ready label ready ONLY add when PR is ready to merge/full CI is needed
#1269 opened Dec 9, 2025 by boe20211 Loading…
Replacing bit_width() with itemized_bits(). ready ONLY add when PR is ready to merge/full CI is needed
#1264 opened Dec 8, 2025 by aman2930 Loading…
3 tasks done
Add default 'auto' MODEL_IMPL_TYPE that resolves based on architecture ready ONLY add when PR is ready to merge/full CI is needed
#1255 opened Dec 5, 2025 by xingliu14 Loading…
[Kernel][FusedMoE] Fix MoE crash and hang issues ready ONLY add when PR is ready to merge/full CI is needed
#1252 opened Dec 5, 2025 by bythew3i Loading…
docs: update support matrices and improve visuals
#1250 opened Dec 5, 2025 by RobMulla Loading…
Avoid installing CUDA related stuff
#1246 opened Dec 4, 2025 by wdhongtw Loading…
Reduce image size and enhance caching
#1245 opened Dec 4, 2025 by wdhongtw Loading…
update run_in_docker script for running on local env ready ONLY add when PR is ready to merge/full CI is needed
#1243 opened Dec 4, 2025 by ernie-chang Loading…
Verify vllm-tpu python package (draft) ready ONLY add when PR is ready to merge/full CI is needed
#1241 opened Dec 4, 2025 by ylangtsou Draft
[CI] Fix awq dtype ready ONLY add when PR is ready to merge/full CI is needed
#1220 opened Dec 2, 2025 by kyuyeunk Loading…
[Oncall] update the SchedulerConfig interface
#1219 opened Dec 2, 2025 by bzgoogle Loading…
Add a SP e2e test.
#1209 opened Dec 2, 2025 by vanbasten23 Loading…
[RPA] Pipeline flash attention in default kernel ready ONLY add when PR is ready to merge/full CI is needed
#1203 opened Dec 1, 2025 by jrplatin Loading…
Save size in scalar scratch for bo and bq ready ONLY add when PR is ready to merge/full CI is needed
#1201 opened Dec 1, 2025 by rupengliu-meta Loading…
[Qwix/Flax] Upgrade to Flax 0.12.0 + Qwix 0.1.4
#1170 opened Nov 25, 2025 by jrplatin Loading…
[do not merge] test status check POC ready ONLY add when PR is ready to merge/full CI is needed
#1168 opened Nov 25, 2025 by khluu Loading…
[Feat][TPU Offload] KV cache offload to local cpu buffer ready ONLY add when PR is ready to merge/full CI is needed
#1163 opened Nov 24, 2025 by juncgu-google Loading…
ProTip! Filter pull requests by the default branch with base:main.