-
Notifications
You must be signed in to change notification settings - Fork 510
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[v0.11.0][BugFix][P/D] Modify the recalculation logic to prevent waiting requests from filling up the D node KVCache
#3686
opened Oct 23, 2025 by
underfituu
Loading…
[WIP][docs] add aclgraph developer guide
documentation
Improvements or additions to documentation
#3683
opened Oct 23, 2025 by
zzzzwwjj
Loading…
add mrope fusion op
module:core
module:ops
module:tests
ready
read for review
ready-for-test
start test by label for PR
#3680
opened Oct 23, 2025 by
shaopeng-666
Loading…
[Feat] Delete redundant operations in model_runner and forward_context
module:core
#3677
opened Oct 23, 2025 by
realliujiaxu
Loading…
[BugFix][Core] Fix a bug running multi-modal with ascend_scheduler
documentation
Improvements or additions to documentation
#3675
opened Oct 23, 2025 by
whx-sjtu
Loading…
[Refactor] Refactor Ascend attention implementation forward
module:core
module:tests
ready
read for review
ready-for-test
start test by label for PR
#3674
opened Oct 23, 2025 by
yiz-liu
Loading…
[Feature] Remove stream synchronization during ring_mla
module:tests
#3672
opened Oct 23, 2025 by
jianzs
Loading…
[CI] Refactor nightly workflow
merge-conflicts
module:tests
#3671
opened Oct 23, 2025 by
Potabk
Loading…
[Main][Perf] Add fused matmul/reduce-scatter kernel for performance optimization.
module:ops
ready
read for review
ready-for-test
start test by label for PR
#3669
opened Oct 23, 2025 by
ZYang6263
Loading…
[Feature] Reduce stream count for shared expert multistream
module:ops
ready
read for review
ready-for-test
start test by label for PR
#3667
opened Oct 23, 2025 by
jianzs
Loading…
[Docs] Add InternVL series tutorial for single NPU
documentation
Improvements or additions to documentation
#3664
opened Oct 23, 2025 by
gcanlin
Loading…
[BugFix]Check all expert maps when using muilty instance.
module:ops
#3662
opened Oct 23, 2025 by
offline893
Loading…
[Doc] Update installation instructions
documentation
Improvements or additions to documentation
#3661
opened Oct 23, 2025 by
toolsmanhehe
Loading…
support FULL_AND_PIECEWISE graph mode.
module:core
module:tests
#3659
opened Oct 23, 2025 by
momo609
Loading…
[Test]Add accuracy test for model ERNIE-4.5-21B-A3B-PT
module:tests
#3658
opened Oct 23, 2025 by
MrZ20
Loading…
Optimizing PD-Proxy for First Token Early Return (Better & More Stable TTFT)
#3656
opened Oct 23, 2025 by
h225yang
Loading…
[v0.11.0][Feat] Prefetching Attention QKV Linear Weight With
AddRmsNormQuant
Custom Op
module:ops
module:tests
#3649
opened Oct 23, 2025 by
zhoux77899
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-10-20.