[WIP] Update newest #4142

22dimensions · 2025-11-12T04:47:02Z

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

vLLM version: v0.11.0
vLLM main: vllm-project/vllm@2918c1b

github-actions · 2025-11-12T04:47:12Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

gemini-code-assist

Code Review

This pull request updates the codebase to support a newer version of the vLLM library. The changes primarily involve adapting to moved utility functions and API modifications. While the changes appear functional, there's a significant amount of code duplication introduced for handling version-dependent imports (e.g., cdiv, init_cached_hf_modules) across numerous files. I strongly recommend refactoring this logic into a centralized compatibility utility module to enhance maintainability. Additionally, there's duplicated logic in vllm_ascend/models/qwen2_5_vl.py for processing visual inputs that could be extracted into a helper method.

gemini-code-assist · 2025-11-12T04:50:25Z

vllm_ascend/attention/attention_v1.py

+from vllm_ascend.utils import vllm_version_is
+
+if vllm_version_is("0.11.0"):
+    from vllm.utils import cdiv
+else:
+    from vllm.utils.math_utils import cdiv


This conditional import logic for cdiv is repeated in many other files (attention/mla_v1.py, core/scheduler.py, distributed/mooncake/config_data.py, patch/platform/patch_mamba_config.py, spec_decode/mtp_proposer.py, torchair/torchair_attention.py, torchair/torchair_mla.py, torchair/torchair_sfa.py, worker/block_table.py, worker/model_runner_v1.py). A similar pattern exists for init_cached_hf_modules in worker/worker_v1.py and its test file. This widespread duplication makes the code harder to maintain and prone to errors if future updates are needed.

To improve this, I suggest creating a central compatibility utility module (e.g., vllm_ascend/utils/compat.py) to house all such version-dependent imports. Then, other files can import cdiv, init_cached_hf_modules, etc., directly from this new module, centralizing the version-checking logic.

gemini-code-assist · 2025-11-12T04:50:25Z

vllm_ascend/models/qwen2_5_vl.py

+            if vllm_version_is("0.11.0"):
+                image_embeds = self.visual(pixel_values, grid_thw=grid_thw)
+            else:
+                with set_ascend_forward_context(None, self.vllm_config):
+                    image_embeds = self.visual(pixel_values, grid_thw=grid_thw)


This if/else block to handle different vLLM versions is duplicated in _process_video_input (lines 561-567). This duplicated logic should be extracted into a private helper method to improve code clarity and maintainability. For example, you could create a _run_visual method:

def _run_visual(self, pixel_values, grid_thw): if vllm_version_is("0.11.0"): return self.visual(pixel_values, grid_thw=grid_thw) else: with set_ascend_forward_context(None, self.vllm_config): return self.visual(pixel_values, grid_thw=grid_thw)

Then you can call this helper in both _process_image_input and _process_video_input.

image_embeds = self._run_visual(pixel_values, grid_thw=grid_thw)

gemini-code-assist · 2025-11-12T04:50:25Z

vllm_ascend/patch/platform/patch_mamba_config.py

+
 from vllm.v1.kv_cache_interface import FullAttentionSpec, MambaSpec

 from vllm_ascend.utils import vllm_version_is


This import of vllm_version_is is a duplicate of the one on line 7 and should be removed.

github-actions · 2025-11-12T15:05:38Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2025-11-14T03:31:51Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

github-actions · 2025-11-14T07:48:21Z

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: leo-pony <[email protected]>

…tructured outputs compatibility#26866 Signed-off-by: leo-pony <[email protected]>

Signed-off-by: 22dimensions <[email protected]>

Signed-off-by: leo-pony <[email protected]>

Signed-off-by: 22dimensions <[email protected]>

Signed-off-by: leo-pony <[email protected]>

Signed-off-by: 22dimensions <[email protected]>

Signed-off-by: leo-pony <[email protected]>

github-actions bot added documentation Improvements or additions to documentation module:tests module:ops labels Nov 12, 2025

gemini-code-assist bot reviewed Nov 12, 2025

View reviewed changes

github-actions bot added module:core merge-conflicts labels Nov 12, 2025

leo-pony force-pushed the update_newest branch from e0ab650 to 8b7a437 Compare November 13, 2025 02:33

github-actions bot removed module:tests merge-conflicts labels Nov 13, 2025

leo-pony force-pushed the update_newest branch from 8b7a437 to ae349dc Compare November 13, 2025 02:48

github-actions bot added module:tests and removed documentation Improvements or additions to documentation module:ops labels Nov 13, 2025

22dimensions force-pushed the update_newest branch from bb66034 to 18b71a3 Compare November 13, 2025 07:53

github-actions bot added the documentation Improvements or additions to documentation label Nov 13, 2025

22dimensions force-pushed the update_newest branch from f2c1b1e to 4be60d9 Compare November 13, 2025 11:50

leo-pony force-pushed the update_newest branch from 8e5cfbf to 3cb2af9 Compare November 13, 2025 13:26

22dimensions added ready-for-test start test by label for PR ready read for review labels Nov 14, 2025

leo-pony force-pushed the update_newest branch from e07fff6 to 48de24c Compare November 14, 2025 03:17

22dimensions force-pushed the update_newest branch from 48de24c to 839dd89 Compare November 14, 2025 03:21

github-actions bot added the merge-conflicts label Nov 14, 2025

leo-pony force-pushed the update_newest branch from 3ac8c18 to 5348f4d Compare November 14, 2025 03:41

github-actions bot removed the merge-conflicts label Nov 14, 2025

leo-pony force-pushed the update_newest branch from 5348f4d to 8c90757 Compare November 14, 2025 03:45

github-actions bot added the merge-conflicts label Nov 14, 2025

leo-pony and others added 18 commits November 14, 2025 17:51

fix break by vllm commit: Support LoRA with speculative decoding #21068

c70ee9e

Signed-off-by: leo-pony <[email protected]>

[Hybrid] Pass kernel block size to builders #27753

1cb76fb

Signed-off-by: leo-pony <[email protected]>

fix the main-to-main break by:[Bug] Fix env string 0 same to True #28159

9ff893b

Signed-off-by: leo-pony <[email protected]>

[Core] Async scheduling + structured outputs compatibility#26866

c08e4fe

Signed-off-by: leo-pony <[email protected]>

fix structure output break bduring adapt to llm: Async scheduling + s…

dae0df8

…tructured outputs compatibility#26866 Signed-off-by: leo-pony <[email protected]>

fix structured outputs compatibility

b7c5978

Signed-off-by: 22dimensions <[email protected]>

fix mtp breaks in modelrunner and format fix

60412d0

Signed-off-by: leo-pony <[email protected]>

fix mypy issues

948f2e1

Signed-off-by: leo-pony <[email protected]>

model runner execute model support v0.11.0 branch

2741144

Signed-off-by: leo-pony <[email protected]>

fix format issue

2946ffd

Signed-off-by: leo-pony <[email protected]>

update to releases/v0.11.1

153da1a

Signed-off-by: 22dimensions <[email protected]>

fix break by vllm:[BugFix][VL] Fix FA selection on Qwen2.5-VL #27790

5ad73e8

Signed-off-by: leo-pony <[email protected]>

fix scheduler

72d75fe

Signed-off-by: 22dimensions <[email protected]>

skip ut, nightly, v0.11.0

3d17631

Signed-off-by: leo-pony <[email protected]>

Skip 1th e2e full

49a9725

Signed-off-by: leo-pony <[email protected]>

skip has tested cases

e626367

Signed-off-by: leo-pony <[email protected]>

Fix vllm break:Support LoRA with speculative decoding:#21068

557aa1f

Signed-off-by: leo-pony <[email protected]>

remove skip of nightly a2

26a80b4

Signed-off-by: leo-pony <[email protected]>

leo-pony force-pushed the update_newest branch from 63c97bd to 26a80b4 Compare November 14, 2025 11:58

github-actions bot removed the merge-conflicts label Nov 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] Update newest #4142

[WIP] Update newest #4142

22dimensions commented Nov 12, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Nov 12, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Nov 12, 2025

Uh oh!

gemini-code-assist bot Nov 12, 2025

Uh oh!

gemini-code-assist bot Nov 12, 2025

Uh oh!

github-actions bot commented Nov 12, 2025

Uh oh!

github-actions bot commented Nov 14, 2025

Uh oh!

github-actions bot commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		from vllm.v1.kv_cache_interface import FullAttentionSpec, MambaSpec

		from vllm_ascend.utils import vllm_version_is

[WIP] Update newest #4142

Are you sure you want to change the base?

[WIP] Update newest #4142

Conversation

22dimensions commented Nov 12, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

github-actions bot commented Nov 12, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Nov 12, 2025

Uh oh!

github-actions bot commented Nov 14, 2025

Uh oh!

github-actions bot commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

22dimensions commented Nov 12, 2025 •

edited by github-actions bot

Loading