Upgrade to 0.11.1 newest vllm commit #3762
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
Signed-off-by: Icey <[email protected]>
Here we first fix spec decoding; returning logprobs for spec decoding can be future work.
return max(layer_counts)
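For context on the hunk above, a minimal sketch of what taking the maximum over per-group layer counts does (the function name and the KV-cache-group framing are assumptions for illustration, not the PR's actual code):

```python
def max_layer_count(layer_counts: list[int]) -> int:
    """Pick the largest attention-layer count across groups.

    `layer_counts` is assumed to hold one entry per KV-cache group;
    when the groups disagree, the maximum is the conservative choice.
    """
    if not layer_counts:
        raise ValueError("expected at least one layer count")
    return max(layer_counts)

print(max_layer_count([24, 28, 24]))  # 28
```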
# Update cudagraph capture sizes for vllm config
This may not be correct. I'll look into it more.
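For readers unfamiliar with the capture sizes being updated here: a hedged sketch of producing cudagraph capture sizes up to a maximum batch size. The progression (1, 2, 4, then multiples of 8) mirrors a common vLLM convention but is an assumption in this sketch, not the PR's code:

```python
def capture_sizes(max_size: int) -> list[int]:
    """Assumed cudagraph capture-size progression: 1, 2, 4, then 8, 16, ...

    Each size is a batch size for which a CUDA/NPU graph would be captured.
    """
    sizes = [s for s in (1, 2, 4) if s <= max_size]
    sizes += list(range(8, max_size + 1, 8))
    return sizes

print(capture_sizes(16))  # [1, 2, 4, 8, 16]
```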
if vllm_version_is("0.11.0"):
    if not model_config.is_multimodal_model and \
        structured_outputs_config.backend == "auto" and \
        not scheduler_config.send_delta_data and \
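The hunk above gates on the installed vLLM version. A minimal sketch of that pattern, where `vllm_version_is` is re-implemented as a stand-in for the helper in vllm-ascend's utils and the hard-coded installed version is an assumption:

```python
INSTALLED_VLLM_VERSION = "0.11.1"  # assumption for illustration

def vllm_version_is(target: str) -> bool:
    # Stand-in for vllm-ascend's helper: exact-match against the
    # installed vLLM version string.
    return INSTALLED_VLLM_VERSION == target

if vllm_version_is("0.11.0"):
    print("take the 0.11.0 code path")
else:
    print("take the newest-commit code path")
```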
getattr(scheduler_config, "send_delta_data", False)
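The suggested `getattr` replaces the direct attribute access so the check also works on vLLM versions where `SchedulerConfig` no longer defines `send_delta_data`. A minimal illustration using stand-in config objects, not vLLM's real classes:

```python
from types import SimpleNamespace

# Stand-ins: an older config that still carries the flag, and a newer
# one where the attribute was removed upstream.
old_cfg = SimpleNamespace(send_delta_data=True)
new_cfg = SimpleNamespace()

# Direct access would raise AttributeError on new_cfg; getattr with a
# default degrades gracefully on both versions.
print(getattr(old_cfg, "send_delta_data", False))  # True
print(getattr(new_cfg, "send_delta_data", False))  # False
```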
What this PR does / why we need it?
vllm-project/vllm@83f478b
- Fix the spec decode rejection sampler, caused by vllm-project/vllm#26060
- Fix some imports, caused by vllm-project/vllm#27374
- Fix `scheduler_config.send_delta_data`, caused by #3719
- Fix `init_with_cudagraph_sizes`, caused by vllm-project/vllm#26016
- Fix the VL model's replacement of PatchEmbed's conv3d with a linear layer, caused by vllm-project/vllm#27418

Does this PR introduce any user-facing change?
N/A
How was this patch tested?
CI passed with newly added and existing tests.