[ROCm][Bugfix] Bring back fallback to eager mode removed in #14917, but for ROCm only #15413
Conversation
…for ROCm only Signed-off-by: Gregory Shtrasberg <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default; only a limited set of checks runs automatically. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. 🚀
SageMoore
left a comment
Looks reasonable. Just one NIT on a variable name.
vllm/config.py
Outdated
self.max_seq_len_to_capture = self.max_model_len
self.max_seq_len_to_capture = min(self.max_seq_len_to_capture,
                                  self.max_model_len)
MODEL_NOT_SUPPORT_CUDA_GRAPH_ROCM = ['mllama']
Nit: can you change this to "unsupported_models".
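For reference, a rough sketch of what the reinstated ROCm-only eager fallback could look like with the suggested rename applied. The surrounding names (`current_platform`, `hf_config`, `enforce_eager`, `logger`) follow vLLM's config code, but the exact condition and warning text below are assumptions, not the literal diff from this PR.

```python
# Sketch only, not the literal PR diff; assumes
# `from vllm.platforms import current_platform` is available in config.py.
# The list name follows the review suggestion; the warning wording is assumed.
unsupported_models_on_rocm = ['mllama']

if (current_platform.is_rocm()
        and self.hf_config.model_type in unsupported_models_on_rocm):
    logger.warning(
        "CUDA graph is not supported for %s on ROCm yet; "
        "falling back to eager mode.", self.hf_config.model_type)
    self.enforce_eager = True
```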
Signed-off-by: Gregory Shtrasberg <[email protected]>
…ect#14917, but for ROCm only (vllm-project#15413) Signed-off-by: Gregory Shtrasberg <[email protected]>
mllama doesn't work in CUDA graph mode on ROCm.
This brings back the condition removed in #14917, but only for the ROCm platform, until the underlying issue is figured out, so that Llama 3.2 is supported again.
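As a hedged illustration of the user-facing effect (the model name below is only an example and is not taken from the PR): on a ROCm host, an mllama-based Llama 3.2 vision model would now fall back to eager mode automatically instead of requiring the user to set enforce_eager by hand.

```python
# Hypothetical usage on a ROCm machine; the model name is illustrative.
from vllm import LLM

# With the fallback restored, vLLM switches this model to eager mode on
# ROCm automatically; previously enforce_eager=True had to be passed.
llm = LLM(model="meta-llama/Llama-3.2-11B-Vision-Instruct")
```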