Skip to content
This repository was archived by the owner on Sep 4, 2025. It is now read-only.

Conversation

@dtrifiro
Copy link

No description provided.

maxdebayser and others added 30 commits October 8, 2024 10:58
Signed-off-by: Max de Bayser <[email protected]>
Co-authored-by: sanghol <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
@openshift-ci
Copy link

openshift-ci bot commented Nov 13, 2024

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@openshift-ci
Copy link

openshift-ci bot commented Nov 13, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dtrifiro

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@dtrifiro dtrifiro marked this pull request as ready for review November 13, 2024 10:48
@dtrifiro dtrifiro requested a review from njhill as a code owner November 13, 2024 10:48
@dtrifiro dtrifiro merged commit 9ac4882 into release Nov 13, 2024
0 of 7 checks passed
groenenboomj pushed a commit that referenced this pull request Feb 24, 2025
vaibhavjainwiz pushed a commit to vajain-rhods/vllm that referenced this pull request Jun 4, 2025
## Description
Syncing to upstream vLLM upstream tag
[v0.9.0](https://github.com/vllm-project/vllm/tree/v0.9.0)
This PR does *not* target any tag upstream as of now. 
Builds on top of:
neuralmagic/nm-vllm-ent@eea2469

Git log:
```
commit 922878c (HEAD -> upstream-sync-2025-05-21-notag, nm-fork/upstream-sync-2025-05-21-notag)
Merge: a6275cd 5873877
Author: Selbi Nuryyeva <[email protected]>
Date:   Wed May 28 16:21:22 2025 -0400

    Merge tag 'v0.9.0' into upstream-sync-2025-05-21-notag

commit 5873877 (tag: v0.9.0, upstream/releases/v0.9.0)
Author: Michael Goin <[email protected]>
Date:   Tue May 27 12:05:37 2025 -0400

    [Bugfix] Mistral tool calling when content is list (vllm-project#18729)
    
    Signed-off-by: mgoin <[email protected]>

commit 696259c
Author: Cyrus Leung <[email protected]>
Date:   Tue May 27 23:45:48 2025 +0800

    [Core] Automatically cast multi-modal input dtype (vllm-project#18756)
    
    Signed-off-by: DarkLight1337 <[email protected]>

commit 6b6d496
Author: chunxiaozheng <[email protected]>
Date:   Tue May 27 21:08:44 2025 +0800

    optimize get_kv_cache_torch_dtype (vllm-project#18531)
    
    Signed-off-by: idellzheng <[email protected]>

commit aaa4ac1
Author: cascade <[email protected]>
Date:   Tue May 27 05:06:34 2025 -0700

    Disable prefix cache by default for benchmark (vllm-project#18639)
    
    Signed-off-by: cascade812 <[email protected]>

commit 06a0338
Author: Mark McLoughlin <[email protected]>
Date:   Tue May 27 10:37:06 2025 +0100

    [V1][Metrics] Add API for accessing in-memory Prometheus metrics (vllm-project#17010)
    
    Signed-off-by: Mark McLoughlin <[email protected]>
```
Commands
```
git fetch upstream --tags
git checkout -b upstream-sync-2025-05-21-notag
git merge v0.9.0
```


## Testing

[accept-sync](https://github.com/neuralmagic/nm-cicd/actions/runs/15309901499)
run

Notes: couple lm-evals are failing but that's because model locations
are not fully updated and some model card numbers need updating because
of that. one unit test is failing due to flashinfer version. These
updates will be done in a separate PR in nm-cicd repo. The merge is good
to go.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.