forked from opendatahub-io/vllm
-
Notifications
You must be signed in to change notification settings - Fork 15
sync with nm-vllm-ent @ v0.8.3.0-rc0 #103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
ckhordiasma
merged 1,003 commits into
red-hat-data-services:rhoai-2.20
from
dtrifiro:rhoai-2.19-sync-with-midstream-0.8.3.0
Apr 18, 2025
Merged
sync with nm-vllm-ent @ v0.8.3.0-rc0 #103
ckhordiasma
merged 1,003 commits into
red-hat-data-services:rhoai-2.20
from
dtrifiro:rhoai-2.19-sync-with-midstream-0.8.3.0
Apr 18, 2025
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Signed-off-by: youkaichao <[email protected]>
…ect#15544) Signed-off-by: Harry Mellor <[email protected]>
…interface (vllm-project#15317) Signed-off-by: Aaron Pham <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: Marko Rosenmueller <[email protected]>
Signed-off-by: Alexei V. Ivanov <[email protected]>
Signed-off-by: Alexander Matveev <[email protected]>
Signed-off-by: Alexander Matveev <[email protected]>
Signed-off-by: Chengji Yao <[email protected]>
Signed-off-by: Matthew Vine <[email protected]>
Signed-off-by: ElizaWszola <[email protected]> Signed-off-by: ElizaWszola <[email protected]> Co-authored-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Chenyaaang <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]> Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: weizeng <[email protected]>
Signed-off-by: Mengqing Cao <[email protected]>
Signed-off-by: Cody Yu <[email protected]> Co-authored-by: Woosuk Kwon <[email protected]>
…s-patch-3 add retries and get rid of progress meter
Signed-off-by: Gregory Shtrasberg <[email protected]>
…QS (vllm-project#15583) Signed-off-by: Chengji Yao <[email protected]>
Signed-off-by: Bella kira <[email protected]>
…oject#15587) Signed-off-by: ElizaWszola <[email protected]> Signed-off-by: ElizaWszola <[email protected]> Signed-off-by: [email protected] <[email protected]> Co-authored-by: ElizaWszola <[email protected]> Co-authored-by: Lucas Wilkinson <[email protected]> Co-authored-by: ElizaWszola <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
…ect#14917, but for ROCm only (vllm-project#15413) Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: kevin <[email protected]>
"variables" in `docker-bake.hcl` can have defaults, but are overridden
by env vars with the same name. We can remove these (useless) defaults
and fix the name for `GITHUB_REPO` (it's actually `GITHUB_REPOSITORY`)
Example:
```bash
env \
GITHUB_REPOSITORY=neuralmagic/nm-vllm-ent \
PYTHON_VERSION=3.12 \
GITHUB_SHA=$(git rev-parse HEAD) \
VLLM_VERSION=0.8.3 \
docker buildx bake cuda --print
```
output:
```json
{
"group": {
"default": {
"targets": [
"cuda"
]
}
},
"target": {
"cuda": {
"context": ".",
"dockerfile": "Dockerfile.ubi",
"args": {
"BASE_UBI_IMAGE_TAG": "9.5-1739420147",
"FLASHINFER_VERSION": "https://github.com/flashinfer-ai/flashinfer/releases/download/v0.2.1.post1/flashinfer_python-0.2.1.post1+cu124torch2.5-cp38-abi3-linux_x86_64.whl",
"LIBSODIUM_VERSION": "1.0.20",
"PYTHON_VERSION": "3.12",
"VLLM_TGIS_ADAPTER_VERSION": "0.6.3"
},
"labels": {
"org.opencontainers.image.source": "https://github.com/neuralmagic/nm-vllm-ent",
"vcs-ref": "9803ee1c6d30330c9dc3fca6d42491794f135013",
"vcs-type": "git"
},
"tags": [
"quay.io/vllm/vllm:0.8.3",
"quay.io/vllm/vllm:9803ee1c6d30330c9dc3fca6d42491794f135013",
"quay.io/vllm/vllm:2025-04-04-17-55"
],
"platforms": [
"linux/amd64"
]
}
}
}
```
vllm-project#15906) Signed-off-by: Nick Hill <[email protected]>
…t#15712) Signed-off-by: DarkLight1337 <[email protected]> Signed-off-by: mgoin <[email protected]> Signed-off-by: Roger Wang <[email protected]> Co-authored-by: mgoin <[email protected]> Co-authored-by: Roger Wang <[email protected]>
…6071) Signed-off-by: Bill Nell <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
…put queue (vllm-project#15906)" This reverts commit 651cf0f.
This comment was marked as outdated.
This comment was marked as outdated.
|
made a copy of this PR with included PR pipeline here - #107 |
ckhordiasma
approved these changes
Apr 18, 2025
b63f4cb
into
red-hat-data-services:rhoai-2.20
2 of 3 checks passed
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Sync with https://github.com/neuralmagic/nm-vllm-ent/releases/tag/v0.8.3.0-rc
0.7.0