
Conversation

@bbartels (Contributor) commented Oct 31, 2025

Purpose

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the before/after results comparison or e2e results.
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request introduces a new endpoint to support Anthropic's Messages API, which is a valuable addition. The implementation involves creating a new API endpoint and a serving layer to translate between Anthropic and OpenAI protocols, along with corresponding tests. My review has identified a few critical and high-severity issues that should be addressed. These include a bug in the tests that prevents a test case from running, a potential server crash due to a null reference in the new API endpoint, incorrect error response formatting, and missing error handling. I've provided detailed comments and suggestions for each of these points.
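To make the translation layer concrete, here is a minimal sketch of the request-side mapping the review describes. This is not the PR's actual code: the helper name is hypothetical, and the field names follow the public Anthropic Messages and OpenAI Chat Completions schemas.

```python
from typing import Any


def messages_to_chat_completion(body: dict[str, Any]) -> dict[str, Any]:
    """Map an Anthropic /v1/messages request onto an OpenAI
    /v1/chat/completions request (illustrative sketch only)."""
    messages: list[dict[str, Any]] = []
    # Anthropic carries the system prompt as a top-level field;
    # OpenAI expects it as the first chat message.
    if body.get("system"):
        messages.append({"role": "system", "content": body["system"]})
    # Anthropic message content may be a string or a list of content
    # blocks; a real adapter would normalize the block form too.
    messages.extend(body["messages"])

    return {
        "model": body["model"],
        "messages": messages,
        # max_tokens is required by the Messages API but optional in
        # Chat Completions, so it always passes through.
        "max_tokens": body["max_tokens"],
        "temperature": body.get("temperature"),
        "stream": body.get("stream", False),
    }
```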

@chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

@bbartels changed the title from "Adds anthropic messages endpoint to openai server" to "Adds anthropic /v1/messages endpoint to openai api_server" on Oct 31, 2025
@bbartels (Contributor Author) commented:

/gemini review

@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request adds support for the Anthropic /v1/messages endpoint to the OpenAI-compatible API server. The implementation cleverly reuses the existing chat completion logic by creating an adapter layer that translates requests and responses between Anthropic and OpenAI formats. The changes to the API server and the addition of the serving logic are well-structured. However, the newly added test file introduces significant code duplication, which could pose maintenance challenges. I've provided a suggestion to refactor the tests for better maintainability.
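Complementing the request mapping sketched earlier, the response side of such an adapter might look like the following. Again a hedged sketch rather than the PR's implementation: the function name is made up and the stop-reason table covers only the common cases.

```python
from typing import Any


def chat_completion_to_message(resp: dict[str, Any]) -> dict[str, Any]:
    """Map an OpenAI chat completion response back into the Anthropic
    Messages response shape (illustrative sketch only)."""
    choice = resp["choices"][0]
    # OpenAI finish_reason -> Anthropic stop_reason, common cases only.
    stop_reason = {
        "stop": "end_turn",
        "length": "max_tokens",
        "tool_calls": "tool_use",
    }.get(choice["finish_reason"], "end_turn")

    return {
        "id": resp["id"],
        "type": "message",
        "role": "assistant",
        "model": resp["model"],
        "content": [{"type": "text", "text": choice["message"]["content"]}],
        "stop_reason": stop_reason,
        "usage": {
            "input_tokens": resp["usage"]["prompt_tokens"],
            "output_tokens": resp["usage"]["completion_tokens"],
        },
    }
```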

@bbartels (Contributor Author) commented:

Ready for review. I'm not sure whether we should just get rid of the separate anthropic api_server, since it doesn't add much over having the openai one support Anthropic's /v1/messages endpoint.

@DarkLight1337 (Member) left a comment


I think this change is OK for now, but in the long term, the code becomes less maintainable as more APIs need to be supported. It is not ideal that the "OpenAI" server includes code from other APIs.

After this PR, I propose restructuring the code for online serving into the following modules:

  • vllm.entrypoints.serve.core: Contains the code for setting up the async client and FastAPI app. May include some common APIs such as health check.
  • vllm.entrypoints.serve.openai: Contains only the code for OpenAI endpoints (e.g. Completions API, Chat Completions API, Responses API)
  • vllm.entrypoints.serve.anthropic: Contains only the code for Anthropic endpoints (Messages API)
  • vllm.entrypoints.serve.jina: Contains only the code for JinaAI endpoints (Reranker API)
  • vllm.entrypoints.serve.vllm: Contains only the code for vLLM endpoints (e.g. Tokenize API, Pooling API, dev mode endpoints)

In vllm.entrypoints.serve, we can have the actual entrypoint, which uses .core to build the server and then incrementally attaches endpoints to the FastAPI app by importing the relevant functions from the API-specific submodules.
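To illustrate the proposal, a rough sketch of how the composed entrypoint could look. None of these modules or routers exist yet; the names simply mirror the layout above.

```python
from fastapi import APIRouter, FastAPI

# Hypothetical routers standing in for the proposed submodules; in the
# real refactor these would live in vllm.entrypoints.serve.openai and
# vllm.entrypoints.serve.anthropic respectively.
openai_router = APIRouter()
anthropic_router = APIRouter()


@openai_router.post("/v1/chat/completions")
async def chat_completions() -> dict:
    return {"todo": "OpenAI-specific serving logic"}


@anthropic_router.post("/v1/messages")
async def messages() -> dict:
    return {"todo": "Anthropic-specific serving logic"}


def build_app() -> FastAPI:
    # Stand-in for vllm.entrypoints.serve.core: build the app (and, per
    # the proposal, wire up the async engine client and common endpoints
    # such as /health).
    app = FastAPI()
    # Each API family attaches its own endpoints, so the "OpenAI" module
    # no longer has to host Anthropic code.
    app.include_router(openai_router)
    app.include_router(anthropic_router)
    return app
```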

@mgoin (Member) commented Nov 1, 2025

I agree with @DarkLight1337, especially as we need to add other endpoints from the Anthropic API, such as /v1/messages/count_tokens, in order to work well with Claude Code.

vllm serve should still be the main interface to access all the different endpoints, but we are long overdue for a restructuring of the code that identifies which endpoints are meant to match the OpenAI standard and which are alternatives.

For now, I think we should land this addition and delete the explicit anthropic serve command and code. That code has only been in main for a week, so it is better to delete it quickly than to introduce it into a release just to deprecate it. Then open a PR/issue to restructure the code afterwards.
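For reference, the public count_tokens endpoint mentioned above returns {"input_tokens": N} for a Messages-shaped body. A hedged sketch of what a handler could look like; the prompt rendering and token counting here are crude placeholders, not vLLM's implementation.

```python
from fastapi import APIRouter, Request

router = APIRouter()


@router.post("/v1/messages/count_tokens")
async def count_tokens(raw_request: Request) -> dict:
    body = await raw_request.json()
    # Placeholder rendering: concatenate string-valued message contents.
    # A real handler would apply the model's chat template and tokenizer.
    text = "\n".join(
        m["content"] for m in body.get("messages", [])
        if isinstance(m.get("content"), str)
    )
    return {"input_tokens": len(text.split())}  # crude stand-in count
```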



@pytest.mark.asyncio
async def test_anthropic_tool_call_streaming(client: anthropic.AsyncAnthropic):
A Member left a review comment on the test above:

In that case, let's not copy the tests like this, to avoid duplication in CI. We just need to check that the endpoint exists on the OpenAI server, since the same code is used to process the endpoint.
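In that spirit, a minimal existence check might be all the OpenAI-server suite needs. This sketch assumes a hypothetical `server_url` fixture pointing at a running vLLM OpenAI server.

```python
import httpx
import pytest


@pytest.mark.asyncio
async def test_messages_endpoint_exists(server_url: str):
    async with httpx.AsyncClient(base_url=server_url) as client:
        resp = await client.post("/v1/messages", json={})
        # Anything but 404 means the route is registered; the empty body
        # is expected to fail validation, not routing.
        assert resp.status_code != 404
```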

@bbartels (Contributor Author) commented Nov 1, 2025

In full agreement with you both :) Hence why I was rushing to make this change; it seems more sustainable in the long run than maintaining two separate API servers.

@bbartels (Contributor Author) commented Nov 1, 2025

@mgoin @DarkLight1337 I have now removed the old api server, including the associated tests.

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) November 1, 2025 13:22
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 1, 2025
auto-merge was automatically disabled November 1, 2025 16:41

Head branch was pushed to by a user without write access

@simon-mo simon-mo merged commit 1e88fb7 into vllm-project:main Nov 1, 2025
48 checks passed
zhaozuy pushed a commit to zhaozuy/vllm that referenced this pull request Nov 4, 2025
@chaunceyjiang (Collaborator) commented:
> I think this change is OK for now, but in the long term, the code becomes less maintainable as more APIs need to be supported. […]

@DarkLight1337 I completely agree; I'll give the refactor a try. #28040

juliendenize pushed a commit to juliendenize/vllm that referenced this pull request Nov 6, 2025
ZhengHongming888 pushed a commit to ZhengHongming888/vllm that referenced this pull request Nov 8, 2025

Labels

frontend · ready (ONLY add when PR is ready to merge/full CI is needed)


5 participants