Llama Stack Performance Analysis: Benchmarking with vLLM Inference Engine #3515
leseb started this conversation in Show and tell
Red Hat has been conducting performance evaluations of Llama Stack with vLLM. The testing was carried out by @tosokin, and I’m simply building on her work. As a result of these evaluations, a few issues have been opened.
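For readers who want a feel for the kind of measurement involved, here is a minimal latency probe against an OpenAI-compatible chat completions endpoint. This is only an illustrative sketch: the base URL, port, model id, and sample sizes are assumptions, not the configuration used in the Red Hat evaluation.

```python
# Minimal latency probe for an OpenAI-compatible chat completions endpoint.
# The URL, model id, and request counts below are assumptions for
# illustration; they are not the setup used in the evaluation.
import statistics
import time

import requests

BASE_URL = "http://localhost:8321/v1/openai/v1"  # assumed Llama Stack address
MODEL = "meta-llama/Llama-3.1-8B-Instruct"       # assumed model id


def time_one_request() -> float:
    """Send one non-streaming chat completion and return wall-clock latency."""
    start = time.perf_counter()
    resp = requests.post(
        f"{BASE_URL}/chat/completions",
        json={
            "model": MODEL,
            "messages": [{"role": "user", "content": "Say hello in one word."}],
            "max_tokens": 16,
        },
        timeout=120,
    )
    resp.raise_for_status()
    return time.perf_counter() - start


latencies = [time_one_request() for _ in range(20)]
p95 = statistics.quantiles(latencies, n=100)[94]
print(f"mean: {statistics.mean(latencies):.3f}s  p95: {p95:.3f}s")
```

A real benchmark would additionally drive concurrent requests and measure token throughput, but even a simple end-to-end timer like this is enough to compare a direct vLLM endpoint against the same backend fronted by Llama Stack.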
Today, we’d like to share the results more widely and invite your comments and feedback.
Thanks!