[KV Connector] Make KVCacheConfig an explicit constructor argument #27887

markmc · 2025-10-31T15:34:30Z

Follow on from #25712

VllmConfig is explicitly designed as a dataclass containing user-provided configuration and model metadata. It is a global configuration object that lives throughout the entire engine lifetime and is meant to be immutable after __post_init__().

KVCacheConfig is worker-specific, runtime-computed state. It has limited lifetime, and its purpose is limited to initializing the KV Cache in the model runner.

Even if we add KV cache hints to model config.json in future, this would be parsed into ModelConfig, used as input to the get_kv_cache_configs() computation, and the resulting KVCacheConfig would still be runtime state.

We are currently creating per-worker copies of VllmConfig in order to attach the runtime KVCacheConfig state. But instead we should just explicitly pass KVCacheConfig to the connector.

Make sure to handle backwards compatibility for external connector implementations (loaded via module path) that have the old style constructor signature.
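A minimal sketch of how such a compatibility shim could look, assuming the factory inspects the constructor signature before instantiating. The names `accepts_kwarg`, `create_connector`, and the two toy connector classes are hypothetical and exist only for illustration; they are not vLLM's actual helpers, and plain strings and dicts stand in for the real config objects.

```python
import inspect


def accepts_kwarg(cls: type, name: str) -> bool:
    """Return True if cls.__init__ can be called with keyword argument `name`."""
    params = inspect.signature(cls.__init__).parameters
    if name in params:
        return params[name].kind in (
            inspect.Parameter.POSITIONAL_OR_KEYWORD,
            inspect.Parameter.KEYWORD_ONLY,
        )
    # A **kwargs catch-all also accepts the keyword.
    return any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values())


class OldStyleConnector:
    """Hypothetical external connector with the old two-argument signature."""

    def __init__(self, vllm_config, role):
        self.vllm_config = vllm_config
        self.role = role
        self.kv_cache_config = None


class NewStyleConnector:
    """Hypothetical connector updated to take kv_cache_config explicitly."""

    def __init__(self, vllm_config, role, kv_cache_config):
        self.vllm_config = vllm_config
        self.role = role
        self.kv_cache_config = kv_cache_config


def create_connector(cls, vllm_config, role, kv_cache_config):
    """Pass kv_cache_config only to constructors that declare it."""
    if accepts_kwarg(cls, "kv_cache_config"):
        return cls(vllm_config, role, kv_cache_config=kv_cache_config)
    # Old-style signature: fall back to the two-argument call.
    return cls(vllm_config, role)
```

With this shape, a connector loaded via module path keeps working unchanged, while updated connectors receive the runtime state directly instead of reading it off a per-worker copy of the config.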

gemini-code-assist

Code Review

This pull request refactors KV connector instantiation to explicitly pass KVCacheConfig, which is a good improvement for separating runtime state from global configuration. The implementation of backward compatibility for external connectors is also well-handled in the factory. However, I've identified a critical issue in MultiConnector where it incorrectly instantiates its sub-connectors, which breaks backward compatibility and fails to pass the kv_cache_config. I've also found a minor issue in a test utility. Please see my comments for details and suggestions.
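One way to read the MultiConnector concern: a connector that wraps other connectors should build its children through the same compatibility-aware path as the top-level connector, so that each child gets `kv_cache_config` if it accepts it. The sketch below is an illustration under that assumption only. `build_connector`, `CompositeConnector`, and the child classes are hypothetical names, and passing `child_classes` directly is a simplification (the real connector reads its sub-connectors from configuration).

```python
import inspect


def build_connector(cls, vllm_config, role, kv_cache_config):
    """Hypothetical shared entry point for top-level and nested connectors."""
    # inspect.signature on a class reports its constructor parameters (minus self).
    if "kv_cache_config" in inspect.signature(cls).parameters:
        return cls(vllm_config, role, kv_cache_config)
    return cls(vllm_config, role)  # old-style signature


class CompositeConnector:
    """Toy stand-in for a connector that wraps several sub-connectors."""

    def __init__(self, vllm_config, role, kv_cache_config, child_classes):
        # Children go through build_connector rather than being called directly,
        # so old-style and new-style children are both handled.
        self.children = [
            build_connector(child, vllm_config, role, kv_cache_config)
            for child in child_classes
        ]


class LegacyChild:
    def __init__(self, vllm_config, role):
        self.kv_cache_config = None


class UpdatedChild:
    def __init__(self, vllm_config, role, kv_cache_config):
        self.kv_cache_config = kv_cache_config
```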

vllm/distributed/kv_transfer/kv_connector/v1/multi_connector.py

tests/v1/kv_connector/unit/utils.py

markmc · 2025-10-31T15:37:26Z

💡 Codex Review

Here are some automated review suggestions for this pull request.


vllm/v1/core/sched/scheduler.py

markmc · 2025-10-31T15:39:57Z

xref #27811

/cc @KuntaiDu @njhill @heheda12345


KuntaiDu

In general LGTM. Some nits listed in comments.

KuntaiDu · 2025-11-03T05:55:53Z

vllm/distributed/kv_transfer/kv_connector/factory.py

                    f"Class {connector_name} not found in {connector_module_path}"
                ) from e
            connector_cls = cast(type[KVConnectorBaseType], connector_cls)
+            if not supports_kw(connector_cls, "kv_cache_config"):


Just to confirm: does this mean that we allow a connector to take kv_cache_config as an init argument even when it does not support the hybrid allocator (i.e. it is not a subclass of SupportsHMA)?

Yeah. Since we are currently attaching KVCacheConfig to VllmConfig unconditionally, I didn't even consider making it conditional until now.

If we think that KVCacheConfig will only be used as part of implementing request_finished_all_groups(), we could add init_hma() or init_kv_cache_config() to SupportsHMA?

KVCacheConfig takes effect on almost all connector methods when HMA is enabled and it is not useful at all when HMA is disabled.

That said, the current implementation looks good to me as it is simple enough.
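For comparison, the opt-in alternative floated above (an initialization hook instead of a constructor argument) might look roughly like the sketch below. This is not what the PR implements. `SupportsKVCacheConfigInit`, `finish_init`, and both connector classes are hypothetical, and modelling the capability as a `typing.Protocol` is an assumption made here for a self-contained example.

```python
from typing import Any, Protocol, runtime_checkable


@runtime_checkable
class SupportsKVCacheConfigInit(Protocol):
    """Hypothetical opt-in capability: connector wants the KV cache config."""

    def init_kv_cache_config(self, kv_cache_config: Any) -> None: ...


class PlainConnector:
    """Connector that never looks at the KV cache config."""

    def __init__(self, vllm_config, role):
        self.vllm_config = vllm_config
        self.role = role


class HybridAwareConnector:
    """Connector that opts in by implementing the hook."""

    def __init__(self, vllm_config, role):
        self.vllm_config = vllm_config
        self.role = role
        self.kv_cache_config = None

    def init_kv_cache_config(self, kv_cache_config):
        self.kv_cache_config = kv_cache_config


def finish_init(connector, kv_cache_config) -> bool:
    """Deliver kv_cache_config only to connectors that implement the hook."""
    if isinstance(connector, SupportsKVCacheConfigInit):
        connector.init_kv_cache_config(kv_cache_config)
        return True
    return False
```

The trade-off discussed in the thread is visible here: the hook keeps the constructor signature stable, but only connectors that opt in ever see the config, whereas the constructor-argument approach expects every connector to adopt the new signature eventually.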

KuntaiDu · 2025-11-03T06:00:27Z

tests/v1/kv_connector/unit/test_backwards_compatibility.py

@@ -0,0 +1,275 @@
+# SPDX-License-Identifier: Apache-2.0


Maybe rename this file? Like test_connector_init_with_kv_cache_config or something.

Obviously I don't really mind renaming it, if it helps, but my thinking is that these tests cover connectors that have not yet been updated to the new signature, so it's more like "without_kv_cache_config()".

Basically, because all connectors are expected to support the new signature, we'll soon see these tests as old cruft that we need to keep around for a while.

(This is different from the SupportsHMA approach. In that case, maybe only a small subset of connectors would be updated to take KVCacheConfig.)

markmc · 2025-11-03T07:41:59Z

Just capturing some of the points from the Slack discussion, since I think it was very useful 👍

@heheda12345

I think in the long term, kv cache config will be part of vllm config. But for now, as it is not created during vllm_config initialization, my concern is that there are multiple copies of vllm_config inside vllm, and it's difficult to keep them synchronized. So I suggest that @KuntaiDu add it only as a special attribute for the connector.

But I don't want something like __init__(vllm_config, kv_cache_config), as kv_cache_config may be part of vllm_config in the future.

Even if we add KV cache hints to model config.json in future, this would be parsed into ModelConfig, used as input to the get_kv_cache_configs() computation, and the resulting KVCacheConfig would still be runtime state.

If we can do get_kv_cache_configs(model_config), then kv_cache_config can be initialized during config resolution and is immutable.

@markmc

In future, KVCacheConfig will only ever contain immutable config from user/model, and it will no longer ever contain runtime computed state?

IMHO we should not attach runtime computed state to VllmConfig. It makes sense to have KVConnector.__init__(VllmConfig, KVCacheConfig) now, and to work towards making KVCacheConfig only immutable config and then deprecating the KVCacheConfig argument.

@heheda12345

should we keep connector interface as stable as possible?

@markmc

yes, but IMO "stability" means "backwards compatible for a (potentially long) deprecation period"
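To make "backwards compatible for a deprecation period" concrete, here is a hedged sketch of a loader that still accepts old-style connectors but warns once per class. Whether vLLM emits such a warning is not stated in this thread. The function name, the warning text, and the toy connector classes are all assumptions for illustration.

```python
import inspect
import warnings

# Classes we have already warned about, so each is reported only once.
_warned: set[type] = set()


def instantiate_with_deprecation(cls, vllm_config, role, kv_cache_config):
    """Hypothetical loader: prefer the new signature, tolerate the old one."""
    if "kv_cache_config" in inspect.signature(cls).parameters:
        return cls(vllm_config, role, kv_cache_config)
    if cls not in _warned:
        _warned.add(cls)
        warnings.warn(
            f"{cls.__name__} uses the old (vllm_config, role) constructor "
            "signature; please add a kv_cache_config argument.",
            DeprecationWarning,
            stacklevel=2,
        )
    return cls(vllm_config, role)


class LegacyConnector:
    def __init__(self, vllm_config, role):
        self.role = role


class CurrentConnector:
    def __init__(self, vllm_config, role, kv_cache_config):
        self.role = role
        self.kv_cache_config = kv_cache_config
```

Warning once per class keeps logs readable when many workers construct the same connector, while still giving external maintainers a visible nudge for as long as the old path is supported.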

NickLucche

lgtm, let's keep compat code under control though

heheda12345 · 2025-11-04T04:54:52Z

vllm/distributed/kv_transfer/kv_connector/v1/lmcache_connector.py

+        self,
+        vllm_config: "VllmConfig",
+        role: KVConnectorRole,
+        kv_cache_config: "KVCacheConfig",


@KuntaiDu do you need to change LMCache connector? kv_cache_config is not available in vllm config now.

I need to change LMCache PR correspondingly, but we can merge this PR first and I can change it in LMCache afterwards.

heheda12345

LGTM! Leave this PR to @KuntaiDu for compatibility with LMCache + HMA

KuntaiDu · 2025-11-04T07:00:45Z

LGTM! Leave this PR to @KuntaiDu for compatibility with LMCache + HMA

Thanks for the reminder! I will fix LMCache compatibility after this PR is merged (there is still a lot of work on the LMCache side before turning on HMA by default for LMCache).


markmc requested review from ApostaC, NickLucche, WoosukKwon, alexm-redhat, comaniac, heheda12345, njhill, robertgshaw2-redhat and ywang96 as code owners October 31, 2025 15:34

mergify bot added v1 kv-connector labels Oct 31, 2025

gemini-code-assist bot reviewed Oct 31, 2025


chatgpt-codex-connector bot reviewed Oct 31, 2025


markmc requested a review from KuntaiDu October 31, 2025 15:40

markmc force-pushed the connector-kv-cache-config branch from 4513b7c to 15ec91b Compare October 31, 2025 15:41

markmc force-pushed the connector-kv-cache-config branch from 15ec91b to fdc30ae Compare October 31, 2025 17:24

markmc added the ready label Oct 31, 2025

KuntaiDu approved these changes Nov 3, 2025


NickLucche approved these changes Nov 3, 2025


heheda12345 reviewed Nov 4, 2025


heheda12345 approved these changes Nov 4, 2025


KuntaiDu merged commit 58279c6 into vllm-project:main Nov 4, 2025
54 checks passed

omerpaz95 pushed a commit to omerpaz95/vllm that referenced this pull request Nov 4, 2025

[KV Connector] Make KVCacheConfig an explicit constructor argument (v…

546cfe6

