Skip to content

Commit 9d2b4a7

Browse files
authored
[V1][Metrics] Updated list of deprecated metrics in v0.8 (#14695)
Signed-off-by: Mark McLoughlin <[email protected]>
1 parent 0b0d642 commit 9d2b4a7

File tree

1 file changed

+10
-1
lines changed

1 file changed

+10
-1
lines changed

docs/source/serving/metrics.md

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,16 @@ The following metrics are exposed:
3939

4040
The following metrics are deprecated and due to be removed in a future version:
4141

42-
- *(No metrics are currently deprecated)*
42+
- `vllm:num_requests_swapped`, `vllm:cpu_cache_usage_perc`, and
43+
`vllm:cpu_prefix_cache_hit_rate` because KV cache offloading is not
44+
used in V1.
45+
- `vllm:gpu_prefix_cache_hit_rate` is replaced by queries+hits
46+
counters in V1.
47+
- `vllm:time_in_queue_requests` because it duplicates
48+
`vllm:request_queue_time_seconds`.
49+
- `vllm:model_forward_time_milliseconds` and
50+
`vllm:model_execute_time_milliseconds` because
51+
prefill/decode/inference time metrics should be used instead.
4352

4453
Note: when metrics are deprecated in version `X.Y`, they are hidden in version `X.Y+1`
4554
but can be re-enabled using the `--show-hidden-metrics-for-version=X.Y` escape hatch,

0 commit comments

Comments
 (0)