File tree Expand file tree Collapse file tree 1 file changed +10
-1
lines changed Expand file tree Collapse file tree 1 file changed +10
-1
lines changed Original file line number Diff line number Diff line change @@ -39,7 +39,16 @@ The following metrics are exposed:
3939
4040The following metrics are deprecated and due to be removed in a future version:
4141
42- - * (No metrics are currently deprecated)*
42+ - ` vllm:num_requests_swapped ` , ` vllm:cpu_cache_usage_perc ` , and
43+ ` vllm:cpu_prefix_cache_hit_rate ` because KV cache offloading is not
44+ used in V1.
45+ - ` vllm:gpu_prefix_cache_hit_rate ` is replaced by queries+hits
46+ counters in V1.
47+ - ` vllm:time_in_queue_requests ` because it duplicates
48+ ` vllm:request_queue_time_seconds ` .
49+ - ` vllm:model_forward_time_milliseconds ` and
50+ ` vllm:model_execute_time_milliseconds ` because
51+ prefill/decode/inference time metrics should be used instead.
4352
4453Note: when metrics are deprecated in version ` X.Y ` , they are hidden in version ` X.Y+1 `
4554but can be re-enabled using the ` --show-hidden-metrics-for-version=X.Y ` escape hatch,
You can’t perform that action at this time.
0 commit comments