Enhance vector store observability support #1262

ThomasVitale · 2024-08-21T20:29:27Z

Consolidate usage of “db.collection.name” attribute to track table name, collection name, index name, document name, or whatever concept a vector database uses to store data. Removed “db.index” that was use sometimes instead of “db.collection.name”. This usage is in line with the OpenTelemetry Semantic Conventions.
Configure query response content to be included as a “span event” instead of a “span attribute” if the backend system supports that, similar to how we do for the model observations.
Structure vector store observation attributes in dedicated enums, including one for the Spring AI Kinds to avoid hard-coding the same value in a lot of places. This follows the OpenTelemetry Semantic Conventions as much as possible. Also, adopt Spring usual non-null-by-default strategy as much as possible.
Align vector store conventions to the chat model ones, and follow alphabetical order for values. This is particularly useful for the convention classes, for which the Micrometer performance of exporting telemetry data improves when key values are added already sorted to the context.
Fix flaky test in Mistral AI.
Improve Qdrant integration tests.

ThomasVitale · 2024-08-22T06:19:12Z

...stral-ai/src/test/java/org/springframework/ai/mistralai/MistralAiChatModelObservationIT.java

 			.hasHighCardinalityKeyValue(HighCardinalityKeyNames.REQUEST_TOP_K.asString(), KeyValue.NONE_VALUE)
 			.hasHighCardinalityKeyValue(HighCardinalityKeyNames.REQUEST_TOP_P.asString(), "1.0")
-			.hasHighCardinalityKeyValue(HighCardinalityKeyNames.RESPONSE_ID.asString(), responseMetadata.getId())
+			.hasHighCardinalityKeyValue(HighCardinalityKeyNames.RESPONSE_ID.asString(),


This was flaky, Mistral AI doesn't always return a Response ID in streaming mode.

ThomasVitale · 2024-08-22T06:19:36Z

.../main/java/org/springframework/ai/chat/observation/ChatModelCompletionObservationFilter.java

 		chatModelObservationContext
 			.addHighCardinalityKeyValue(ChatModelObservationDocumentation.HighCardinalityKeyNames.COMPLETION
-				.withValue(ChatModelObservationContentProcessor.concatenateStrings(completions)));
+				.withValue(TracingHelper.concatenateStrings(completions)));


Extracted the logic into the TracingHelper to re-use it for the vector store observations.

ThomasVitale · 2024-08-22T06:20:01Z

...re/src/main/java/org/springframework/ai/observation/conventions/AiObservationAttributes.java

-	/**
-	 * The name of the operation or command being executed.
-	 */
-	DB_OPERATION_NAME("db.operation.name"),;


Moved to VectorStoreObservationAttributes

ThomasVitale · 2024-08-22T06:20:33Z

spring-ai-core/src/main/java/org/springframework/ai/observation/conventions/SpringAiKind.java

+
+	// @formatter:off
+
+	CHAT_CLIENT("chat_client"),


The first two values here are not used yet, I didn't want to make the PR even bigger. We can adopt them in a separate PR.

ThomasVitale · 2024-08-22T06:22:08Z

.../org/springframework/ai/vectorstore/observation/DefaultVectorStoreObservationConvention.java

 import io.micrometer.common.KeyValues;

 /**
+ * Default conventions to populate observations for vector store operations.


The main change here is a resorting to feed Micrometer already sorted list of key values for a performance improvement, and updating the observation name to be closer to the semantic conventions

* Consolidate usage of “db.collection.name” attribute to track table name, collection name, index name, document name, or whatever concept a vector database uses to store data. Removed “db.index” that was use sometimes instead of “db.collection.name”. This usage is in line with the OpenTelemetry Semantic Conventions. * Configure query response content to be included as a “span event” instead of a “span attribute” if the backend system supports that, similar to how we do for the model observations. * Structure vector store observation attributes in dedicated enums, including one for the Spring AI Kinds to avoid hard-coding the same value in a lot of places. This follows the OpenTelemetry Semantic Conventions as much as possible. Also, adopt Spring usual non-null-by-default strategy as much as possible. * Align vector store conventions to the chat model ones, and follow alphabetical order for values. This is particularly useful for the convention classes, for which the Micrometer performance of exporting telemetry data improves when key values are added already sorted to the context. * Fix flaky test in Mistral AI. * Improve Qdrant integration tests. Signed-off-by: Thomas Vitale <[email protected]>

tzolov · 2024-08-22T07:38:01Z

I've added .withCollectionName(this.pineconeIndexName) to the PineconeVectorStore

tzolov · 2024-08-22T07:40:21Z

Rebased and merged at 036093a

ThomasVitale force-pushed the enhance-vector-store-observability branch from 29139ee to a5fd8ed Compare August 22, 2024 06:16

ThomasVitale commented Aug 22, 2024

View reviewed changes

ThomasVitale force-pushed the enhance-vector-store-observability branch from a5fd8ed to 89ffd68 Compare August 22, 2024 06:23

tzolov closed this Aug 22, 2024

dev-jonghoonpark mentioned this pull request Jun 19, 2025

chore: Remove unnecessary environment variable validation in QdrantVectorStoreIT #3615

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enhance vector store observability support #1262

Enhance vector store observability support #1262

Uh oh!

ThomasVitale commented Aug 21, 2024

Uh oh!

ThomasVitale Aug 22, 2024

Uh oh!

ThomasVitale Aug 22, 2024

Uh oh!

ThomasVitale Aug 22, 2024

Uh oh!

ThomasVitale Aug 22, 2024

Uh oh!

ThomasVitale Aug 22, 2024

Uh oh!

tzolov commented Aug 22, 2024

Uh oh!

tzolov commented Aug 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Enhance vector store observability support #1262

Enhance vector store observability support #1262

Uh oh!

Conversation

ThomasVitale commented Aug 21, 2024

Uh oh!

ThomasVitale Aug 22, 2024

Choose a reason for hiding this comment

Uh oh!

ThomasVitale Aug 22, 2024

Choose a reason for hiding this comment

Uh oh!

ThomasVitale Aug 22, 2024

Choose a reason for hiding this comment

Uh oh!

ThomasVitale Aug 22, 2024

Choose a reason for hiding this comment

Uh oh!

ThomasVitale Aug 22, 2024

Choose a reason for hiding this comment

Uh oh!

tzolov commented Aug 22, 2024

Uh oh!

tzolov commented Aug 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants