Add unified query API for external integration #3783

dai-chen · 2025-06-16T22:59:46Z

Description

This PR introduces a new api module containing the UnifiedQueryPlanner class, which provides a high-level interface for parsing and planning PPL queries. This module is designed to support external consumers such as Spark and CLI without exposing Calcite or OpenSearch internals. README and unit tests are included to document usage and verify correctness.

Related Issues

Resolves #3734

Check List

New functionality includes testing.
New functionality has been documented.
New functionality has javadoc added.
New functionality has a user manual doc added.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Chen Dai <[email protected]>

settings.gradle

Swiddis · 2025-06-19T21:36:15Z

api/README.md

+    .cacheSchema(true)
+    .build();
+
+RelNode plan = planner.plan("source = opensearch.test");


How does one execute the plan after they receive it?

Good question. Currently, the plan isn’t directly executable. As noted in the README, the planner is designed to eventually return an executable plan—either a Calcite physical plan for immediate execution in the current JVM (useful for the OpenSearch plugin and CLI), or a SparkSQL plan for distributed execution by Spark (useful for PPL in Spark).

I initially considered designing the API this way, but haven’t yet found a clean way to model everything within Calcite’s optimizer. I plan to work on this later, especially since PPL in Spark Phase 2 may require it.

api/src/main/java/org/opensearch/sql/api/UnifiedQueryPlanner.java

Signed-off-by: Chen Dai <[email protected]>

dai-chen · 2025-06-23T21:49:44Z

@LantaoJin @penghuo Please have a look when you have a moment. This is currently only for initial phase in opensearch-project/opensearch-spark#1136 so we can begin publishing PRs on Spark side. Thanks!

dai-chen · 2025-06-24T18:00:45Z

There seems flaky test.

2025-06-24T17:39:21.3968640Z 3577 tests completed, 1 failed, 540 skipped
2025-06-24T17:39:21.3969870Z Tests with failures:
2025-06-24T17:39:21.4097140Z  - org.opensearch.sql.calcite.tpch.CalcitePPLTpchIT.testQ19

* Add api module with API and UT Signed-off-by: Chen Dai <[email protected]> * Refactor catalog API and clean up build.gradle Signed-off-by: Chen Dai <[email protected]> * Add cache schema API and refactor UT Signed-off-by: Chen Dai <[email protected]> * Add readme Signed-off-by: Chen Dai <[email protected]> * Add comment for hardcoding query size limit Signed-off-by: Chen Dai <[email protected]> * Add default namespace API with more UTs Signed-off-by: Chen Dai <[email protected]> --------- Signed-off-by: Chen Dai <[email protected]>

Add api module with API and UT

89861b5

Signed-off-by: Chen Dai <[email protected]>

dai-chen self-assigned this Jun 16, 2025

dai-chen added the enhancement New feature or request label Jun 16, 2025

dai-chen added 3 commits June 18, 2025 14:52

Refactor catalog API and clean up build.gradle

197f7d9

Signed-off-by: Chen Dai <[email protected]>

Add cache schema API and refactor UT

89d9394

Signed-off-by: Chen Dai <[email protected]>

Add readme

1308441

Signed-off-by: Chen Dai <[email protected]>

dai-chen changed the title ~~Add unified query API module for external integration~~ Add unified query API for external integration Jun 18, 2025

Add comment for hardcoding query size limit

f124aaa

Signed-off-by: Chen Dai <[email protected]>

dai-chen mentioned this pull request Jun 18, 2025

[FEATURE] Export PPL-Calcite engine as reusable library #3734

Closed

9 tasks

dai-chen marked this pull request as ready for review June 19, 2025 00:16

dai-chen requested review from GumpacG, LantaoJin, MaxKsyunz, Swiddis, YANG-DB, Yury-Fridlyand, acarbonetto, anirudha, derek-ho, forestmvey, joshuali925, kavithacm, mengweieric, noCharger, penghuo, ps48, qianheng-aws, seankao-az and ykmr1224 as code owners June 19, 2025 00:16

Swiddis reviewed Jun 19, 2025

View reviewed changes

Swiddis approved these changes Jun 23, 2025

View reviewed changes

Add default namespace API with more UTs

a55913a

Signed-off-by: Chen Dai <[email protected]>

LantaoJin approved these changes Jun 24, 2025

View reviewed changes

dai-chen merged commit c0858b5 into opensearch-project:feature/unified-ppl Jun 24, 2025
27 of 29 checks passed

dai-chen deleted the add-unified-query-api-module branch June 24, 2025 18:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add unified query API for external integration #3783

Add unified query API for external integration #3783

Uh oh!

dai-chen commented Jun 16, 2025 •

edited

Loading

Uh oh!

Uh oh!

Swiddis Jun 19, 2025

Uh oh!

dai-chen Jun 20, 2025

Uh oh!

Uh oh!

dai-chen commented Jun 23, 2025

Uh oh!

dai-chen commented Jun 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add unified query API for external integration #3783

Add unified query API for external integration #3783

Uh oh!

Conversation

dai-chen commented Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issues

Check List

Uh oh!

Uh oh!

Swiddis Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

dai-chen Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dai-chen commented Jun 23, 2025

Uh oh!

dai-chen commented Jun 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dai-chen commented Jun 16, 2025 •

edited

Loading