Skip to content

Conversation

@jainankitk
Copy link
Contributor

Description

Enables efficient histogram collection using point range query, when querying field is same as the histogram field

Resolves #14535

Testing

Successfully ran the tests 10k times.

> Task :lucene:sandbox:test
:lucene:sandbox:test (SUCCESS): 10000 test(s)

> Task :lucene:sandbox:wipeTaskTemp
The slowest suites (exceeding 1s) during this run:
  109.33s TestHistogramCollectorManager (:lucene:sandbox)

BUILD SUCCESSFUL in 1m 53s
246 actionable tasks: 92 executed, 154 up-to-date

Initially they were failing due to first bucket occasionally not having any documents, causing finalizePreviousBucket to not move to the next bucket. That also addresses the TODO from earlier

java.lang.AssertionError: expected:<[2=>1041, 3=>671, 1=>1007]> but was:<[]>

Signed-off-by: Ankit Jain <[email protected]>
@jainankitk jainankitk merged commit d9ed519 into apache:main May 6, 2025
7 checks passed
@jainankitk jainankitk deleted the mrt-range branch May 6, 2025 23:35
weizijun added a commit to weizijun/lucene that referenced this pull request May 7, 2025
* main: (27 commits)
  deps(java): bump com.github.luben:zstd-jni from 1.5.7-2 to 1.5.7-3 (apache#14621)
  Improve user-facing docs for geo package (apache#14534)
  Enabling histogram collection for PointRangeQuery (apache#14560)
  Move sloppySin into SloppyMath from GeoUtils (apache#14516)
  Rewrite APIJAR extractor to use Java 24 classfile API and kill ASM dependency also for build system (apache#14613)
  CHANGES entry for apache#14226 (optimistic KNN Query)
  OptimisticKnnVectorQuery (apache#14226)
  Fix for Windows (spaces in paths) apache#14608
  Update jdk requirements in README to OpenJDK 24 (apache#14610)
  Always check gradle wrapper sha checksum and download if necessary (apache#14608)
  Fix changelog verifier (apache#14606)
  MultiRange query for SortedNumeric DocValues (apache#14404)
  Remove RANDOM_PRELOAD read advice, which is not actually used (apache#14593)
  Remove duplicate test (apache#14602)
  Refactor the expressions compiler to use official ClassData BSM with indexed lookup (apache#14602)
  Disallow EA versions to run Gradle (apache#14601)
  Add back-compat indices for 10.2.1
  Add Lucene 10.2.1 version constant
  DOAP changes for release 10.2.1
  Revert "An attempt to make jenkins pass with the currently installed jdk24-ea. To be reverted later. apache#14600"
  ...
@jainankitk jainankitk added this to the 10.3.0 milestone May 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Leverage multi range traversal Histogram Collection for PointRangeQuery

2 participants