Skip to content

Conversation

@ttnghia
Copy link
Contributor

@ttnghia ttnghia commented Nov 24, 2025

This removes the kernel call to cudf::valid_if to generate nullmask/null count during the computation of groupby aggregation MERGE_M2. Such null count is unnecessary as the output is already makes sense without nulls: if the input values of a group are all nulls, its output will have count valid field is 0 and that is enough for all further computation, no need to know the null count as well as the null mask for other output fields (avg and m2).

Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
@ttnghia ttnghia self-assigned this Nov 24, 2025
@ttnghia ttnghia requested a review from a team as a code owner November 24, 2025 18:23
@ttnghia ttnghia added the 3 - Ready for Review Ready for review by team label Nov 24, 2025
@ttnghia ttnghia added this to libcudf Nov 24, 2025
@ttnghia ttnghia added libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue Spark Functionality that helps Spark RAPIDS improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Nov 24, 2025
@davidwendt
Copy link
Contributor

This should be probably be marked as a breaking-change since the expected output has changed.

@ttnghia ttnghia added breaking Breaking change and removed non-breaking Non-breaking change labels Nov 24, 2025
Copy link
Contributor

@lamarrr lamarrr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changes LGTM.
Thanks for working on this

@ttnghia
Copy link
Contributor Author

ttnghia commented Nov 24, 2025

/merge

@rapids-bot rapids-bot bot merged commit e9aff53 into rapidsai:main Nov 24, 2025
254 of 261 checks passed
@ttnghia ttnghia deleted the improve_merge_m2 branch November 24, 2025 22:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

3 - Ready for Review Ready for review by team breaking Breaking change improvement Improvement / enhancement to an existing function libcudf Affects libcudf (C++/CUDA) code. Performance Performance related issue Spark Functionality that helps Spark RAPIDS

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

4 participants