Resolve the conflict between encrypted derived columns and primary key columns(#36841) #36899

wangmiscoding · 2025-10-18T10:26:37Z

Changes proposed in this pull request:

I have fully reproduced this issue locally and identified the root cause.

Reproduction scenario:
When the encrypted column is the last field in the table, and both encrypted assisted query columns and Snowflake algorithm for generating primary key IDs are enabled.

Root cause:
In this insert SQL, Sharding needs to ensure the final SQL includes inserts for both the order_id and assisted_username columns, requiring SQL rewriting by Sharding.

The SQL rewriting process involves the following steps:

Add the values to be inserted - the Snowflake-generated order_id value (1186337235088179200) and the assisted column value generated from username (9de37a0627c25684fdd519ca84073e34):
a. The logic for adding order_id is located in GeneratedKeyInsertValueParameterRewriter#rewrite, which appends the value 1186337235088179200 to the end of the parameter list.
b. The logic for adding assisted_username is located in EncryptInsertValueParameterRewriter#rewrite, which finds the position of username in the list and places the assisted_username value immediately after it.

This results in the value list becoming:
(1, 23.4, 2025-10-18 16:16:21.7380934, 1, PENDING, null, bg+LQMPvSU8G0cnJuWM/1g==, 9de37a0627c25684fdd519ca84073e34, 1186341196495060992)

Add the column names to be inserted - both order_id and assisted_username:
a. In org.apache.shardingsphere.infra.rewrite.sql.token.keygen.generator.GeneratedKeyInsertColumnTokenGenerator#generateSQLToken,
order_id is inserted at the end of the column name list.
b. In org.apache.shardingsphere.encrypt.rewrite.token.generator.insert.EncryptInsertDerivedColumnsTokenGenerator#generateSQLTokens,
the derived assisted_username is placed after username.

The problem arises here:
Since username itself is already the last field, placing assisted_username after username is equivalent to placing it at the end, causing disorder between assisted_username and order_id.

As shown in the figure, the insertion indexes become identical.

Therefore, to resolve this issue, we need to ensure consistent ordering between the derived parameter columns and derived value columns.

The approach I'm taking:
Modify the derivation logic to keep the derived primary key token at the very end. This way, even if the encrypted column is in the last position, the derived auxiliary column will still appear before the primary key.

I believe this approach requires minimal changes. I appreciate everyone's perspectives on this solution

Before committing this PR, I'm sure that I have checked the following options:

My code follows the code of conduct of this project.
I have self-reviewed the commit code.
I have (or in comment I request) added corresponding labels for the pull request.
I have passed maven check locally : ./mvnw clean install -B -T1C -Dmaven.javadoc.skip -Dmaven.jacoco.skip -e.
I have made corresponding changes to the documentation.
I have added corresponding unit tests for my changes.
I have updated the Release Notes of the current development version. For more details, see Update Release Note

…y columns(apache#36841)

wangmeng and others added 4 commits October 18, 2025 18:07

Resolve the conflict between encrypted derived columns and primary ke…

65979f6

…y columns(apache#36841)

Resolve the conflict between encrypted derived columns and primary ke…

bbdab69

…y columns(apache#36841)

style(rewrite): Adjust the code format of the SQLToken class

2c4a019

style(rewrite): Adjust the code format

391dda4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Resolve the conflict between encrypted derived columns and primary key columns(#36841) #36899

Resolve the conflict between encrypted derived columns and primary key columns(#36841) #36899

Uh oh!

wangmiscoding commented Oct 18, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Resolve the conflict between encrypted derived columns and primary key columns(#36841) #36899

Are you sure you want to change the base?

Resolve the conflict between encrypted derived columns and primary key columns(#36841) #36899

Uh oh!

Conversation

wangmiscoding commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes proposed in this pull request:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

wangmiscoding commented Oct 18, 2025 •

edited

Loading