Cache parquet schema fields & update libchdb #32

s0und0fs1lence · 2025-08-25T17:21:26Z

This PR avoids the costly call to the ColumnTypeDatabaseTypeName function in the hot path of the Next method.
The Parquet schema never changes between Next() calls while the cursor is alive, so we compute the fields once in PrepareRows and PrepareStreamingRows. This avoids a heap allocation for the slice of fields on each iteration.

On big datasets, this overhead could lead to GC pressure and thus decrease the performance of the entire driver.
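To illustrate the idea, here is a minimal sketch of the caching pattern. It is not the driver's actual code: `field`, `schemaReader`, `cursor`, and `prepareRows` are hypothetical stand-ins, and the real PrepareRows and PrepareStreamingRows may differ in shape. The sketch only shows the schema slice being built once at preparation time and reused by every Next call.

```go
package main

import "fmt"

// field is a hypothetical stand-in for one Parquet schema column.
type field struct {
	Name string
	Type string
}

// schemaReader is a hypothetical stand-in for the Parquet reader.
// Fields allocates a fresh slice on every call, which is the cost
// the cache is meant to avoid.
type schemaReader struct {
	cols  []field
	calls int // counts how many times the schema slice was rebuilt
}

func (s *schemaReader) Fields() []field {
	s.calls++
	out := make([]field, len(s.cols))
	copy(out, s.cols)
	return out
}

// cursor keeps the schema it computed when it was prepared.
type cursor struct {
	fields []field // cached: the schema is fixed while the cursor is alive
	row    int
	total  int
}

// prepareRows (hypothetical name) resolves the schema exactly once.
func prepareRows(src *schemaReader, total int) *cursor {
	return &cursor{fields: src.Fields(), total: total}
}

// Next uses only the cached fields and never asks the reader again.
func (c *cursor) Next(dest []string) bool {
	if c.row >= c.total {
		return false
	}
	for i, f := range c.fields {
		dest[i] = f.Name + ":" + f.Type
	}
	c.row++
	return true
}

func main() {
	src := &schemaReader{cols: []field{{"id", "INT64"}, {"name", "BYTE_ARRAY"}}}
	cur := prepareRows(src, 3)
	dest := make([]string, len(cur.fields))
	for cur.Next(dest) {
		fmt.Println(dest)
	}
	fmt.Println("schema lookups:", src.calls)
}
```

In this sketch the number of schema lookups stays at one regardless of how many rows are iterated, whereas calling `Fields()` inside `Next` would allocate one slice per row.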

Also, I've bumped libchdb to the latest version and started using the new API.

s0und0fs1lence added 5 commits August 25, 2025 17:15

cache schema fields to avoid memory allocation on every Next() call

28304a8

remove unused method

e203b48

bump chdb version

c37c4ee

fix error messages

fda725c

fix close conn

8872c02

s0und0fs1lence changed the title ~~Cache parquet schema fields~~ Cache parquet schema fields & update libcdhb Aug 31, 2025

auxten changed the title ~~Cache parquet schema fields & update libcdhb~~ Cache parquet schema fields & update libchdb Aug 31, 2025

auxten merged commit 173e057 into chdb-io:main Aug 31, 2025
3 checks passed
