Skip to content

Conversation

@andygrove
Copy link
Member

@andygrove andygrove commented Oct 5, 2024

Closes #10

Changes in this PR:

  • Update to datafusion 42 / arrow-rs 53.1.0
  • Update datafusion.proto
  • Add benchmark script
  • Remove old performance results because they are no longer valid

I can now run TPC-H and see tasks distributed across both nodes in my Ray cluster.

Screenshot from 2024-10-06 09-54-51

@andygrove andygrove changed the title WIP: Add script for running TPC-H Add script for running TPC-H Oct 6, 2024
@andygrove andygrove marked this pull request as ready for review October 6, 2024 15:58
@andygrove andygrove changed the title Add script for running TPC-H Fix FFI errors, and add script for running TPC-H Oct 6, 2024
sql = sql.strip()
if len(sql) > 0:
print(f"Executing: {sql}")
rows = ctx.sql(sql)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How does this time relate to the fact that the context only wait for the first partition ? see https://github.com/apache/datafusion-ray/blob/main/datafusion_ray/context.py#L234

is there always a single partition?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, the final query stage always has a single output partition. You can verify this by looking at the expected plans in the repo.

@andygrove
Copy link
Member Author

andygrove commented Oct 7, 2024

I will go ahead and merge this since the current main branch is broken without this FFI fix

@andygrove andygrove merged commit ead1e4e into apache:main Oct 7, 2024
1 check passed
@andygrove andygrove deleted the tpch branch October 7, 2024 14:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Ray shuffle not working

2 participants