Minor: Add SMJ to TPCH benchmark usage #10747
I checked that the TPCH benchmarks pass with SMJ on and that the row counts are the same.
tpch_mem: TPCH inspired benchmark on Scale Factor (SF) 1 (~1GB), query from memory
tpch10: TPCH inspired benchmark on Scale Factor (SF) 10 (~10GB), single parquet file per table
tpch10: TPCH inspired benchmark on Scale Factor (SF) 10 (~10GB), single parquet file per table, hash join
tpch_smj10: TPCH inspired benchmark on Scale Factor (SF) 10 (~10GB), single parquet file per table, sort merge join
I'm planning to get rid of tpch_smj* soon and take the join type from user input instead, so that any benchmark can be run with a choice of join type.
Benchmark results
Benchmarks comparing d6ddd23 (main) and 8353d20 (PR)
Thank you @comphead
* Fix: Sort Merge Join crashes on TPCH Q21
* Fix LeftAnti SMJ join when the join filter is set
* rm dbg
* Add SMJ to TPCH benchmark usage
Which issue does this PR close?
Closes #10100.
Rationale for this change
The fix for #10380 already resolved the underlying issue; this PR also fixes the usage info for SMJ.
What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?