-
Notifications
You must be signed in to change notification settings - Fork 175
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(tpcds-benchmarking): Add basic tpcds benchmarking for local testing #3509
Conversation
CodSpeed Performance ReportMerging #3509 will degrade performances by 57.41%Comparing Summary
Benchmarks breakdown
|
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #3509 +/- ##
==========================================
- Coverage 77.55% 77.54% -0.01%
==========================================
Files 709 709
Lines 86288 86286 -2
==========================================
- Hits 66917 66911 -6
- Misses 19371 19375 +4 |
not necessary, but IMO it'd be nice to add this to the something like:
|
This test currently fails as we underestimate by 2x Co-authored-by: Jay Chia <[email protected]@users.noreply.github.com>
Overview
This PR enables TPC-DS benchmarking on your local computer.
If the TPC-DS parquet data does not exist, it will be created for you (inside of
benchmarking/tpcds/data
). You can configure the location of this output via the--tpcds-gen-folder
argument provided by the script.Usage
For running with the native runner, run:
DAFT_RUNNER=native python -m benchmarking.tpcds --questions "3"
For executing with the ray runner, run:
DAFT_RUNNER=ray python -m benchmarking.tpcds --questions "3"
You can also specify different scale-factors (e.g.,
--scale-factor 0.5
) and dry-run modes (e.g.,--dry-run
).