-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test broadcast merge in local_cudf_merge benchmark #507
Conversation
run tests |
Now that dask#7143 was merged, it is now safe to consider this PR for integration. |
Retargeting seems to have no conflicts, let's see what the tests say. 🙂 |
Could you please run |
They still run, so we can wait another hour or so to see the results, but feel free to restyle it anyway in the meantime Rick. |
Ah ok. If they still run, great let's wait Know that previously changelog (ofc no longer included) and style were required to pass before running tests. Given this, couldn't tell if the jobs here were actually queued to run or maybe we were running into some edge case in gpuCI where they only appear queued (but don't actually run). |
Codecov Report
@@ Coverage Diff @@
## branch-0.19 #507 +/- ##
===============================================
+ Coverage 62.37% 63.77% +1.39%
===============================================
Files 22 22
Lines 2517 2523 +6
===============================================
+ Hits 1570 1609 +39
+ Misses 947 914 -33
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I left a minor nitpick, but other than that LGTM. Thanks @rjzamora !
Co-authored-by: Peter Andreas Entschev <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @rjzamora !
Looks like we are seeing some failures in |
Hmm - It doesn't seem like that test is running the benchmark modified in this PR, is it? |
Was this from a change in Dask or Distributed then? Both of those are planning to be released tomorrow. So if they have issues, we should try to identify those quickly |
Just to update this thread, Rick did find a recent Dask change that is causing failures here as mentioned in this comment ( dask/dask#7305 (comment) ) in the PR with the change. We have surfaced this in the Dask release issue ( dask/community#129 (comment) ). Trying to figure out now how best to address this before the Dask + Distributed release tomorrow |
I'm attempting to fix the issue mentioned above in dask/dask#7325 . |
rerun tests Edit: As Peter's fix has landed. Thanks Peter! 😄 |
@gpucibot merge |
Includes the necessary changes to test dask#7143. More specifically, this adds the following options:
--base-chunks
: Number of base-DataFrame partitions (default: n_workers)--other-chunks
: Number of other-DataFrame partitions (default: n_workers)--broadcast-join
: Use broadcast join when possible--shuffle-join
: Use shuffle join (takes precedence over '--broadcast-join')