This PR fixes a bug in which the batch ids in the sampling output did not match the expected ones when using the bulk sampler, producing subgraphs that were both incorrect and larger than expected. Without reindexing, the wrong batch ids are assigned to the start vertices. Reindexing ensures that batch ids and start vertices keep the same order.
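The kind of misalignment described here can be sketched as follows. This is an illustration only, not the code changed in this PR: it uses pandas as a stand-in for cudf, and the vertex ids, batch ids, and index values are made up. It assumes the start vertices arrive with a non-default index, so label-based alignment pairs them with the wrong batch ids.

```python
import pandas as pd

# Hypothetical data: positionally, vertices 10 and 11 belong to batch 0,
# and vertices 12 and 13 belong to batch 1.
starts = pd.Series([10, 11, 12, 13], index=[2, 3, 0, 1])  # non-default index
batch_ids = pd.Series([0, 0, 1, 1])                       # default index 0..3

# Without reindexing, pandas aligns the two series on index labels,
# not on position, so each vertex picks up the wrong batch id.
misaligned = pd.DataFrame({"start": starts, "batch_id": batch_ids})
misaligned_map = dict(zip(misaligned["start"], misaligned["batch_id"]))

# Resetting both indices first makes alignment positional again.
aligned = pd.DataFrame({
    "start": starts.reset_index(drop=True),
    "batch_id": batch_ids.reset_index(drop=True),
})
aligned_map = dict(zip(aligned["start"], aligned["batch_id"]))
```

In this sketch `misaligned_map` assigns vertices 12 and 13 to batch 0, while `aligned_map` preserves the intended pairing.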
This PR also changes the empty dataframe passed to dask in `uniform_neighbor_sample` so that its `batch_id` and `hop_id` columns are in the correct order. This ensures that the columns are named correctly and are not inadvertently renamed because they were created in a different order.
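A rough sketch of why the column order of an empty template dataframe matters is shown below. It mimics the effect with plain pandas and a hypothetical `relabel` helper that applies the template's column names by position; it does not reproduce dask's internal handling, and the column names other than `batch_id` and `hop_id`, as well as all values, are illustrative.

```python
import pandas as pd

# Hypothetical sampling result; the columns are produced in this order.
result = pd.DataFrame({
    "sources": [1, 2],
    "destinations": [3, 4],
    "batch_id": [3, 4],
    "hop_id": [0, 1],
})

# Empty template dataframes that only describe the expected columns.
bad_template = pd.DataFrame(columns=["sources", "destinations", "hop_id", "batch_id"])
good_template = pd.DataFrame(columns=["sources", "destinations", "batch_id", "hop_id"])

def relabel(df, template):
    """Hypothetical helper: apply the template's column names by position."""
    out = df.copy()
    out.columns = template.columns
    return out

# With the wrong order, the hop ids end up under the name "batch_id".
swapped = relabel(result, bad_template)
# With the matching order, the names stay attached to the right data.
correct = relabel(result, good_template)
```

Here `swapped["batch_id"]` holds the hop ids, whereas `correct["batch_id"]` still holds the batch ids.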
This PR is non-breaking because it restores the original behavior of bulk sampling and reverses a bug that was inadvertently introduced with the dask updates.
Resolves #3390
Authors:
- Alex Barghi (https://github.com/alexbarghi-nv)
Approvers:
- Rick Ratzel (https://github.com/rlratzel)
- Vibhu Jawa (https://github.com/VibhuJawa)
- Joseph Nke (https://github.com/jnke2016)
URL: #3393
The test run was on a 2-node 16-GPU (total) configuration:
Both tests had the same problem: