-
Notifications
You must be signed in to change notification settings - Fork 902
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Broken tests in debug build #6521
Comments
I'm hoping that cudaIllegalMemoryAccess failures are all the same problem. May be related to rapidsai/rmm#563. |
This issue has been marked rotten due to no recent activity in the past 90d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. |
Does this issue still persist? I see that all the tests status are "fixed". |
This is still a valid bug. I faced this yesterday, I only ran tests on the Java bindings and saw a heap of Please find the logs attached |
…tor in thrust::lower_bound (#8432) Closes #6521 The `thrust::lower_bound` call is crashing on a libcudf debug build when using the `output_indexalator`. I've opened [an issue in the thrust github](NVIDIA/thrust#1452) keep track of this. The problem only occurs when using the `-G` nvcc compile option. I found a workaround using a `thrust::transform` along with device lambda containing a `thrust::lower_bound(seq)` call for each element. This PR adds the workaround which is only used in a debug build since the error occurs in functions that used as utilities for other functions when using dictionary columns. Authors: - David Wendt (https://github.com/davidwendt) Approvers: - Devavret Makkar (https://github.com/devavret) - Karthikeyan (https://github.com/karthikeyann) URL: #8432
Summary
When compiling
libcudf.so
with debug symbols, not all tests pass / execute without error.The following is a list of tests that either fail or error:
COPYING_TEST
: does not compileDICTIONARY_TEST
: illegal memory accessERROR_TEST
: failureFIXED_POINT_TEST
: illegal memory accessGROUPBY_TEST
: failureINTEROP_TEST
: illegal memory accessJOIN_TEST
: unknown, indefinite hangREDUCTION_TEST
: failureREPLACE_NULLS_TEST
: failureREPLACE_TEST
: illegal memory accessSEARCH_TEST
: illegal memory accessTEXT_TEST
: illegal memory accesswhere "illegal memory access" indicates that, at some point a
cudaErrorIllegalAddress
error was detected, and "failure" indicates that the test simply failed. I have attached a more complete report detailing precisely which sub-test prompted the error(s) or failure(s): cudf-debug-test-log.txtReproducing the failures
Build
libcudf.so
with debugging symbols by cloning PR #6134 and configuring withcmake .. -DCMAKE_BUILD_TYPE=Debug
. Alternatively, if you're using RAPIDS compose, simply fetch/checkout the PR, and build the C++ cudf library withbuild-cudf-cpp --debug
.Expected behavior
All tests should pass regardless of build type.
Environment details
Built from source.
print_env_out.txt
Status
COPYING_TEST
fixedDICTIONARY_TEST
fixedERROR_TEST
fixedFIXED_POINT_TEST
fixedGROUPBY_TEST
fixedINTEROP_TEST
fixedJOIN_TEST
fixedREDUCTION_TEST
fixedREPLACE_NULLS_TEST
fixedREPLACE_TEST
fixedSEARCH_TEST
fixedTEXT_TEST
fixedThe text was updated successfully, but these errors were encountered: