[BUG] CUDF does not strip trailing white space after a quoted string value #2069
Labels
bug
Something isn't working
cudf_dependency
An issue or PR with this label depends on a new feature in cudf
reliability
Features to improve reliability or bugs that severely impact the reliability of the plugin
Describe the bug
When Spark reads a line in which a quoted value, such as "A", is followed by white space, it strips off both the quotes and the trailing white space, because the value is quoted. CUDF instead sees the white space after the closing quote and assumes that all of the data, quotes included, should be part of the string, so it produces a value of
"A"
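To make the difference concrete, here is a minimal sketch in plain Python that models the two behaviours described above. It is not Spark or CUDF code: `parse_like_spark` and `parse_like_cudf` are hypothetical helpers, and the sample input (a quoted A followed by two spaces) is an assumed example, since the exact line from the report is not shown here.

```python
def parse_like_spark(field: str) -> str:
    """Hypothetical model of the Spark behaviour described in this issue:
    white space after the closing quote is dropped, then the quotes."""
    trimmed = field.rstrip(" \t")
    if len(trimmed) >= 2 and trimmed.startswith('"') and trimmed.endswith('"'):
        return trimmed[1:-1]
    # Unquoted fields are left untouched in this simplified model.
    return field


def parse_like_cudf(field: str) -> str:
    """Hypothetical model of the reported CUDF behaviour: quotes are only
    removed when the closing quote is the very last character."""
    if len(field) >= 2 and field.startswith('"') and field.endswith('"'):
        return field[1:-1]
    # Trailing white space after the quote keeps the whole text as the value.
    return field


raw = '"A"  '  # assumed sample: quoted value followed by two spaces
print(repr(parse_like_spark(raw)))  # 'A'
print(repr(parse_like_cudf(raw)))   # '"A"  '
```

In this model the two parsers agree whenever the closing quote is the last character of the field and diverge only when white space follows it, which is the case this issue is about.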
Steps/Code to reproduce bug
The CSV integration tests already cover this case: run test_basic_read on the file ints_with_whitespace.csv, but with the schema for strings instead of bytes.