We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cudf::io::json::detail::normalize_single_quotes
\n
Reproducible with this input:
{\"a\": \"1\n2\"} {\'a\': 12}
The output tokens, generated by cudf::io::json::detail::get_token_stream after preprocessing with cudf::io::json::detail::normalize_single_quotes are:
cudf::io::json::detail::get_token_stream
Input: {"a": "1 2"}{'a': 12} Tokens: 0, 4, 6, 7, 8, 9, 5, 1, 0, 1 Token indices: 0, 1, 1, 3, 6, 10, 11, 11, 0, 0
If remove the \n character then the output is correct:
Input: {"a": "12"}{"a": 12} Tokens: 0, 4, 6, 7, 8, 9, 5, 1, 0, 4, 6, 7, 10, 11, 5, 1 Token indices: 0, 1, 1, 3, 6, 9, 10, 10, 12, 13, 13, 15, 18, 20, 20, 20
Note:
\0
allow_unquoted_control
true
static_cast
enum token_t
cudf/cpp/include/cudf/io/detail/tokenize_json.hpp
Line 88 in 06b3f83
I suspect that it is due to the leftover character \n in
cudf/cpp/src/io/json/json_normalization.cu
Lines 76 to 77 in 06b3f83
The text was updated successfully, but these errors were encountered:
5cbdcd0
shrshi
Successfully merging a pull request may close this issue.
Reproducible with this input:
The output tokens, generated by
cudf::io::json::detail::get_token_stream
after preprocessing withcudf::io::json::detail::normalize_single_quotes
are:If remove the
\n
character then the output is correct:Note:
\0
, not\n
.allow_unquoted_control
is set totrue
.static_cast
fromenum token_t
atcudf/cpp/include/cudf/io/detail/tokenize_json.hpp
Line 88 in 06b3f83
I suspect that it is due to the leftover character
\n
incudf/cpp/src/io/json/json_normalization.cu
Lines 76 to 77 in 06b3f83
The text was updated successfully, but these errors were encountered: