Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[C++] Enable multiple character delimiters in read_csv #32432

Open
asfimport opened this issue Jul 19, 2022 · 1 comment
Open

[C++] Enable multiple character delimiters in read_csv #32432

asfimport opened this issue Jul 19, 2022 · 1 comment

Comments

@asfimport
Copy link
Collaborator

Read_CSV ParseOptions allows only a single character delimiter. Single character delimiters are highly susceptible to the candidate value existing within the data to be loaded, negating the ability to serve as a delimiter.

If a double character delimiter is used, the current limit of a single character returns "only single character unicode strings can be converted to Py_UCS4, got length 2"

Reporter: Jack Howard

Note: This issue was originally created as ARROW-17130. Please see the migration documentation for further details.

@ajaytho
Copy link

ajaytho commented Jul 17, 2023

Looking for multiple character delimiters in read and write. Many a times hex is used as delimiters which is multi-character and limitation for current implementation.

@jorisvandenbossche jorisvandenbossche changed the title Enable multiple character delimiters in read_csv [C++] Enable multiple character delimiters in read_csv Mar 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants