fix: CSV Infer Schema now properly supports escaped characters. #13214

Merged (1 commit), Nov 4, 2024

Conversation

mnorfolk03
Contributor

Which issue does this PR close?

Closes #13087.

Rationale for this change

CsvFormat's infer_schema can now properly parse CSV data with escape characters.
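To illustrate why escape handling matters during schema inference, here is a minimal, standard-library-only sketch. It is not DataFusion's or arrow-csv's implementation: `count_fields` is a hypothetical helper, and it assumes the escape character differs from the quote character and only applies inside quoted fields. It shows how a record containing an escaped quote can appear to have a different number of fields than the header when the escape character is ignored, which is the kind of mismatch behind the UnequalLengths error in the linked issue.

```rust
/// Hypothetical helper: count the fields in a single CSV record.
/// Assumes `escape`, when set, differs from `quote` and is only
/// meaningful inside a quoted field.
fn count_fields(line: &str, delimiter: char, quote: char, escape: Option<char>) -> usize {
    let mut fields = 1;
    let mut in_quotes = false;
    let mut chars = line.chars().peekable();

    while let Some(c) = chars.next() {
        if in_quotes {
            if Some(c) == escape {
                // The escaped character is literal data, so skip it.
                chars.next();
            } else if c == quote {
                if chars.peek() == Some(&quote) {
                    // A doubled quote is a literal quote inside the field.
                    chars.next();
                } else {
                    in_quotes = false;
                }
            }
        } else if c == quote {
            in_quotes = true;
        } else if c == delimiter {
            fields += 1;
        }
    }
    fields
}
```

With the header `c1,c2` and the record `"a\"b",2`, passing `Some('\\')` as the escape yields two fields for both lines. Passing `None` makes the backslash ordinary data, so the quote after it closes the field early, the later quote reopens one, and the comma is swallowed, leaving a single field that no longer matches the header.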

What changes are included in this PR?

Minor changes to the CSV parser's infer_schema_from_stream function, plus a new test covering those changes.

Are these changes tested?

yes

Are there any user-facing changes?

no

@github-actions github-actions bot added the core Core DataFusion crate label Nov 1, 2024
@mnorfolk03
Contributor Author

@alamb if you have some spare time and could add this to your list of PRs to review, that would be greatly appreciated!

@alamb alamb added the documentation Improvements or additions to documentation label Nov 3, 2024
Contributor

@alamb alamb left a comment

Thank you @mnorfolk03 and @findepi 🦾

@alamb alamb removed the documentation Improvements or additions to documentation label Nov 4, 2024
@alamb
Contributor

alamb commented Nov 4, 2024

FYI @kolulu23

@alamb alamb merged commit 0458d30 into apache:main Nov 4, 2024
24 checks passed
@alamb alamb mentioned this pull request Nov 5, 2024
3 tasks
Labels
core Core DataFusion crate
Development

Successfully merging this pull request may close these issues.

CsvFormat infer_schema reports UnequalLengths error despite having quotes and escape in its options
3 participants