Describe the bug

I am not totally sure if this is a bug or an epic, so I have marked it as both. CSV parsing has been enabled by default, but it has a lot of inconsistencies with what Spark does, and this can cause problems. The current plan is to mitigate this by disabling CSV by default and then working through the issues until we can enable it again everywhere.

Important Issues:

- [BUG] CSV parsing of malformed lines is empty string not null #2068 — cudf interprets missing values at the end of a line as empty strings, while Spark interprets them as null values. This is fine when the configured null value is the default (an empty string), but it is a problem if you set it to anything else.
- `columnNameOfCorruptRecord`
- Dates and timestamps need to deal with `spark.sql.legacy.timeParserPolicy` when parsing CSV files #1111

Investigate:

- Header with lots of comments at the beginning. We currently play games with turning off header checking in all but the first partition of a file. This is fine so long as the first partition contains more than just comments. This is minor, and possibly a bug in Spark as well.
- Support alternate character sets (the `encoding` option).
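As a rough illustration of the #2068 divergence above, here is a minimal plain-Python sketch (not the plugin's real code; the `parse_line` helper, its parameters, and the `"NULL"` marker are invented for this example) showing why the empty-string behavior only matches Spark when the null value is left at its default:

```python
import csv
import io

def parse_line(line, num_cols, null_value="", spark_like=True):
    """Parse one CSV line against a fixed column count.

    Spark-like behavior: columns missing at the end of the line become None.
    cudf-like behavior (the bug): missing columns become empty strings, which
    only matches Spark when null_value is itself the empty string.
    """
    fields = next(csv.reader(io.StringIO(line)))
    missing = num_cols - len(fields)
    if spark_like:
        fields += [None] * missing   # Spark: missing trailing values -> null
    else:
        fields += [""] * missing     # cudf: missing trailing values -> empty string
    # Fields equal to the configured null marker also become null.
    return [None if f == null_value else f for f in fields]

# With a non-default null marker the two behaviors diverge:
parse_line("a,b", 3, null_value="NULL", spark_like=True)   # ['a', 'b', None]
parse_line("a,b", 3, null_value="NULL", spark_like=False)  # ['a', 'b', '']
```

With the default (empty-string) null marker the empty string is itself converted to null, which is why the inconsistency stayed hidden until `nullValue` was changed.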
Low Priority Issues:
- Support `FAILFAST` mode. By default, when Spark sees malformed data it converts it to null (`PERMISSIVE` mode). In `FAILFAST` mode it throws an exception instead. There is also a `DROPMALFORMED` mode that is supposed to drop bad data, but it looks like CSV does not support it.
- `positiveInf`, `negativeInf`, and `nanValue` #4644
- `multiLine` support

Tests:
- Add tests for `enforceSchema` set to false. TODO: file issues. Oddly, for Spark, setting this to false results in more schema enforcement; the "enforce" here really means forcing the configured schema onto the files. It looks like this works for our plugin, but we want tests to verify it.
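For reference, the three parse modes discussed under Low Priority Issues can be sketched in plain Python (this is only an illustration of the intended semantics; the `parse_rows` helper and its row format are invented here, not the plugin's or Spark's real API):

```python
def parse_rows(lines, num_cols, mode="PERMISSIVE"):
    """Illustrate PERMISSIVE / DROPMALFORMED / FAILFAST handling of bad rows."""
    rows = []
    for line in lines:
        fields = line.split(",")
        malformed = len(fields) != num_cols
        if not malformed:
            rows.append(fields)
        elif mode == "PERMISSIVE":
            rows.append([None] * num_cols)   # malformed row -> all nulls
        elif mode == "DROPMALFORMED":
            continue                          # silently drop the row
        elif mode == "FAILFAST":
            raise ValueError(f"Malformed CSV record: {line!r}")
    return rows

data = ["1,2", "oops", "3,4"]
parse_rows(data, 2)                      # PERMISSIVE: bad row becomes nulls
parse_rows(data, 2, "DROPMALFORMED")     # bad row silently dropped
# parse_rows(data, 2, "FAILFAST")        # raises ValueError on "oops"
```

Real Spark `PERMISSIVE` mode also records the raw record in the column named by `columnNameOfCorruptRecord`, which this sketch omits.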