Use CSVW default dialect #341

Open
RickMoynihan opened this issue Jan 29, 2024 · 0 comments

See also #342

We should:

  1. assume the default dialect if none is provided.
  2. validate that any dialect provided in the metadata conforms to the dialect schema (from the CSVW spec).
  3. validate that it is also in our supported subset of dialects, and fail on anything else. Supporting only the default is fine for now, e.g. failing on anything that isn't our default delimiters and explicitly UTF-8.

We should probably split this up into several issues.

The default dialect in the spec is:

```json
{
  "encoding": "utf-8",
  "lineTerminators": ["\r\n", "\n"],
  "quoteChar": "\"",
  "doubleQuote": true,
  "skipRows": 0,
  "commentPrefix": "#",
  "header": true,
  "headerRowCount": 1,
  "delimiter": ",",
  "skipColumns": 0,
  "skipBlankRows": false,
  "skipInitialSpace": false,
  "trim": false
}
```
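
A rough sketch of points 1 and 3, in Python purely for illustration (not this project's code). It assumes the metadata is already parsed into a dict with an optional `dialect` key, and it treats "supported" as "identical to the default, with the encoding compared case-insensitively". The names `resolve_dialect`, `unsupported_properties`, `resolve_and_check` and `UnsupportedDialectError` are hypothetical, and full validation against the dialect schema (point 2) is not attempted here.

```python
import copy

# Default dialect, copied from the spec excerpt above.
DEFAULT_DIALECT = {
    "encoding": "utf-8",
    "lineTerminators": ["\r\n", "\n"],
    "quoteChar": "\"",
    "doubleQuote": True,
    "skipRows": 0,
    "commentPrefix": "#",
    "header": True,
    "headerRowCount": 1,
    "delimiter": ",",
    "skipColumns": 0,
    "skipBlankRows": False,
    "skipInitialSpace": False,
    "trim": False,
}

_MISSING = object()  # sentinel for properties that have no default


class UnsupportedDialectError(ValueError):
    """Hypothetical error raised for dialects outside the supported subset."""


def resolve_dialect(metadata):
    """Point 1: start from the default and overlay whatever the metadata provides."""
    dialect = copy.deepcopy(DEFAULT_DIALECT)
    dialect.update(metadata.get("dialect") or {})
    return dialect


def unsupported_properties(dialect):
    """Point 3: list the properties that differ from the default dialect.

    Assumption: this is deliberately strict, so any property without a
    default (including unknown ones) is reported as unsupported.
    """
    problems = []
    for key, value in dialect.items():
        expected = DEFAULT_DIALECT.get(key, _MISSING)
        if key == "encoding" and isinstance(value, str):
            value = value.lower()  # accept "UTF-8" as well as "utf-8"
        # Compare types too, so that True is not accepted where 1 is expected.
        if type(value) is not type(expected) or value != expected:
            problems.append(key)
    return sorted(problems)


def resolve_and_check(metadata):
    """Resolve the effective dialect and fail on anything we do not support."""
    dialect = resolve_dialect(metadata)
    problems = unsupported_properties(dialect)
    if problems:
        raise UnsupportedDialectError(
            "unsupported dialect properties: " + ", ".join(problems)
        )
    return dialect
```

With this shape, metadata with no `dialect` resolves to the default, while something like `{"dialect": {"delimiter": ";"}}` raises an error naming `delimiter`. Failing loudly keeps the supported subset explicit and can be relaxed one property at a time.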