-
Notifications
You must be signed in to change notification settings - Fork 118
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support ignoreRows for TabularResource #344
Support ignoreRows for TabularResource #344
Comments
closely related to #326 |
@roll i'm super cautious about this kind of stuff as it is a place where "ETL" logic starts to bleed into the spec and that's a slippery slope. If you delete rows, what about columns, what about transforms etc etc. Thus, my sense is that ETL stuff like this should not go into the spec for now - at most it should be in patterns and even there i'm cautious. PS: i am willing to consider #326 because it is so common and it is about presence of a header row. |
@rufuspollock |
AGREED with @pwalsh: this should go to "Best Practice" rather than spec for now. |
Overview
Resource
specification is created to describe concrete data source with metadata. When we deal with concrete real world data sources there could be some corner case like commented rows or blank rows on top etc. A publisher needs an ability to share this information with implementations.Example
https://github.com/frictionlessdata/ADB-User-Study/blob/master/metadata.tsv
It's a valid resource (checked by
goodtables
) except row 2 and 3 which are comments and can't be removed because it's vital metadata for this publisher tools.Proposal
Introduce
ignoreRows
(orskipRows
orinformationalRows
or ?) attribute forTabularResource
specification. This attribute MUST be an array of integers and strings where:Example
ignoreRows = [1, 2, "#","//"]
Related
Headers is another example where publisher could be in need of more granular control over data source rows - #326
References
The text was updated successfully, but these errors were encountered: