Add possibility to describe attributes of a dataset/distribution #183
Comments
+1! 👍
@metaodi Very interesting. This seems to be what statistics calls a data structure definition (DSD): https://ec.europa.eu/eurostat/web/sdmx-web-services/data-struct-def. It's definitely worth discussing. A couple of questions:
In my opinion, the information shown on the page and the same information offered as a downloadable resource (Frictionless Data Package or similar) are complementary. A data package is nice for a power user, but you also have to account for the more casual audience who cannot work with such a file. For them, a table with the attribute descriptions and types could do wonders for understanding the data correctly.
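To illustrate how one description could serve both audiences, here is a minimal sketch. It assumes the attributes are stored in the shape of a Frictionless Table Schema (`fields` with `name`, `type`, `description`). The column names and the `render_attribute_table` helper are hypothetical and not taken from any real dataset or portal.

```python
import json

# Hypothetical attribute descriptions, shaped like a Frictionless Table Schema.
# The three columns below are invented for illustration only.
schema = {
    "fields": [
        {"name": "timestamp", "type": "datetime",
         "description": "Start of the counting hour"},
        {"name": "counting_site", "type": "string",
         "description": "Identifier of the counting site"},
        {"name": "vehicle_count", "type": "integer",
         "description": "Vehicles counted during the hour"},
    ]
}


def render_attribute_table(schema: dict) -> str:
    """Render the fields as a plain-text table for display on a dataset page."""
    rows = [("Attribute", "Type", "Description")]
    rows += [
        (f["name"], f.get("type", ""), f.get("description", ""))
        for f in schema["fields"]
    ]
    # Pad every column to its widest cell so the table lines up.
    widths = [max(len(row[i]) for row in rows) for i in range(3)]
    lines = [
        " | ".join(cell.ljust(w) for cell, w in zip(row, widths)).rstrip()
        for row in rows
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Human-readable view for casual users.
    print(render_attribute_table(schema))
    # Machine-readable view for power users (e.g. as a downloadable file).
    print(json.dumps(schema, indent=2))
```

Both views come from the same source, so the table on the page and the downloadable descriptor cannot drift apart.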
I think being able to find and search for attributes is an important part of the metadata. On data.stadt-zuerich.ch all attributes and their descriptions are part of the search index, so you can find a dataset by the description of its data. I honestly don't know why this is not part of DCAT so far. But I'm sure this is the reason for its current implementation on opendata.swiss 😉
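As a rough illustration of what indexing attribute descriptions makes possible, the sketch below builds a tiny inverted index over attribute names and descriptions. It is not how data.stadt-zuerich.ch or opendata.swiss implement search. The dataset identifiers, attributes, and function names are all hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical catalogue: dataset id -> list of described attributes.
datasets = {
    "traffic-counts": [
        {"name": "vehicle_count", "description": "Vehicles counted per hour"},
    ],
    "air-quality": [
        {"name": "no2", "description": "Nitrogen dioxide concentration"},
    ],
}


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def build_index(datasets: dict) -> dict:
    """Map every token of an attribute name or description to dataset ids."""
    index = defaultdict(set)
    for dataset_id, attributes in datasets.items():
        for attr in attributes:
            for token in tokenize(f"{attr['name']} {attr['description']}"):
                index[token].add(dataset_id)
    return index


def search(index: dict, query: str) -> set:
    """Return the datasets whose attributes contain every query token."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    return set.intersection(*(index.get(t, set()) for t in tokens))


if __name__ == "__main__":
    index = build_index(datasets)
    print(search(index, "nitrogen dioxide"))
```

A real portal would use its search engine's own indexing, but the idea is the same: the attribute descriptions become searchable text attached to the dataset.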
Just FYI: some data publishers have still found a way to bring this information to their users:
@metaodi Your issue really resonates with me, since this question has always been on my mind. I come from the data science side, and without a proper description of the fields, tabular data such as CSV files can't really be used for data analysis. But this is not an issue of DCAT-AP CH: it is already built into DCAT, which does not offer any vocabulary in that regard. Inspired by your cause, I raised an issue with DCAT to better understand DCAT's reasoning on this. The discussion there might interest you, and maybe you also want to join in: w3c/dxwg#1418
I feel that DCAT doesn't and shouldn't have too much
I agree. DCAT (Data Catalog Vocabulary) is the upper, "generic" information layer on data, with interoperability as its primary goal. It shouldn't go too deep and mix with domain standards like SDMX or FHIR; it should just reference the information needed to understand and use the data (see for instance https://www.w3.org/TR/vocab-dcat-2/#Property:distribution_conforms_to). Yet having a standardized form to describe and present variables could be really valuable!
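To make the "just reference it" idea concrete, here is a minimal sketch of a distribution that points to an attribute description via `dct:conformsTo`, written as a JSON-LD-style Python dictionary. The `example.org` URLs and the `schema_reference` helper are assumptions for illustration. Whether DCAT-AP CH would recommend this pattern is exactly what this thread is discussing.

```python
import json

# JSON-LD-style description of a distribution. All example.org URLs are
# placeholders; only the dcat/dct namespaces and property names are real.
distribution = {
    "@context": {
        "dcat": "http://www.w3.org/ns/dcat#",
        "dct": "http://purl.org/dc/terms/",
    },
    "@id": "https://example.org/distribution/traffic-counts-csv",
    "@type": "dcat:Distribution",
    "dcat:downloadURL": {"@id": "https://example.org/data/traffic-counts.csv"},
    # Reference to a (hypothetical) schema describing the CSV columns.
    "dct:conformsTo": {"@id": "https://example.org/schemas/traffic-counts.json"},
}


def schema_reference(distribution: dict):
    """Return the URL of the referenced schema, or None if there is none."""
    ref = distribution.get("dct:conformsTo")
    return ref.get("@id") if isinstance(ref, dict) else None


if __name__ == "__main__":
    print(json.dumps(distribution, indent=2))
    print(schema_reference(distribution))
```

The catalogue entry stays generic, and the details of the columns live in the referenced document, in whatever schema format the publisher chooses.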
Comment by @makxdekkers in w3c/dxwg#1418:
So this could very well be something DCAT-AP Switzerland defines without violating the DCAT standard.
What about adding
In the current version of the draft I see the 'conforms-to' property only at the dataset level (https://www.dcat-ap.ch/releases/2.0/dcat-ap-ch.html#dataset-conforms-to). Is it planned to add it at the distribution level too, or will it be limited to
@tlorusso The property
To my knowledge there is currently no way to describe attributes of a dataset (e.g. columns of a CSV). This would include the following information (minimal):
On http://data.stadt-zuerich.ch we provide this information on a dataset level (i.e. it does not differ between distributions).
Example: Daten der Verkehrszählung zum motorisierten Individualverkehr (Stundenwerte), seit 2012 (in English: traffic count data for motorized individual traffic, hourly values, since 2012)