-
Notifications
You must be signed in to change notification settings - Fork 215
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Attribution: XML/RDF/Turtle please. #4430
Comments
I added "frontend" label because I think this refers to the frontend single result page's "Credit the creator" section, not the API's "attribution" property. |
Attribution formats like XML should be supported by the API as well imo, so having both is 👌 . |
DC sounds great! CC REL already uses DC terms, and the rich-text/HTML version of the attribution would be relatively easy to translate into a DC XML fragment. |
I'd love to take on this. |
Currently all frontend attribution generation happens in JavaScript: https://github.com/WordPress/openverse/blob/main/frontend/src/utils/attribution-html.ts. The python The frontend's The XML snippet should just be another option of output. You can use the existing methods for generating HTML to generate the XML. Are you familiar with DC or RDF @madewithkode? There are a lot of resources online about both, but DublinCore's own documentation tends to be the best, and here's their documentation about RDF/XML specifically: https://www.dublincore.org/specifications/dublin-core/usageguide/#rdfxml and https://www.dublincore.org/specifications/dublin-core/dc-xml-guidelines/ The snippet there already gives a good idea of how to add the parts we'd need, it's essentially 1:1 with that, except we'd also populate <rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:>
<rdf:Description rdf:about="https://www.flickr.com/photos/126953422@N04/40593461235">
<dc:creator>Graham Winterflood</dc:creator>
<dc:title>Galah in Darwin (Eolophus roseicapilla)</dc:title>
<dc:rights>"Galah in Darwin (Eolophus roseicapilla)" by Graham Winterflood is licensed under CC BY-SA 2.0. To view a copy of this license, visit https://creativecommons.org/licenses/by-sa/2.0/?ref=openverse.</dc:rights>
</rdf:Description>
</rdf:RDF> That interprets However, I think we shouldn't create the full RDF XML, and instead, just offer the DC elements as XML (and we could follow this up by offerring different formats like Turtle or JSON-LD in the future, as separate issues). So then, we'd just have a copyable snippet, with some explanatory text. Maybe like this: <dc:creator>Graham Winterflood</dc:creator>
<dc:title>Galah in Darwin (Eolophus roseicapilla)</dc:title>
<dc:rights>"Galah in Darwin (Eolophus roseicapilla)" by Graham Winterflood is licensed under CC BY-SA 2.0. To view a copy of this license, visit https://creativecommons.org/licenses/by-sa/2.0/?ref=openverse.</dc:rights>
<dc:identifier>https://www.flickr.com/photos/126953422@N04/40593461235</dc:identifier>
<dc:type>StillImage</dc:type>
It can be that simple, if we like. @midijohnny please let me know if I've got this wrong... I'm basing this on just 6 months of Library and Information Services courses I took recently, and only did a small amount of DC, but never anything in XML. I don't think we should try to use DCMI terms (like implementing |
Assigned to you, @madewithkode, but it's probably a good idea to wait for @midijohnny to give more input before going to strongly in one direction (snippet, full RDF, which terms to use, etc). I do think it's best to stick with just an XML snippet for this first issue. |
I agree with you @sarayourfriend, any more extra/specific details regarding what's required would be appreciated. And thank you for the really indepth insights on this topic, I'd be sure to checkout the resources you shared as I do not have any prior experience with all the other markups/specifications being discussed asides XML. I'd standby on this a bit to see if @midijohnny has anything more to add before getting started. |
Great discussion ! I'm not an expert in RDF or Dublin Core either - but I would say the example above ("Maybe like this...") is going to be good enough - with one minor alteration - to include a root element with a namespace identifier. So something like:
It doesn't have to be 'metadata' - it could be (say) 'attribution' or whatever you think it best. Having a well-formed document like this - with the namespace included (so people can look up the vocabulary based on the namespace) would provide a large benefit I think. It means (for instance) somebody downstream can build an XSLT to transform this to what suits them. For my purposes: I was collecting images to display in an XHTML (i.e. well-formed XML) environment, so if I had the format above it would have made my life easier. |
For additional context - here's why I logged the original request. |
Perfect, thanks very much @midijohnny! I was wondering how best to include the namespace, that looks great. And makes things more flexible for the future if we want to implement CC REL. @madewithkode how do you feel about starting on this, when you have time? Do you feel you have enough to go on to get started? |
Sorry I'm late guys, been battling a flu. Really great insights and extra contexts @midijohnny |
No worries at all, take your time and get well soon! There's no rush or pressure with this. |
I wanted to share some prior art here concerning XML. The Dublin Core we're adding in #4499 looks good. I also remembered today that Creative Commons' own License Chooser offers Extensible Metadata Platform (XMP) format, which is XML in a .xmp file. Fun fact: @obulat implemented it a few years ago in this PR: creativecommons/chooser#272. A small change was made to that implementation shortly after. I wonder if we should support that format as well? |
What's the use case for downloading an XMP snippet? Wouldn't you use your image editor software to add that data, either embedded as an EXIF extension or as a sidecar file? I didn't know it could (or would) be used for attribution, I'm only familiar with it for use to describe the immediate work. I guess if you can add arbitrary additional metadata, attributions would just go there? I'm not sure how you would structure attributions in DC. An array of RightsStatements? |
The only (possible) use case I can think of is in the context of remixing works, where you might want to store or modify the original XMP for your new, derived work. Even that feels somewhat contrived though. Probably best to wait on that until someone with a clear use case asks for it, as happened with dublin core here! 😄 |
That XML tab looks really good - nice one, should be a good benefit for people - it would have saved me time and effort for sure when I was building my little demo app! |
It would be handy if there was some additional formats for the "Credit the creator" section.
In particular - I would suggest at least it should include a simple well-formed XML record.
Better: one that corresponds to the Dublin Core specification : https://www.dublincore.org/specifications/dublin-core/dcmi-terms/
Or RDF in general - including a 'Turtle format'.
(Although publishing in Dublin Core XML format would be enough for others to automatically translate this to other forms of RDF probably).
I would suggest this would also encourage more compliance with attribution , since it would be easier for the author to automatically credit creators.
The text was updated successfully, but these errors were encountered: