
4 | 1.4.2 | Create working group on packaging standards to share metadata and data across repositories | 5 #12

Closed
sync-by-unito bot opened this issue Oct 7, 2022 · 7 comments
Assignees
Labels
pm.GREI https://docs.google.com/document/d/1RdifpHJDFqx8Y8-Dsv_VnnTgezjNHKpSyRei4cw3C-k/edit?usp=sharing pm.GREI-d-1.4.2 NIH, yr1, aim4, task2: Create working group on packaging standards

Comments


sync-by-unito bot commented Oct 7, 2022

Link to Backlog Page

This deliverable is not directly related to code. However, for context:

  • We have bags. There was work done under Harvard DataCommons towards this.
  • The DVUploader, while it still has caveats, can read bags and recreate a dataset in Dataverse.


@mreekie mreekie self-assigned this Oct 7, 2022
Collaborator

mreekie commented Oct 7, 2022

This issue represents a deliverable funded by the NIH.
This deliverable supports the NIH Initiative to Improve Access to NIH-funded Data.

Aim 4: Improve harvesting and packaging standards to share metadata and data across repositories

Our proposed project will significantly improve the widely-used Harvard Dataverse repository to better support NIH-funded research.

A critical measure of the GREI program’s success is standardized discoverability across generalist repositories. To help with this, we propose to improve the existing harvesting functionality in the Dataverse software, which is based on the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) standard, and to coordinate with other repositories on packaging standards for sharing or moving metadata and data.
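For readers unfamiliar with OAI-PMH, a harvest is driven by plain HTTP requests carrying a `verb` and a few arguments. The following is a minimal sketch of how a harvester might build a `ListRecords` request URL. The base URL and the helper name `build_list_records_url` are hypothetical and used only for illustration; this is not taken from the Dataverse code base.

```python
from urllib.parse import urlencode


def build_list_records_url(base_url, metadata_prefix="oai_dc",
                           set_spec=None, from_date=None,
                           resumption_token=None):
    """Build an OAI-PMH ListRecords request URL (illustrative helper)."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument: when
        # continuing a partial list, only the verb and the token are sent.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
        if from_date:
            # Selective harvesting: only records changed since this date.
            params["from"] = from_date
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint, for illustration only.
first_page = build_list_records_url("https://repo.example.org/oai",
                                    from_date="2022-10-01")
next_page = build_list_records_url("https://repo.example.org/oai",
                                   resumption_token="tok1")
```

A harvester would repeat the second form until the repository stops returning a resumption token.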

Dataverse already supports Bags as defined by the Research Data Alliance (RDA) Research Data Repository Interoperability Working Group.

Here we propose to improve the support for Bags, test it for NIH-funded datasets, and explore and define the appropriate standard to use to move metadata and data across generalist repositories.

  • This will help with sustainability and succession planning.
    • If one repository can no longer support a specific dataset, the dataset can easily be moved to another repository without losing any information about it.
  • Additionally, we propose to implement Signposting in the Dataverse software.
    • By adding HTTP Link headers throughout the application, we can more easily support automated metadata and data discovery in the repository, and allow other applications and services to represent the content in the Harvard Dataverse repository more accurately and completely.
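To illustrate what the Signposting item would make possible for a client, here is a minimal sketch of reading typed links from an HTTP `Link` header and grouping them by relation. The relation names (`cite-as`, `describedby`, `item`) are ones Signposting uses; the URLs, the sample header, and the function name `parse_link_header` are assumptions made up for this example and do not describe actual Dataverse responses.

```python
import re
from collections import defaultdict

# One link-value: <target> followed by zero or more ;name=value parameters.
_LINK_RE = re.compile(
    r'<([^>]*)>((?:\s*;\s*[\w-]+\s*=\s*(?:"[^"]*"|[^",;\s]+))*)'
)
_PARAM_RE = re.compile(r';\s*([\w-]+)\s*=\s*(?:"([^"]*)"|([^",;\s]+))')


def parse_link_header(value):
    """Group the links of an HTTP Link header value by relation type."""
    links = defaultdict(list)
    for match in _LINK_RE.finditer(value):
        target, raw_params = match.group(1), match.group(2)
        params = {}
        for name, quoted, bare in _PARAM_RE.findall(raw_params):
            params[name.lower()] = quoted or bare
        # A rel parameter may carry several space-separated relation types.
        for rel in params.get("rel", "").split():
            links[rel].append({"target": target, "type": params.get("type")})
    return dict(links)


# Hypothetical header for a dataset landing page (illustration only).
EXAMPLE_LINK_HEADER = (
    '<https://doi.org/10.5072/FK2/EXAMPLE>; rel="cite-as", '
    '<https://repo.example.org/dataset/1/metadata.jsonld>; '
    'rel="describedby"; type="application/ld+json", '
    '<https://repo.example.org/file/1>; rel="item"; type="text/csv", '
    '<https://repo.example.org/file/2>; rel="item"; type="text/plain"'
)

links = parse_link_header(EXAMPLE_LINK_HEADER)
```

With links grouped this way, a crawler could follow `describedby` for metadata and `item` for data files without scraping the HTML page.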
ID | Aim | Deliverable
1.4.1 | 4 | Resolve OAI-PMH harvesting issues
1.4.2 | 4 | Create working group on packaging standards to share metadata and data across repositories
2.4.1 | 4 | Implement packaging standards based on working group feedback
3.4.1 | 4 | Test packaging and harvesting with other generalist repositories
4.4.1 | 4 | Assess and improve packaging and harvesting across repositories

Collaborator

mreekie commented Nov 1, 2022

This deliverable is not directly related to code. However, for context:

  • We have bags. There was work done under Harvard DataCommons towards this.
  • The DVUploader, while it still has caveats, can read bags and recreate a dataset in Dataverse.

Collaborator

mreekie commented Nov 4, 2022

September update:
(1.4.1, 1.4.2) A spike (Dataverse GitHub Issue IQSS/dataverse-pm#24) has been completed by the team to inventory existing issues with the current harvesting functionality to prepare for upcoming collaborative work on packaging standards. There are 20 GitHub Issues that were identified, and work has started on those Issues in priority order. The first two of these issues (#8139 and #8484) have been addressed and their fixes integrated into the code base, to be released in Dataverse 5.12. The Search & Metadata sub Working Group can serve as a forum to explore cross-repository metadata sharing.

Collaborator

mreekie commented Feb 8, 2023

monthly update - Jan

Collaborator

mreekie commented Mar 3, 2023

monthly update

@mreekie mreekie transferred this issue from IQSS/dataverse Mar 3, 2023
@mreekie mreekie added the pm.GREI https://docs.google.com/document/d/1RdifpHJDFqx8Y8-Dsv_VnnTgezjNHKpSyRei4cw3C-k/edit?usp=sharing label Mar 3, 2023
@mreekie mreekie added the pm.GREI-d-1.4.2 NIH, yr1, aim4, task2: Create working group on packaging standards label Mar 18, 2023
Collaborator

mreekie commented Apr 10, 2023

(1.4.2, 2.4.1) Activity related to 1.4.2 was 50% complete at the end of year 1 and has been transferred to year 2. The Dataverse team meets regularly with several parties, including colleagues from the GREI initiative. No formal working group has been created yet.

Collaborator

mreekie commented Apr 18, 2023

Draft for year 1 summary: FY1 Annual Summary

This activity was 50% complete at the end of year 1. A formal working group was not created in year 1. However, the Browse and Search sub Working Group explored cross-repository metadata sharing in its work. The Dataverse team also met regularly with colleagues from the GREI initiative on this topic. Year 2 work toward completion will be tracked as yr:2 aim:4 task:2A (2.4.2A), starting at 50% complete.
