-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
4 | 1.4.1 | Resolve OAI-PMH harvesting issues | 5 #10
Comments
This issue represents a deliverable funded by the NIH Aim 4: Improve harvesting and packaging standards to share metadata and data across repositories Our proposed project will significantly improve the widely-used Harvard Dataverse repository to better support NIH-funded research. A critical measure of the GREI program’s success is to standardize the discoverability across generalist repositories. To help with this, we propose to improve the existing harvesting functionality in the Dataverse software based on the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) standard, and coordinate with other repository packaging standards to share or move metadata and data. Dataverse already supports the Bags as defined by the Research Data Alliance (RDA) Research Data Repository Interoperability Working Group. Here we proposed to improve the support for Bags, test it for NIH-funded datasets, and explore and define the appropriate standard to use to move the metadata and data across generalist repositories
|
Who:
|
September update: |
Last updated: October 2022 (no change) (1.4.1, 1.4.2) A spike (Dataverse GitHub Issue IQSS/dataverse-pm#24) has been completed by the team to inventory existing issues with the current harvesting functionality to prepare for upcoming collaborative work on packaging standards. There are 20+ GitHub Issues that were identified, and work has started on those Issues in priority order. The first two of these issues (#8139 and #8484) have been addressed and their fixes integrated into the code base, and have been released in Dataverse 5.12. The Search & Metadata sub Working Group can serve as a forum to explore cross-repository metadata sharing. |
Last updated: Mon Dec 5 2022 (1.4.1, 1.4.2) Work has continued on the initial backlog created from the initial spike. The following issues have been addressed in recent sprints. Trying to set up or complete a harvesting client through the API crashes Dataverse, OAI server: metadataPrefix unknown: Internal server error #37410, Feature Request/Idea: Documentation for the API to create and edit harvesting clients IQSS/dataverse#8267, and OAI-PMH responses indicating errors should be processable by OAI-PMH clients IQSS/dataverse#3797. The Search & Metadata sub Working Group can serve as a forum to explore cross-repository metadata sharing. 70% |
This needs grooming to determine if we have satisfied the deliverable scope.
|
Last updated: Thu Dec 15 2022 before I left for the holiday Work has continued on the initial backlog created from the initial spike. The following issues have been addressed in recent sprints. Expand the suite of automated tests of the Harvesting functionality #8843, invalid schema and metadataNamespace fields in OAI-PMH ListMetadataFormats response #3621, [feature request] stop an harvest job in progress #7940 |
Met with leonid.
In addition:
|
Next steps:
|
Monthly Update January (1.4.1, 1.4.2) Continued work on backlog and revised the remaining work.. Work has continued on the initial backlog created from the initial spike #8574. |
February Update (1.4.1) Continued work on backlog and revised the remaining work.. Work |
March update (1.4.1) This activity was completed at an extent of 85% in year 1 and transferred to year 2. |
draft yr 1 report summary: FY1 Annual Summary This activity was completed at an extent of 85% in year 1. The team created a prioritized list and started with the most critical items. At the end of the year 18 of 27 items have been resolved. The estimated completion attempts to reflect the varying complexity of the items on the list. Year 2 work toward completion will be tracked as yr:2 aim:4 task:1a (2.4.1A) starting at 85% complete. |
References:
Problem Statement
This first year deliverable is clear from the title
Proposed Solution
The focus of the first year is on the metadata.
We have an existing backlog of fixes.
The existing list may not capture all existing problems, but it is extensive.
Deliverables for the first year:
Last updated: Mon Dec 5 2022
Last updated: Thu Dec 15 2022 before I left for the holiday
Report: Dec 2022
Work has continued on the initial backlog created from the initial spike. The following issues have been addressed in recent sprints. Expand the suite of automated tests of the Harvesting functionality #8843, invalid schema and metadataNamespace fields in OAI-PMH ListMetadataFormats response #3621, [feature request] stop an harvest job in progress #7940
70%
Ordered list of Issues that make up this deliverable:
updated: 2023_01_09
Full list is under construction in: #25
┆Issue is synchronized with this Smartsheet row by Unito
The text was updated successfully, but these errors were encountered: