Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Automated metadata population / suggestion #471

Open
erikvdbergh opened this issue Sep 10, 2024 · 1 comment
Open

[FEATURE] Automated metadata population / suggestion #471

erikvdbergh opened this issue Sep 10, 2024 · 1 comment

Comments

@erikvdbergh
Copy link

Is your feature request related to a problem? Please describe.

Filling in metadata is a big task for many researchers, with a lot of redundant data that they need to enter (Such as affiliation, collaborators info etc.). It is also difficult in you are not familiar with standards and terms that are good to use as metadata (as many researchers are). Therefore, adding metadata is often neglected, postponed and forgotten, leading to orphan data in the Research area.

Describe the solution you'd like

We want a prepopulation or suggestions for metadata in the metadata form, based on automated analysis of the data that is being metadated. Preferably, standard fields such as name, affiliation etc. should be autofilled based on login information.

Metadata related related to the content could be suggested based on automated analysis of the data, e.g. by LLM or other analysis library. Preferably this process would output suggestions that are based on standard terms, so that metadata standardisation is maintained.

Describe alternatives you've considered

The alternative is doing it by hand, but like stated in the problem this is often neglected because of the time it takes.

Additional context

This request is inspired by the functionality that ManGO has, based on the Apache Tika library: https://tika.apache.org/. However, with LLMs becoming dominant in the past few years, a LLM based solution might perform better.

@stsnel
Copy link
Member

stsnel commented Oct 2, 2024

Thank you for the proposal! We plan to gather feedback from other stakeholders on this idea, and expect we'll be able to give an update in roughly the coming two months.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants