Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make a list of Mongo queries people find difficult to write #1604

Closed
turbomam opened this issue Dec 21, 2023 · 5 comments
Closed

Make a list of Mongo queries people find difficult to write #1604

turbomam opened this issue Dec 21, 2023 · 5 comments

Comments

@turbomam
Copy link
Member

turbomam commented Dec 21, 2023

This ticket is deprecated. Please refer to this Github issue and this Google Sheet

There are some NMDC information retrieval tasks that do not seem intuitive.

Let's collect some use cases that call out

  • what the task is
    • list the input knowledge and the output knowledge, like getting the identifiers for all Biosamples that are part_of a given Study
  • who the intended audience is
    • NMDC team members
    • workflows
    • external users

Possible solutions:

  • fancy pure-MongoDB queries over the existing collections
  • ad-hoc Python scripts that integrate results form multiple queries
  • MongoDB views
  • SPARQL
  • existing API endpoints
  • new API endpoints
@eecavanna eecavanna changed the title Library of difficult queries Make a list of Mongo queries people find difficult to write Dec 21, 2023
@turbomam
Copy link
Member Author

I added some people who might have some thoughts on this to the issue assignees, but please add or remove others if necessary.

@brynnz22
Copy link
Contributor

@turbomam here is a mongo query I wrote for one of the notebooks to go from biosample to a data object: https://github.com/microbiomedata/notebook_hackathons/blob/soil-contig-tax/taxonomic_dist_by_soil_layer/python/mongodb_query.txt.js

@aclum
Copy link
Contributor

aclum commented Dec 22, 2023

@turbomam see microbiomedata/nmdc-runtime#355
My preference would be consolidate tickets, no preference which repo we work out of.

@PeopleMakeCulture
Copy link

@dwinston and I have been collecting examples of difficult queries in microbiomedata/nmdc-runtime#355

@mslarae13
Copy link
Contributor

I'm going to close this issue. @brynnz22 @turbomam please copy your comments to the existing issue in runtime.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

9 participants