new voting sheets, generated in Python (notebook) not Makefile #270

@turbomam turbomam commented Nov 14, 2024

Needs better documentation, organization, and refresh plans for the files currently in notebooks/environmental_context_value_sets/

mam-env-package-overrides.tsv: can't be recreated from within this repo. Do not delete.

common.py: nobody uses this yet. Some of its functions use global dicts for very crude caching.
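
For illustration, the global-dict caching pattern described above could look roughly like the sketch below, next to a standard-library alternative. Everything here is hypothetical: `_slow_lookup`, `fetch_label_global`, `fetch_label_cached`, and the `CALLS` counter are made-up names, not functions taken from common.py.

```python
from functools import lru_cache

# Counter used only to show how often the "expensive" work actually runs.
CALLS = {"count": 0}


def _slow_lookup(curie):
    """Hypothetical stand-in for an expensive lookup, e.g. a label query."""
    CALLS["count"] += 1
    return f"label for {curie}"


# Pattern 1: a module-level dict used as a crude cache.
_LABEL_CACHE = {}


def fetch_label_global(curie):
    """Return the label, computing it only the first time a CURIE is seen."""
    if curie not in _LABEL_CACHE:
        _LABEL_CACHE[curie] = _slow_lookup(curie)
    return _LABEL_CACHE[curie]


# Pattern 2: the same memoization via functools.lru_cache, which also
# exposes cache_info() and cache_clear() for inspection and resets.
@lru_cache(maxsize=None)
def fetch_label_cached(curie):
    return _slow_lookup(curie)
```

Both variants keep results for the lifetime of the process. The decorator version avoids hand-managed global state, but it only works when all arguments are hashable.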

pulled on demand, not committed:

  • ncbi_biosamples_2024-09-23.duckdb
  • ncbi_biosamples_2024-09-23.duckdb.gz
  • goldterms.db
  • goldterms.db.gz

  • ncbi_biosamples_2024-09-23_package_usage.tsv
  • nmdc_env_package_predictions.tsv
  • environmental_context_value_sets.Makefile
  • environmental_context_value_sets/README.md

@turbomam turbomam linked an issue Nov 14, 2024 that may be closed by this pull request
@turbomam turbomam changed the title recapitulate soil env_broad_scale voting sheet in Python (notebook) not Makefile recapitulate voting sheet generation in Python (notebook) not Makefile Nov 15, 2024
@turbomam turbomam changed the title recapitulate voting sheet generation in Python (notebook) not Makefile new voting sheets, generated in Python (notebook) not Makefile Nov 22, 2024

turbomam commented Nov 22, 2024

Here are the PO terms that any of our sources used for plant-associated env_medium. They all appear to be present in EnvO too:

PO:0000003,whole plant
PO:0009007,portion of plant tissue
PO:0009008,plant organ
PO:0009011,plant structure
PO:0025025,root system
PO:0025034,leaf

If they have a common root, maybe we should include more terms from that hierarchy?
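
One way to check for a common root is to intersect the is_a ancestor sets of the terms. The sketch below assumes a child-to-parents mapping has already been extracted from the ontology by some other means. The helper names and the `TOY_PARENTS` hierarchy are hypothetical, and the `EX:` identifiers are made up rather than real PO or EnvO relationships.

```python
def ancestors(term, parents):
    """Return every is_a ancestor of `term`, excluding the term itself."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        for parent in parents.get(current, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def common_ancestors(terms, parents):
    """Return the classes that are ancestors of (or equal to) every term."""
    # Each term is included in its own set so that a term that is itself
    # an ancestor of all the others is reported as a common root.
    sets = [ancestors(t, parents) | {t} for t in terms]
    return set.intersection(*sets) if sets else set()


# Toy hierarchy with made-up identifiers, for illustration only.
TOY_PARENTS = {
    "EX:leaf": ["EX:organ"],
    "EX:organ": ["EX:structure"],
    "EX:root_system": ["EX:structure"],
    "EX:structure": ["EX:entity"],
}

shared = common_ancestors(["EX:leaf", "EX:root_system"], TOY_PARENTS)
```

With the toy data, `shared` contains `EX:structure` and `EX:entity`. Descendants of the most specific shared class would be the candidates for additional terms.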

I guess the current envo_native column in the voting sheets doesn't do a good job of distinguishing between EnvO native terms, terms imported into EnvO from other sources, and CURIEs that use the ENVO prefix but don't really denote any EnvO class.
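
A finer-grained column could separate those three cases. The sketch below is one possible approach, not the logic behind the current envo_native column: `classify_curie`, its category labels, and the `ontology_classes` set (assumed to hold every class CURIE found in EnvO, loaded elsewhere) are all hypothetical.

```python
def classify_curie(curie, ontology_classes, native_prefix="ENVO"):
    """Classify a CURIE relative to one ontology.

    ontology_classes: set of class CURIEs present in the ontology,
    including classes imported from other ontologies.
    """
    prefix = curie.split(":", 1)[0]
    in_ontology = curie in ontology_classes

    if in_ontology and prefix == native_prefix:
        return "native"  # a class defined by the ontology itself
    if in_ontology:
        return "imported"  # present, but carries another ontology's prefix
    if prefix == native_prefix:
        return "unresolved_native_prefix"  # looks native, denotes no class
    return "absent"  # foreign prefix and not in the ontology


# Illustrative input: the ENVO identifier is a made-up placeholder;
# the PO term is one of those reported above as present in EnvO.
known = {"ENVO:0000000", "PO:0025034"}
labels = {c: classify_curie(c, known)
          for c in ["ENVO:0000000", "PO:0025034", "ENVO:9999999", "PO:9999999"]}
```

Comparing prefixes as exact strings is case-sensitive, so inputs written as `envo:` would need normalizing first if the sources are inconsistent.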


turbomam commented Nov 22, 2024

Here's their is_a tree:

[image: plant-associated-biosample-env-medium-po-terms-is_a-to-mat-ent]

@turbomam

We could add more of these plant structure descendants without needing to consult any additional ontologies:

[image: plant-structures-in-envo]

@turbomam turbomam marked this pull request as ready for review December 6, 2024 22:30
@turbomam turbomam requested a review from cmungall December 6, 2024 22:30
@turbomam turbomam closed this Dec 17, 2024
Successfully merging this pull request may close these issues.

quick prototype of voting sheet generation in notebook in this repo