-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create python notebook to explore processed NOM data #68
Create python notebook to explore processed NOM data #68
Conversation
…p percentages for single value per nom comparison, visualized in heatmap
…s for goldids etc. removed processed noms from analysis that had same url
…cessed nom files, added multiple peak match check and confidence score choice
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
NOM_visualizations/python/__pycache__/nom_functions.cpython-311.pyc
Outdated
Show resolved
Hide resolved
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:14Z Define NOM and all other acronyms at the start. |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:15Z Line #8. #import nom_functions.py in second_omics_type folder as module to utilize its functions I think this points to the incorrect location now. Can you separate out this import to a second chunk and explain/point to the other notebook that describes these functions more fully? |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:16Z Line #9. import nom_functions as func Can we import/save as nmdc_api or something similar? The functions are not NOM-specific. |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:17Z For explicitness, can we change " object |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:17Z ID should be id |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:18Z Can you add a bit of narrative here about why/how you queried all the fields for this step? Then you can use that to say you'll use the Envo information on the biosamples to label each as a type of sample (which leads into the next step) |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:19Z How about
"Create final data frame of relevant metadata and NMDC schema information for each NOM processed data object" |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:20Z I would move this up one chunk (before the "Create final dataframe" step) |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:27Z Since you're getting warnings with this, I would try to downselect until you do not get warnings or set your settings so warnings are not displayed. We don't want the warnings messages in the notebook. |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:28Z I would add a bit of narrative here about what a Van Krevelen plot is and how they are used. Nothing extensive - think wikipedia-level (https://en.wikipedia.org/wiki/Van_Krevelen_diagram). |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:29Z For the Marginal Density Plot, if the OC/HC ratio is in both soil and sand, how does that appear? Consider making the overlapping formula a separate color (black?) which would highlight formula that were only seen in a subset of the sample types. |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:30Z Line #9. g.figure.subplots_adjust(top=0.8) I would split this into two different chunks! One for the top plot, a second with the bottom. |
View / edit / reply to this conversation on ReviewNB kheal commented on 2024-09-10T20:19:30Z I think these next parts are really cool and the visualizations look awesome, but they make the notebook very long, so I'd vote to cut them. |
I thought the random state setting was the seed? View entire conversation on ReviewNB |
Aha! You're right :) View entire conversation on ReviewNB |
All Submissions:
New Notebook Submissions: