-
Notifications
You must be signed in to change notification settings - Fork 0
bugs in data.tsv (and upstream yaml) from use_modular_gd.py #24
Comments
just put the MIxS term requests second in the This also mostly resolves
Improvements on the NMDC terms could be driven to changes to the nmdc schema, or curations in tab |
example: https://microbiomedata/schema/ecosystem would prefer for that to appear as Current definition of
|
enums included so far, with enrichment success
|
error spotted by @turbomam See also microbiomedata/DataHarmonizer#24
many improvements from #24 Self-merging in order to see in GH pages
Tabulation of ranges for NMDC and MIxS as-is only To-do
enums
Handled already
|
New tabulations:
|
Looks like all tasks here are complete except for the one about hierarchical enums which is covered by other issues. Closing. |
if the
pattern
looks like a list, make it a enumeration/pulldownadd hierarchical indentation of enumerated values
Add support for partial date columns and time columns
align section composition and ordering with @mslarae13's Example Use tab
tidy the
descriptions
env_local_scale
. When MIxS is processed after NMDC, the quote-free MIxS annotations supersede.) See 'env_local_scale' description contains double quotes nmdc-schema#229GOLD sample identifiers
) See 'GOLD sample identifiers' description '[''identifiers for corresponding sample in GOLD'']' after gen-yaml nmdc-schema#228description
on some NMDC terms. See GOLD Path term desciption still "TODO" nmdc-schema#231add meanings for enumerated values
enum_annotator
(see example in Makefile)Ontology ID
sadd more
pattern
s based onpopulate
example
s column?are terms being included even though they are marked skip on
nmdc_biosample_slots
?Ontology ID
gen-yaml
NMDC and MIxS schemas,yq
delete imports sectionsterse
label
s (from apparent prioritization of NMDC over MIxS annotations?)parent class
es with "https:" prefixesalign section composition and ordering
belowfull URLs (prefer prefixed)
aboveseems like number of required fields too low
elaborate on the use of regular expressions in the
guidance
column. Also include the string serialization?Where is the
default
PV in thesample_type
enum coming from@click.option('--default_data_status', default="default", show_default=True)
what does the
Null values
section in the double-click header help mean? see What's the Null values section in the double-click-the-header help? cidgoh/DataHarmonizer#244data status
column indata.tsv
, which I was populating with--default_data_status
take advantage of min and max values for pH (anything else?)
whose id-like fields should be used? The ones from NMDC or ones created by @mslarae13
biosample_identification_slots
The text was updated successfully, but these errors were encountered: