ENH(nextclade cli): nextclade dataset list: indicate whether clades can be assigned #1458
Comments
Hi Angie,
We released 3.6.0 earlier today, in which clades become optional even if the tree is present. Previously, our folks used an empty string in place of
Currently, I'd say downloading the tree and checking whether there's at least one
In the official datasets in the data repo, when rebuilding the dataset index, we could enumerate each dataset's "capabilities". I have some basics emitted into the index.json of the dataset server, but not clade assignment. It might be a good addition. Do you have any other such capabilities in mind that we could add? I am having difficulty imagining how that would look from the user's perspective, as I don't use Nextclade often myself :)
Once we have a list of capabilities in the index, the
We should also not forget about clade-like attributes, which may also be present on the tree in
The tree-related capabilities could be computed in the rebuild script somewhere around here, I guess
Yes, if you could add
Implemented in #1473 and nextstrain/nextclade_data#205
Released in 3.7.0
Fantastic, thanks! The types and counts are really helpful!
In the output of nextclade dataset list it would be very helpful to have an indication of whether clades can be assigned using each dataset. For example, dataset nextstrain/flu/h3n2/ha/EPI1857216 can assign clades, but nextstrain/flu/h3n2/pb1 cannot (it has no tree.json). Currently, in order to determine that, I need to download each dataset and look for tree.json.
Does the presence of tree.json in a dataset always mean that clades can be assigned? If so, then hopefully it would be straightforward for nextclade dataset list to report whether pathogen.json includes treeJson.
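For anyone who needs this check before an index-level capability is available, the manual workaround described in this thread can be scripted roughly as follows. This is only a sketch under assumptions, not how Nextclade itself decides: it assumes that pathogen.json lists its tree under files.treeJson, that tree.json is an Auspice v2 file whose root node sits under the "tree" key, and that clades are stored per node in node_attrs.clade_membership.value. The helper names (iter_nodes, declares_tree, tree_has_clades) are made up for illustration.

```python
from typing import Any, Dict, Iterator


def iter_nodes(root: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Yield every node of the tree, walking iteratively to avoid deep recursion."""
    stack = [root]
    while stack:
        node = stack.pop()
        yield node
        stack.extend(node.get("children", []))


def declares_tree(pathogen: Dict[str, Any]) -> bool:
    """True if the parsed pathogen.json names a tree file (assumed key: files.treeJson)."""
    return bool(pathogen.get("files", {}).get("treeJson"))


def tree_has_clades(tree_json: Dict[str, Any]) -> bool:
    """True if at least one node carries a non-empty clade value.

    Assumes the Auspice v2 layout: root node under "tree", clade under
    node_attrs.clade_membership.value. Empty strings count as "no clade".
    """
    root = tree_json.get("tree")
    if not isinstance(root, dict):
        return False
    for node in iter_nodes(root):
        clade = node.get("node_attrs", {}).get("clade_membership", {}).get("value")
        if isinstance(clade, str) and clade.strip():
            return True
    return False
```

To use it, load a downloaded dataset's pathogen.json and tree.json with json.load and pass the resulting dictionaries to the two functions. A tree with only empty clade values is reported as unable to assign clades, which matches the point above that the presence of a tree alone is not sufficient.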