-
Notifications
You must be signed in to change notification settings - Fork 163
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feat/undeletions #1542
Feat/undeletions #1542
Conversation
Undeletions are inferred mutations from a gap to a base, e.g. `-400A`. In most cases these are bioinformatics errors, so this commit collects them into a separate category to enhance interoperability of lots of mutations. The actual base is visible if you click to copy the mutation list to the clipboard, but in my testing on nCoV datasets this is always the reference base.
Work prompted by the preceeding commit, but I don't think any changes are actually needed. Note that depending on how the dataset was generated you can get what look like undeletions when the root sequence is a gap itself. I think this is rarer than undeletions on branches so not worth categorising at the present time. Added tests as the clearest way to document the behaviour here.
We could say 'deletion reversion' rather than 'undeletion' - just depends on space I suppose! One could extend the description of gaps to say something like "A change to a '-' character, indicating a missing base. These can indicate deletions but sometimes areas of no coverage are filled with gaps." - but I've no strong feelings and this may be too wordy. Overall I think this looks good, I like the mouseover option to see things more clearly! Thanks James! |
ff9d0d4
to
8c7b60c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Really happy to see the tooltip for the mutations! IIRC, we've had many questions regarding what the mutation labels mean.
8c7b60c
to
ab21e4e
Compare
Closes #1537, which has context on the rational for this PR.
Closes #1469
Undeletions (can we think of a better word than "undeletion"?!?) on branches are split out into a separate category and grouped into intervals, like so:
This is not done for tips, as these don't report mutations but rather changes between the root and the tip. While undeletions can be present here, it's rarer and I don't think any of our core builds would have this. See the tests introduced in 9fa1298 for examples of this.
The third commit adds a info-popup detailing what the different categories are and how mutations are grouped into them.