ENH: Split up deletions and substitutions in tooltip as deletions are overwhelming and uninformative #1537

corneliusroemer · 2022-07-22T12:10:58Z

Context

When inspecting trees, I often find that the list of deletions overwhelms the display.

Description

Deletions are often very noisy on trees as we don't account for them properly when building a tree.

Yet they take up most of the space because they come in stretches that aren't compressed.

Examples

So I end up with a result like this:

The needle (the two really important nucleotide substitutions) is really hard to find in the haystack (the long list of deletions).

From my perspective as a user, making deletions less overwhelming would be really great, and it is quite high priority.

This has actually impeded me for a while, but it never occurred to me to write it up as an issue.

Possible solution

@victorlin do you think this is something you could have a look at? There are a few things we could try here:

Simply split out substitutions and deletions (maybe easiest and quickest, maybe stopgap until we have)
Compress deletion stretches (and maybe separate them out, too)
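
To make the two ideas above concrete, here is a minimal sketch in Python. It is not Auspice's code (Auspice is written in JavaScript); it assumes mutations arrive as strings of the form `<from><position><to>` with `-` marking a deleted base, and the function names `split_mutations`, `compress_runs` and `format_runs` are made up for illustration.

```python
import re

# Assumed mutation format: one character, a position, one character, e.g. "G42-".
MUTATION = re.compile(r"^([A-Z-])(\d+)([A-Z-])$")


def split_mutations(mutations):
    """Separate deletions (anything ending in '-') from all other mutations."""
    others, deleted_positions = [], []
    for mutation in mutations:
        match = MUTATION.match(mutation)
        if match is None:
            raise ValueError(f"Unrecognised mutation string: {mutation!r}")
        _, position, to_base = match.groups()
        if to_base == "-":
            deleted_positions.append(int(position))
        else:
            # Everything that does not end in a gap stays here in this sketch.
            others.append(mutation)
    return others, sorted(deleted_positions)


def compress_runs(positions):
    """Collapse sorted, unique positions into inclusive (start, end) runs."""
    runs = []
    for position in positions:
        if runs and position == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], position)  # extend the current run
        else:
            runs.append((position, position))  # start a new run
    return runs


def format_runs(runs):
    """Render runs compactly, e.g. (42, 44) becomes '42..44'."""
    return [str(start) if start == end else f"{start}..{end}" for start, end in runs]


substitutions, deletions = split_mutations(
    ["C100T", "G42-", "A43-", "T44-", "A200G", "C50-"]
)
print("Substitutions:", ", ".join(substitutions))
print("Deletions:", ", ".join(format_runs(compress_runs(deletions))))
```

With this shape, the two substitutions get a line of their own, and a long stretch of deletions shrinks to one entry per run.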

jameshadfield · 2022-09-12T01:51:04Z

Related (private) Slack thread here. A summary of this:

Auspice already separates out deletions nicely (e.g. G42-) and groups them into runs. This problem is due to "undeletions", which are almost certainly a bioinformatics problem, albeit a hard one to fix. @corneliusroemer suggested calling these a "reversion of deletion to reference", which seems like a good solution. Some considerations:

We should group these together into runs, like we do with deletions.
They should not be listed in the "unique mutations", "Homoplasies" and "Reversions to root" categories.
I don't think these should be restricted to reversion of deletion to reference - they should cover any change from a deletion to a base. Although this would obscure the interesting case when it's a base which differs from the ancestral node...
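
The considerations above could be sketched roughly as follows. This is an illustration in Python under stated assumptions, not the Auspice implementation: mutation strings are assumed to look like `G42-` (deletion) or `-42G` (undeletion, i.e. any gap-to-base change), and `categorise` and `group_runs` are hypothetical names.

```python
import re

# Assumed mutation format: <from><position><to>, with '-' meaning a gap.
MUTATION = re.compile(r"^([A-Z-])(\d+)([A-Z-])$")


def categorise(mutations):
    """Bucket mutations so undeletions never reach the substitution categories."""
    buckets = {"substitutions": [], "deletions": [], "undeletions": []}
    for mutation in mutations:
        match = MUTATION.match(mutation)
        if match is None:
            raise ValueError(f"Unrecognised mutation string: {mutation!r}")
        from_base, position, to_base = match.groups()
        if from_base == to_base:
            raise ValueError(f"Mutation does not change anything: {mutation!r}")
        if to_base == "-":
            buckets["deletions"].append(int(position))
        elif from_base == "-":
            # Any gap-to-base change, not only a reversion to the reference base.
            buckets["undeletions"].append(int(position))
        else:
            buckets["substitutions"].append(mutation)
    return buckets


def group_runs(positions):
    """Group positions into inclusive (start, end) runs of consecutive sites."""
    runs = []
    for position in sorted(set(positions)):
        if runs and position == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], position)
        else:
            runs.append((position, position))
    return runs


result = categorise(["C100T", "G42-", "A43-", "-60A", "-61C", "-62G", "-70T"])
print("Substitutions:", result["substitutions"])
print("Deletion runs:", group_runs(result["deletions"]))
print("Undeletion runs:", group_runs(result["undeletions"]))
```

Only the `substitutions` bucket would then feed categories such as "unique mutations" or "Homoplasies". Note that grouping by position alone discards the restored base, which is exactly the information needed for the interesting case where that base differs from the ancestral node.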

corneliusroemer added the enhancement New feature or request label Jul 22, 2022

corneliusroemer added this to Nextstrain planning (archived) Jul 22, 2022

corneliusroemer moved this to New in Nextstrain planning (archived) Jul 22, 2022

huddlej moved this from New to Backlog in Nextstrain planning (archived) Jul 26, 2022

jameshadfield mentioned this issue Sep 15, 2022

Feat/undeletions #1542

Merged

jameshadfield closed this as completed in #1542 Sep 15, 2022

Repository owner moved this from Backlog to Done in Nextstrain planning (archived) Sep 15, 2022
