Convey uncertainty via tip colors #1796

jameshadfield · 2024-06-28T01:12:31Z

The previous code conveyed uncertainty in node attrs for branches by making them appear grey-er, but we never implemented this for tips; most likely because we never had a dataset with such data when this was built.

Here we use the same approach for tips as for branches, but with a slightly different parameterisation of the interpolation. The mapping of the entropy value into [0,1] (tipOpacityFunction) was chosen so that tips with no (or very little) uncertainty look unchanged from previous Auspice versions, and uncertainty makes them appear more similar to the branch colour (for an equivalent uncertainty).

There should be no visible changes for views without any uncertainty (genotype is a good one to use to test this), as well as traits where there is uncertainty in the dataset but not in the tips (e.g. ebola country / division reconstruction). Here's a side-by-side with the h5n1-cattle-flu dataset from nextstrain/avian-flu#66, which identified this issue in Auspice (this PR LHS, current Auspice RHS):

Checks pass (CI passes, RTD is currently broken however that's unrelated to this PR)
If making user-facing changes, add a message in CHANGELOG.md summarizing the changes in this PR
(to be done by a Nextstrain team member) [Create preview PRs on downstream repositories][1].

Consuming code expected this to be a boolean and all actions which update this state set a boolean. Thankfully this means the default state was never used in practice.

jameshadfield · 2024-06-28T01:16:07Z

src/util/colorHelpers.js

  .clamp(true);
+const tipOpacityFunction = branchOpacityFunction
+  .copy()
+  .range([0, 0.9]); // if entropy close to 0 return the original node color


The range (and the domain) of this scale are the magic values which control how entropy values affect the stroke colour (via an interpolation between the original colour & grey). The tip fill colour is a brighter version of the stroke colour. Very happy for adjustments here, although I think it's important that tipOpacityFn(close-to-zero) -> 0 so that we don't change how the majority of datasets appear.

jameshadfield · 2024-06-28T02:06:10Z

Some URLs to compare this PR on nextstrain.org vs released Auspice on nextstrain.org:

Cattle-flu new & old

Zika (country) new & old. Note that this does have uncertainty for tips, which is kind of strange, but that's how augur traits currently works.

H3N2 (genotype view) new & old - no uncertainty here.

joverlee521

How would one be able to differentiate the grey scale for uncertainty vs grey scale for unprovided colorings? For example, imagine if zika's region had uncertainty, it would be mixed in with the "Asia" grey colorings.

(The default "Asia" coloring issue will be fixed when augur is released with nextstrain/augur#1490, but the question still stands for any other unprovided colorings)

joverlee521 · 2024-06-28T18:03:20Z

src/util/colorHelpers.js

 * @param {bool} confidence enabled?
 * @return {array} array of hex's. 1-1 with nodes.
 */
-export const calcBranchStrokeCols = (tree, confidence, colorBy) => {
+export const calculateStrokeColors = (tree, branch, confidence, colorBy) => {


not a request for change, just curious

Why combine branch/tip colors into one function with the branch flag when they are essentially completely different paths within the function?

While they are ~separate code paths, they both do the same thing: take a colour and modify it according to the node's uncertainty. Co-locating them feels natural to me and should help them to stay in-sync.

I actually thought about taking this further and refactoring it into a function calculateNodeColors -> {branchColors, tipStrokeColors, tipFillColors} so we calculate everything at once, but I'll leave that for another day (and I need to check that we never have a situation where we only need to recompute one of those sets).

trvrb · 2024-06-28T18:50:23Z

Awesome! This behavior looks spot on to me. Here's the current H5N1 cattle outbreak

Note that recent SRA tips are not completely gray. For example, the top clade descends from viruses sampled from South Dakota. These are appropriately colored a gray/green indicating potential South Dakota, but with little certainty.

The interpolation between SRA tips close to known Ohio viruses in blue to the the Michigan human case in lime also seems very appropriate.

I think Auspice is now doing exactly what it should be doing. However, we still should have a way to have a more data-informed decision about how to set --sampling-bias-correction. We could be doing leave 10% out cross validation as Gytis did in the 2019 BMC Evol Biol paper.

trvrb · 2024-06-28T19:05:53Z

How would one be able to differentiate the grey scale for uncertainty vs grey scale for unprovided colorings? For example, imagine if zika's region had uncertainty, it would be mixed in with the "Asia" grey colorings.

This is a really good point @joverlee521. The issue is that currently we use gray to mean either:

Unknown or uncertain
Uninteresting

This uninteresting take can be seen here https://nextstrain.org/ncov/gisaid/north-america/6m@2020-05-01 for example. This felt semantically appropriate to distinguish focal samples from background samples.

I think this is okay however... This example does DTA on samples with a focal vs contextual color ramp so that uncertain nodes and contextual nodes are both gray. This feels okay and appropriate (perhaps not ideal, but not broken). It highlights clades that are more certain to be in a focal region.

That said, we should be fixing colorings like the Zika example so that random location is not gray. In the Zika example, "Asia" should be blue, like it is for country.

The previous code conveyed uncertainty in node attrs for _branches_ by making them appear grey-er, but we never implemented this for _tips_; most likely because we never had a dataset with such data when this was built. Here we use the same approach for tips as for branches, but with a slightly different parameterisation of the interpolation. The mapping of the entropy value into `[0,1]` (`tipOpacityFunction`) was chosen so that tips with no (or very little) uncertainty look unchanged from previous Auspice versions, and uncertainty makes them appear more similar to the branch colour (for an equivalent uncertainty).

jameshadfield · 2024-07-01T00:15:31Z

How would one be able to differentiate the grey scale for uncertainty vs grey scale for unprovided colorings? For example, imagine if zika's region had uncertainty, it would be mixed in with the "Asia" grey colorings.

Very difficult at the moment! Let's continue discussion in [maybe] differentiate between nodes with uncertainty vs nodes missing from colour scale

[color-by confidence] fix default redux state

95172c3

Consuming code expected this to be a boolean and all actions which update this state set a boolean. Thankfully this means the default state was never used in practice.

nextstrain-bot temporarily deployed to auspice-james-uncertain-wu7geg June 28, 2024 01:12 Inactive

jameshadfield commented Jun 28, 2024

View reviewed changes

jameshadfield force-pushed the james/uncertain-tip-attrs branch from 57737fc to f9e654f Compare June 28, 2024 01:19

nextstrain-bot temporarily deployed to auspice-james-uncertain-wu7geg June 28, 2024 01:20 Inactive

jameshadfield added the preview on nextstrain.org label Jun 28, 2024

nextstrain-bot mentioned this pull request Jun 28, 2024

[bot] [DO NOT MERGE] Test Auspice PR 1796 nextstrain/nextstrain.org#936

Closed

joverlee521 reviewed Jun 28, 2024

View reviewed changes

jameshadfield force-pushed the james/uncertain-tip-attrs branch from f9e654f to c2ffa90 Compare June 30, 2024 23:50

nextstrain-bot temporarily deployed to auspice-james-uncertain-wu7geg June 30, 2024 23:50 Inactive

jameshadfield merged commit 3e818f0 into master Jun 30, 2024
25 of 26 checks passed

jameshadfield deleted the james/uncertain-tip-attrs branch June 30, 2024 23:56

jameshadfield mentioned this pull request Jul 1, 2024

[maybe] differentiate between nodes with uncertainty vs nodes missing from colour scale #1797

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convey uncertainty via tip colors #1796

Convey uncertainty via tip colors #1796

jameshadfield commented Jun 28, 2024 •

edited

Loading

jameshadfield Jun 28, 2024

jameshadfield commented Jun 28, 2024

joverlee521 left a comment

joverlee521 Jun 28, 2024

jameshadfield Jul 1, 2024

trvrb commented Jun 28, 2024

trvrb commented Jun 28, 2024

jameshadfield commented Jul 1, 2024

Convey uncertainty via tip colors #1796

Convey uncertainty via tip colors #1796

Conversation

jameshadfield commented Jun 28, 2024 • edited Loading

jameshadfield Jun 28, 2024

Choose a reason for hiding this comment

jameshadfield commented Jun 28, 2024

joverlee521 left a comment

Choose a reason for hiding this comment

joverlee521 Jun 28, 2024

Choose a reason for hiding this comment

jameshadfield Jul 1, 2024

Choose a reason for hiding this comment

trvrb commented Jun 28, 2024

trvrb commented Jun 28, 2024

jameshadfield commented Jul 1, 2024

jameshadfield commented Jun 28, 2024 •

edited

Loading