-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Potential BA.1/BA.2 Recombinant Lineage with Likely Breakpoint at NSP5/NSP6 (267 Sequences in the UK and Ireland) #454
Comments
20 new sequences uploaded yesterday. 165 in total as of March 1st, 2022. |
208 sequences now and just popped out in UK-Wales. |
267 sequences as of 2022-03-08. Also found in Wales. |
How do you monitor the number of sequences @c19850727? Could you share the covspectrum query? That'd be great. |
@corneliusroemer Sure I used to use C3241T, T5386G, C12880T, C14599T, C15714T, A20055G, A29510C, Basically C14599T is the key. |
@AngieHinrichs I think these get excluded from the Usher tree atm? (I guess because they look like artefacts) Would one have to manually place a few for them to start appearing? We have looked into a bit at our side as data-generators, and they do seem legit |
@theosanderson Yes, it looks like they are being excluded from the tree because of some new quality filters that I added at the beginning of Feb. 2022 to deal with Omicron sequence quality / amplicon dropout / assembly issues. I have a file of IDs that I'm gleaning from pango-designation issues and exempting from those checks, but a) it's manually maintained and I have fallen a bit behind this week and b) I need to translate EPI_ISL_ IDs into GenBank IDs where possible and sometimes there's a lag before the GenBank ID becomes available so there's another way for my file to fall behind. It looks like the 2022-03-09 tree has 1 GISAID and 18 GenBank IDs but there should be a lot more.
Those names/IDs can be pasted into the UShER web interface, but yeah, if you want to see a tree with sequences other than those today you'll need to upload fasta or run usher locally. I am working on updating the exemption-file today. The sequences should be in the 2022-03-11 or 2022-03-12 build (which should become available in another 2 or 3 days). (There are still other things that cause sequences to be omitted from the tree, I'll keep an eye out for those too.) Thanks @c19850727 for updating the file Genomes 20220309.txt in the description! |
Thanks for the info and amazing work as ever @AngieHinrichs, I now have a much better sense of how that works! |
In the last 13 days, the growth advantage with respect to BA.2* in the UK seems to have reduced. The growth advantage could possibly be due to something akin to a "funnel plot" bias, anything we observe will show growth advantage at first, otherwise it wouldn't be observed. Will be interesting to keep an eye on, here's the query. |
Yes @corneliusroemer , and if we filter it to UK from Jan. 13th, the growth advantage would be 10% only. |
Thanks @c19850727 We've added this as lineage XE in v1.2.133 |
Great take Saka! Ukhsa today estimated it in 9,8% |
Description
Recombinant between: BA.1* & BA.2
Earliest sequence: 2022/1/16 (UK-England)
Most recent sequence: 2022/2/21 (UK-England)
Countries circulating: UK (England and Scotland), Ireland
Likely breakpoint: between 10448 and11287 (NSP5 or NSP6).
Conserved Nuc mutations and AA changes (those in red frames are likely from the donor from the BA.1 side):
Private mutations: C14599T
Evidence
Nextclade downsampled tree (unrooted):
Nextclade downsampled tree (starting from 21M):
Usher tree (interestingly Usher puts #448 in the sibling branch):
https://nextstrain.org/fetch/genome-euro.ucsc.edu/trash/ct/subtreeAuspice1_genome_euro_b767_d79a40.json?branchLabel=aa%20mutations&c=userOrOld&label=nuc%20mutations:G8393A,C25624T
International comparison, geographical distribution and relative growth advantage as per Cov-spectrum:
Relative growth advantage over BA.2 in the UK from Jan. 1st, 2022 as per Cov-spectrum:
(Cov-spectrum somehow only shows 118 sequences out of a total of 145)
Genomes:
Genomes 20220309.txt
The text was updated successfully, but these errors were encountered: