Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (104 seq, Jan 25) #1385

Closed
ryhisner opened this issue Dec 1, 2022 · 18 comments
Assignees
Labels
BQ.1 designated Saltation Appears on long branch length with no intermediates
Milestone

Comments

@ryhisner
Copy link

ryhisner commented Dec 1, 2022

Description
Sub-lineage of: BQ.1.1
Earliest sequence: 2022-9-15, Malaysia, EPI_ISL_15942617
Most recent sequence: 2022-11-15, Indonesia — EPI_ISL_15966070
Countries circulating: Indonesia (8), Malaysia (1)
Number of Sequences: 9
GISAID Query: Spike_K182E, Spike_Q218H, Spike_P1162L
CovSpectrum Query: Nextcladepangolineage:BQ.1* & S:K182E & S:Q218H
Substitutions on top of BQ.1.1.:
Spike: K182E, Q218H, P251L, P1162L (and A829S for 4/9 sequences)
Nucleotide: T19293C, A22106G, G22216T, C22314T, T23716C, C25047T, A28834G

USHER Tree
The bottom branch here appears to lack S:P251L but that is almost certainly an artifact. All four sequences lack coverage in that area. Also, the long branch is 100% artifactual, the seven “reversions” obviously being due to lack of sequencing coverage in the RBM. Only the ORF1a:P1947S (C4754T) mutation on that branch is real.
https://nextstrain.org/fetch/raw.githubusercontent.com/ryhisner/jsons/main/BQ.1.1%20%2B%20K182E%2C%20Q218H%2C%20P251L%2C%20P1162L%20-%20subtreeAuspice1_genome_26c49_8934f0.json
image

Evidence
The first sequence in this lineage is the only one from Malaysia and was collected on September 15, though it was only uploaded November 29. The other eight sequences, all from Indonesia, were collected nearly two months later—one on Nov 2 and the rest between Nov 11-15. There are many countries in this region with little to no sequencing, so it seems likely this lineage has been circulating in significant numbers somewhere in SE Asia.

There is really not much diversity in this lineage considering the two-month time difference between the Malaysian sequence and the others. As stated above, the long branch on the tree is definitely an artifact.

Notably, four of the more recent sequences (collection dates Nov 2, 11, 11, 15) also have S:A829S. It's possible that A829S is also present in the other four Indonesian sequences but is not indicated due to the relatively poor quality of those sequences.

Genomes

Genomes EPI_ISL_15754441, EPI_ISL_15826723, EPI_ISL_15901232, EPI_ISL_15901237, EPI_ISL_15901277, EPI_ISL_15901279, EPI_ISL_15942617, EPI_ISL_15966062, EPI_ISL_15966070,
@ryhisner
Copy link
Author

ryhisner commented Dec 6, 2022

The first sequence from Singapore showed up today—a local case collected on November 26. It does not have S:A892S. Brings the current total number of sequences to 10.
EPI_ISL_16016839

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (9 seq, ≥4 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (10 seq, ≥4 with S:A829S) Dec 6, 2022
@ryhisner
Copy link
Author

ryhisner commented Dec 7, 2022

Two more from Indonesia today. Both lacking coverage at S:251, but every sequences that has had coverage there has had S:P251L, so these presumably do as well. Total sequences now 12. EPI_ISL_16027258, EPI_ISL_16027287

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (10 seq, ≥4 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (12 seq, ≥4 with S:A829S) Dec 7, 2022
@ryhisner
Copy link
Author

ryhisner commented Dec 11, 2022

One sequence from Iceland now, collection date December 5. It has S:A829S. EPI_ISL_16053824

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (12 seq, ≥4 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (13 seq, ≥5 with S:A829S) Dec 12, 2022
@thomasppeacock thomasppeacock added the Saltation Appears on long branch length with no intermediates label Dec 12, 2022
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (13 seq, ≥5 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (14 seq, ≥5 with S:A829S) Dec 13, 2022
@corneliusroemer
Copy link
Contributor

Interesting stuff, let's watch a bit more to see where to delineate

@corneliusroemer corneliusroemer added the monitor currently too small, watch for future developments label Dec 15, 2022
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (14 seq, ≥5 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (17 seq, ≥5 with S:A829S) Dec 15, 2022
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (17 seq, ≥5 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (20 seq, ≥9 with S:A829S) Dec 20, 2022
@ryhisner
Copy link
Author

First sequence from Germany came in yesterday, non-A892S branch. EPI_ISL_16218926

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (20 seq, ≥9 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (21 seq, ≥9 with S:A829S) Dec 22, 2022
@ryhisner
Copy link
Author

Six sequences uploaded from Singapore today bring the total number of sequences to 28. All six have S:A829S. Collection dates Dec 16-18.

Oddly, two have S:T444M. We've seen S:T444M occasionally in various BQ* branches. I think it's probably slightly deleterious, but it's a C-->T nuc mutation, which I think accounts for it happening every now and then. Probably APOBEC'd.

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (21 seq, ≥9 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (28 seq, ≥15 with S:A829S) Dec 23, 2022
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (28 seq, ≥15 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (36 seq, ≥19 with S:A829S) Dec 27, 2022
@ryhisner
Copy link
Author

Eight new sequences uploaded from this lineage so far today, including the first sequences from Thailand (2) and Japan (1). This brings the total to 36 sequences. The four most recently collected sequences uploaded today are from Singapore, and they all have S:A829S. The other four do not have S:A829S.

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (36 seq, ≥19 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (38 seq, ≥19 with S:A829S) Dec 30, 2022
@ryhisner
Copy link
Author

The first sequence from North America was uploaded today from Canada, as was the second sequence from Germany. The Canadian sequence, along with all the others that appear to lack S:P251L on the Usher tree, lacks coverage in that region of spike, so I think it's safe to say that all 38 sequences do in fact possess S:P251L. The "long" branch that's cut off in the picture below is full of false "reversions." If those two sequencing artifacts are taken into account, the diversity in this lineage is much less than what it appears to be if you just look at the tree.
image

@ryhisner
Copy link
Author

ryhisner commented Jan 3, 2023

Four new sequences uploaded today—including the first two from England and the first from the USA (Texas)—bring the total to 43 sequences.

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (38 seq, ≥19 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (43 seq, 20 with S:A829S) Jan 3, 2023
@ryhisner
Copy link
Author

ryhisner commented Jan 5, 2023

One more sequence from the USA (Texas) and three more from Canada today, one of which has S:Y144del, a first for this lineage. Total 47 sequences.

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (43 seq, 20 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (47 seq, 20 with S:A829S) Jan 5, 2023
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (47 seq, 20 with S:A829S) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (47 seq, 20 with S:A829S, Jan 4) Jan 5, 2023
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (47 seq, 20 with S:A829S, Jan 4) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (68 seq, Jan 9) Jan 9, 2023
@ryhisner
Copy link
Author

ryhisner commented Jan 9, 2023

12 new sequences of this lineage today: six from Canada (Ontario) and six from England. This brings the total to 68 sequences, an increase of 21 in the past four days.

@thomasppeacock thomasppeacock removed the monitor currently too small, watch for future developments label Jan 11, 2023
@thomasppeacock thomasppeacock added recommended Recommended for designation by pango team member and removed BA.5 labels Jan 11, 2023
@ryhisner
Copy link
Author

Now up to 87 sequences. Six from England and one from Scotland uploaded so far today. There's a branch in Northern Ireland with S:D111N as well.

@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (68 seq, Jan 9) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (87 seq, Jan 16) Jan 16, 2023
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (87 seq, Jan 16) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (90 seq, Jan 17) Jan 18, 2023
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (90 seq, Jan 17) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (97 seq, Jan 22) Jan 22, 2023
@ryhisner ryhisner changed the title Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (97 seq, Jan 22) Sublineage of BQ.1.1 with S:K182E, S:Q218H, S:P251L, S:P1162L (104 seq, Jan 25) Jan 25, 2023
@InfrPopGen InfrPopGen self-assigned this Jan 27, 2023
InfrPopGen added a commit that referenced this issue Jan 27, 2023
Added new lineage BQ.1.1.39 from #1385 with 86 new sequence designations, and 0 updated
@InfrPopGen InfrPopGen added this to the BQ.1.1.39 milestone Jan 27, 2023
@InfrPopGen InfrPopGen added designated and removed recommended Recommended for designation by pango team member labels Jan 27, 2023
@InfrPopGen
Copy link
Contributor

Thanks for submitting. We've added lineage BQ.1.1.39 with 86 newly designated sequences, and 0 updated. Defining mutations A22106G (S:K182E), G22216T (S:Q218H) (following T19293C, T23716C, A28834G). There was no clear extra growth advantage at this time with S:P251L or S:A829S, and the chosen node also coincides with a shift from mainly western European to a more cosmopolitan detection.

@jinyu-ncbi
Copy link

BQ.1.1.28, BQ.1.1.38, BQ.1.1.39 all have S:P251L. It seems a lineage should be designated to the one with S:P251L, which has sublineages BQ.1.1.28, BQ.1.1.38 and BQ.1.1.39.

@FedeGueli
Copy link
Contributor

BQ.1.1.28, BQ.1.1.38, BQ.1.1.39 all have S:P251L. It seems a lineage should be designated to the one with S:P251L, which has sublineages BQ.1.1.28, BQ.1.1.38 and BQ.1.1.39.

Did you check they are monophyletic stemming out from th same S:251L politomy or instead it is just homoplasy (that i think is likely)?

@jinyu-ncbi
Copy link

Looks to me S:P251L appeared before other mutations:

S:P251L (C22314T)
-> C18828T, C16393T -> C6701T (BQ.1.1.28)
-> A24129G, T7633C, A19703G, C19269T, C23664T (BQ.1.1.38)
-> T19293C, A22106G, G22216T, T23716C, C25047T, A28834G (BQ.1.1.39)
-> T18383C, T358C (not designated)

It is less likely other mutations appeared first, and then a single mutation C22314T was added to all these lineages.

@AngieHinrichs
Copy link
Member

The branch with BQ.1.1 > C22314T has 393 sequences. Scanning by eye I see dates from 2022-09-29 to 2023-01-16. BQ.1.1.38 (261 sequences) is on that branch: C22314T > T7633C > A24129G.

The branch to BQ.1.1.28 starts with BQ.1.1 > C18828T -- there are about 1,000 sequences with just C18828T with dates from 2022-09-14 (USA/CA-CDC-STM-UCMS9EZJ6/2022 which is a couple mutations after C18828T). Then BQ.1.1 > C18828T > C16393T has only 13 sequences (including Ghana and Chile 2022-10-07) so not as much data there to guess the real date range. Then BQ.1.1.28 (C18828T > C16393T > C22314T) has 752 sequences; the earliest I see is 2022-09-21 (Israel/ICH-741187816/2022) although next after that is 2022-09-23 from Ghana (Ghana/NMIMR-CT-22-2226/2022). So based on the number of sequences with BQ.1.1 > C18828T first, and the progression of dates, I think that order of mutations is plausible.

BQ.1.1.39 is BQ.1.1 > C25047T > T19293C,T23716C,A28834G > A22106G,G22216T. There are ~211 non-BQ.1.1.39 sequences on BQ.1.1 > C25047T, earliest England/PLYM-3258AEE8/2022|2022-09-30 (with a couple of other mutations). BQ.1.1.39 dates start at Malaysia/IMR_OS7240/2022|2022-09-15. To me that indicates unsampled spread of C25047T before that date. From that point there were ~80 sequences from SE Asia, Canada and Europe that had BQ.1.1, C25047T, T19293C,T23716C,A28834G, A22106G, and G22216T -- but not C22314T. BQ.1.1.39 has a branch of 25 sequences that has all those mutations, plus C11020T, G18445A, C28948T. Then finally, there is a branch of 23 of the 25 sequences that also have C22314T. Those are all in the US and Canada and have dates from 2022-12-12 onward. It sure seems to me like C22314T happened after all of those BQ.1.1.39 mutations.

I was just reminded that it's Friday night so I'll leave it at that -- but yeah, like @FedeGueli said, homoplasy seems likely.

@jinyu-ncbi
Copy link

jinyu-ncbi commented Jan 30, 2023

Thank you, @AngieHinrichs. It seems the sequence records we see are slightly different. Anyway, it is possible that C22314T is homoplasy that appeared independently in BQ.1.1.28, BQ.1.1.38, BQ.1.1.39. If C18828T showed up earlier than other mutations (including C22314T) in BQ.1.1, then the C18828T branch (we have 901 sequences) have at least the following four sublineages:

  1. C22314T (S:P251L), C16393T (helicase:P53S) > C6701T (nsp3:L1328F): 408 sequences.
  2. C28138T (ORF8:S82F), C27944T, G17679T, C10228T, C1377A, (nsp2:P191Q) > G20931T: 175 sequences.
  3. T22119C (S:F186S), G28904A (N:211T), G4255A, C1627T, C21648T (S:T29I): 156 sequences.
  4. A1202T (nsp2:N133Y), C17436T > C5497T: 78 sequences.

I don’t know if lineages have been added or proposed for them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BQ.1 designated Saltation Appears on long branch length with no intermediates
Projects
None yet
Development

No branches or pull requests

7 participants