Designate JN.1.39 (G2782T)
corneliusroemer committed Mar 25, 2024
1 parent 4c84077 commit c193ce6
Showing 2 changed files with 2,674 additions and 2 deletions.
1 change: 1 addition & 0 deletions lineage_notes.txt
2,675 changes: 2,673 additions & 2 deletions lineages.csv

4 comments on commit c193ce6

@xz-keg (Contributor) commented on c193ce6 Mar 25, 2024

The correct trajectory of JN.1.33 should be JN.1->G2782T->T111C->C5512T->S:A67V. UShER places it in the wrong spot because some sequences lack coverage at position 111.

So this lineage (JN.1.39) should be the parent of JN.1.33. Perhaps JN.1.33 should be redesignated as JN.1.39.1.

@FedeGueli (Contributor)

cc @corneliusroemer @AngieHinrichs: an important comment by @aviczhl2, who first noticed this.

@AngieHinrichs (Member)

JN.1.33 is currently placed at JN.1 > C5512T > G2782T > C21762T. JN.1 > C5512T has 90 sequences, with dates starting ~2023-11-09 (preceding JN.1.33 dates), without G2782T etc. So it's not impossible that C5512T came first for these. Why should two occurrences of G2782T be less plausible than two occurrences of C5512T? I think this is one of many cases where we can't tell between two occurrences of one mutation, two occurrences of a different mutation, or some other thing like recombination.
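The tie described here can be checked by counting mutation events. The sketch below is only an illustration under simplifying assumptions: the node names and the edge-list representation are made up for this example, and it is not how UShER stores or scores trees. It builds the two candidate topologies for the three observed haplotypes (only C5512T, only G2782T, both) and counts how often each mutation has to occur.

```python
from collections import Counter

ROOT = "JN.1"

# Each tree is a list of (parent, child, mutation) edges.
# Node names are hypothetical labels for the three observed haplotypes.
tree_c5512t_first = [
    (ROOT, "only_C5512T", "C5512T"),
    ("only_C5512T", "both", "G2782T"),
    (ROOT, "only_G2782T", "G2782T"),   # second occurrence of G2782T
]
tree_g2782t_first = [
    (ROOT, "only_G2782T", "G2782T"),
    ("only_G2782T", "both", "C5512T"),
    (ROOT, "only_C5512T", "C5512T"),   # second occurrence of C5512T
]


def node_haplotypes(edges):
    """Return {node: mutations accumulated on the path from the root}."""
    haplotypes = {ROOT: frozenset()}
    pending = list(edges)
    while pending:
        unresolved = []
        for parent, child, mutation in pending:
            if parent in haplotypes:
                haplotypes[child] = haplotypes[parent] | {mutation}
            else:
                unresolved.append((parent, child, mutation))
        if len(unresolved) == len(pending):
            raise ValueError("edge list is not connected to the root")
        pending = unresolved
    return haplotypes


def mutation_events(edges):
    """Count how many times each mutation occurs on the tree."""
    return Counter(mutation for _, _, mutation in edges)


for name, tree in [("C5512T first", tree_c5512t_first),
                   ("G2782T first", tree_g2782t_first)]:
    events = mutation_events(tree)
    print(name, dict(events), "total:", sum(events.values()))
```

Both trees explain the same three haplotypes with three events in total; they differ only in which mutation is counted twice, so a pure event count cannot prefer one over the other.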

Also, the pangolin program masks all sites in the untranslated regions (UTR) at the beginning and end of the genome (1-265 and 29674-29903), so pangolin masks position 111, so that can't be part of the definition of a lineage that pangolin is going to call. I could mask position 111 in JN.1 in order to simplify these and perhaps make a hope of joining them, but still parsimony can't tell the difference between C5512T > G2782T and G2782T > C5512T when there are some sequences that have only C5512T, some sequences that have only G2782T, and some sequences that have both.
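As a small illustration of the masking point, the following sketch filters a mutation list against the two UTR ranges quoted above. The function names are hypothetical and this is not pangolin's own code; only the coordinate ranges come from the comment.

```python
import re

# UTR ranges as quoted in the comment (1-based, inclusive).
MASKED_UTR_RANGES = [(1, 265), (29674, 29903)]


def mutation_position(mutation):
    """Extract the genome position from a nucleotide mutation such as 'T111C'."""
    match = re.fullmatch(r"[ACGT](\d+)[ACGT]", mutation)
    if match is None:
        raise ValueError(f"not a nucleotide substitution: {mutation!r}")
    return int(match.group(1))


def is_masked(position):
    """True if the position falls inside one of the masked UTR ranges."""
    return any(start <= position <= end for start, end in MASKED_UTR_RANGES)


def unmasked_mutations(mutations):
    """Keep only the mutations that survive UTR masking."""
    return [m for m in mutations if not is_masked(mutation_position(m))]


print(unmasked_mutations(["G2782T", "T111C", "C5512T", "C21762T"]))
```

T111C falls inside 1-265 and is dropped, which is why it cannot contribute to a lineage definition that pangolin is expected to call.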

@xz-keg (Contributor) commented on c193ce6 Apr 13, 2024

> JN.1.33 is currently placed at JN.1 > C5512T > G2782T > C21762T. JN.1 > C5512T has 90 sequences, with dates starting ~2023-11-09 (preceding JN.1.33 dates), without G2782T etc. So it's not impossible that C5512T came first for these. Why should two occurrences of G2782T be less plausible than two occurrences of C5512T? I think this is one of many cases where we can't tell between two occurrences of one mutation, two occurrences of a different mutation, or some other thing like recombination.
>
> Also, the pangolin program masks all sites in the untranslated regions (UTR) at the beginning and end of the genome (1-265 and 29674-29903), so pangolin masks position 111, so that can't be part of the definition of a lineage that pangolin is going to call. I could mask position 111 in JN.1 in order to simplify these and perhaps make a hope of joining them, but still parsimony can't tell the difference between C5512T > G2782T and G2782T > C5512T when there are some sequences that have only C5512T, some sequences that have only G2782T, and some sequences that have both.

There's no need to mask position 111. Just add T111C to the sequences on JN.1 > C5512T > G2782T > C21762T that lack T111C only because of missing coverage at position 111.
I'd guess UShER would then rearrange the tree automatically, since G2782T > T111C is two mutations while C5512T is only one.
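The two-versus-one argument can be made concrete with a toy placement cost. This is a deliberately simplified stand-in, not UShER's actual placement algorithm: the cost is just the number of mutations by which a sample differs from a candidate node, skipping positions the sample has no coverage for. The function and variable names are hypothetical, and C21762T is left out because it would add the same amount to every cost.

```python
def position(mutation):
    """Genome position of a nucleotide mutation written like 'T111C'."""
    return int(mutation[1:-1])


def placement_cost(sample_mutations, node_mutations, uncovered=frozenset()):
    """Count mutations that differ between sample and node, ignoring uncovered sites."""
    differing = set(sample_mutations) ^ set(node_mutations)
    return sum(1 for m in differing if position(m) not in uncovered)


# Candidate attachment points, as mutations relative to JN.1.
under_c5512t = {"C5512T"}
under_g2782t_t111c = {"G2782T", "T111C"}

# A JN.1.33-like sequence with no coverage at position 111.
low_coverage = {"C5512T", "G2782T"}
print(placement_cost(low_coverage, under_c5512t, uncovered={111}),
      placement_cost(low_coverage, under_g2782t_t111c, uncovered={111}))

# The same sequence after T111C has been filled in.
imputed = low_coverage | {"T111C"}
print(placement_cost(imputed, under_c5512t),
      placement_cost(imputed, under_g2782t_t111c))
```

Under this toy cost the low-coverage sequence is tied between the two attachment points, while the version with T111C filled in is one mutation closer to the G2782T > T111C branch.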
