-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
XDK: JN.1.1.1/XBB recombinant(11 seqs, 3 countries) #2415
Comments
Designated XDK via 6e78c60 |
Dear @aviczhl2, |
Currently manual search for reversions on usher https://genome.ucsc.edu/cgi-bin/hgPhyloPlace I'm developing some code to automatically detect recombs on full usher tree. But don't expect fast as I have other things to do. I think @JosetteSchoenma may have other ways. |
Yes, @liamxg . I use some specific queries to find recombinations with usually one breaking point. A query that finds samples that do not have a specific BA.2.86 mutation at the beginning, but do have one at the end. And another one for the other way around. Later on, I adjusted those a little, to exclude specific sublineages etc. If you look In issue #991 you can see which ones I use specifically. Hoping to have some time to update their results today. |
Dear @JosetteSchoenma, Thanks for your reply. hope there are gold reference sequences to be as background, I guess may be a tool for this special problems. |
@liamxg Could you elaborate, please? I do not really understand what you are saying. |
Dear @JosetteSchoenma, I mean some more sensitive tools to detect recombinants intra-VOCs or intra-VOIs or between VOCs and VOIs. |
What about them? |
I was wondering if you are aware of RIVET. Manuscript: https://academic.oup.com/bioinformatics/article/39/9/btad538/7257069 Below is the SNP plot for the closest match to your description that I found on our latest RIVET analysis. The corresponding node id corresponding to this node on the Dec 17 UShER-MAT is node_1453233. It is inferred as a recombinant between JN.1.1.1 with XBB.1.18.1 or XBB.1.28.1 or XBB.1.9.1. The predicted breakpoint interval is (3565,7842) or (3565,6183) or (3565,5720) for the three possible pairs, respectively. The predicted date of origin using Chronumental is 2023-11-12. I have also attached the list of descendants (node_1453233_descendants.txt). Also, we would welcome feedback or suggestions for RIVET to aid your analysis. |
It seems only use designated lineages as reference instead of all lineages on usher? Usually recombs have to be sub-lineage accurate to achieve the fewest private mutations possible. (Here using XBB.1.16.11+C5314T branch gives 0 private mutations while using designated lineages give at least 1 private mutation). |
Hi @aviczhl2, we do indeed use all sequences in the MAT (including ancestral corresponding to the internal nodes of the UShER-MAT) for the recombination analysis. RIVET uses the RIPPLES algorithm (https://www.nature.com/articles/s41586-022-05189-9) for searching exhaustively. The acceptor and donor sequences in RIVET correspond to some internal nodes in the UShER-MAT. Does that answer your question? |
thanks @yatisht for the suggestion and explanation |
Oh, I see. You seem to be using the node before C5314T as reference? |
That is right. Please see the subtree below: RIVET is picking the leftmost node (node_1453233) carrying A405G, A897C, A1762C, and T3431G as the recombinant node. Looks like C5314T mutation happens in its descendant and is not shared by all descendants. UShER subtree: link |
Why do use this? This is a recombinant island. The top large branch is XDD, middle are numerous different singlet BA.2.86*/XBB* recombinants, only the C5314T branch refers to XDK. |
I spoke to @aviczhl2 today over a call and we concluded that RIVET can indeed struggle to accurately identify recombinants in a complicated recombinant island such as this example. However, it can still be useful to locate these recombinant islands. We will try to add some features to flag suspicious recombinant islands. |
Dear @yatisht, |
@yatisht To filter out using spurious branches/islands or artefacts as candidate donors and be more robust, you could try using the Nextclade reference tree composed of only Pango lineages that have been checked to be real. Of course you loose sensitivity, but gain in specificity. Might also make it run much faster. I don't know what the input is that RIVET expects, if it's a MAT one could simply turn the Auspice json into a MAT (json at wget --output-document tree.json "https://nextstrain.org/charon/getDataset?prefix=staging/nextclade/sars-cov-2"). |
Dear @corneliusroemer, |
From sars-cov-2-variants/lineage-proposals#1126
JN.1.1.1+A405G, A897C(rev), A1762C(rev), T3431G(rev), C3565T(rev), C5314T
Orf1a: K47R, D211A(rev), L499F(rev), L1056V(rev)
XBB*--JN.1.1.1
Breakpoint between 5315 and 6182
A405G, A897C(rev), A1762C(rev), T3431G(rev), C3565T(rev), C5314T from XBB*
There are many branches of XBB* with 5314T, unable to specify.
GISAID query: A405G, A6183G, C5314T
No. of seqs: 11(France 9 Ireland 1 Belgium 1)
EPI_ISL_18541014, 2023-11-13, France
EPI_ISL_18636383, 2023-12-10, Belgium
usher
The text was updated successfully, but these errors were encountered: