Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Edges in the graph are not updated with refound genes #141

Closed
hdore opened this issue Dec 3, 2021 · 3 comments
Closed

Edges in the graph are not updated with refound genes #141

hdore opened this issue Dec 3, 2021 · 3 comments

Comments

@hdore
Copy link

hdore commented Dec 3, 2021

Hello,

Thank you for developing panaroo, it's great to have a tool to explore the pangenome as a graph!

I was exploring a graph made from 16 whole bacterial genomes and I'm having troubles interpreting some of the graph features.

In the example below, the 3 highlighted nodes all have a size of 16, with a single copy of each of these genes in each genome of my dataset. So I don't understand how there can be 2 paths going from galE to group_1247 (galE>1247 and galE>3811>1247).

image

Thank you for your help,

hdore

@hdore
Copy link
Author

hdore commented Dec 3, 2021

Hello,
I think I found a possible explanation: one of the genes in group_3811 is a refound gene. The edge galE>1247 is supported by only 1 genome, which corresponds to the genome with the refound gene.
Thus it seems that the edges in the graph are not updated with the refound genes. Is there a reason behind this? I find it quite misleading, in particular when looking at potential structural variants, since we are loosing the power of genes refinding from Panaroo.

Best,

hdore

@hdore hdore changed the title Need help interpreting the graph Edges in the graph are not updated with refound genes Dec 3, 2021
@gtonkinhill
Copy link
Owner

Hi,

Yes, currently the edges are not updated to reflect refound genes. This is a bug (originally reported in #83) that I'm hoping to fix in future versions. It does not impact on the building of the graph and presence/absence matrices but can make interpretation more difficult. I will try and patch this soon.

@gtonkinhill
Copy link
Owner

I will close this as it is a duplicate of #83 but will try get to it soon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants