-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
the parameters to build pangenome for metagenomes #337
Comments
Hi @Wenhai-Zhang, let me try to answer some of your questions, the rest I leave to @AndreaGuarracino.
|
@subwaystation Thank you for quick response and I'm sorry to have time to reply to you now. And thank you for your correct and I'm learning how to express myself better in English.
I know, but as mentioned above, in metagenomes, the ANI of strains from the same specie (such as E.coli) is must greater than 95. That's why I initially set
Thank you very much for sharing the file. This seems not friendly for general server setups.
|
Hi, I ran pggb to build pangenomes for 10 bacterial genomes (Ecoli). The command is
pggb -i merged.fa.gz -o 10n95p -t 20 -p 95 -n 10 -S -m
and it can work well. But i have some confused with the parameters though i have seen the examples in github.I think
-p
is important for me to build pangenomes. I would like to share my simple understanding and i don't know whether it is right. If I set the-p
parameter to 95, and two segments are aligned with a percent identity of 98, which is higer than 95, does this mean that in this region, one of the segments is randomly retained as the unique segment? I would like to understand how it actually work.In metagenomes, ANI 95 is used by default to distinguish different species, and ANI 99.9 is used to distinguish different strains. If my previous understand is right, when I build pangenomes for strains in the same species, I want to retain more information, should I set stricter parameter? For example,
-p 98
or-p 99
.Is my command set suitable to build the pangenome for the same species?
By the way, have you tried to build pangenomes for a few thousands of bacterial genomes? I would like to know how long it will take approximately?
Perhaps my questions are a bit annoying, but I really need your help.
Thank you in advance.
The text was updated successfully, but these errors were encountered: