Whether the number of contigs affects the efficiency of BISER #35

Open
life404 opened this issue Dec 21, 2023 · 1 comment

life404 commented Dec 21, 2023

Hi,
First, thank you for your tools. I ran BISER on a genome assembly with 25 chromosome-level scaffolds and 1983 short contigs (the longest contig is ~300 kb and the shortest is ~2 kb). BISER ran for a long time (> 2 days) and used up all the memory (> 1 TB). However, it takes very little time on another genome with only 194 contigs.

So I tried using SEDEF to analyze the genome with the high number of contigs. I observed that SEDEF also needed a substantial amount of time to process the short contigs. For instance, after performing SEDEF translation, group 22 (consisting of more than 800 contigs) and group 23 (with over 300 contigs) each took more than 48 hours to finish processing.

Will a large number of short contigs significantly increase runtime? Should I filter out shorter contigs (e.g., < 10kb)? Could you please provide some suggestions?

Thank You
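
For reference, the length filter asked about above could be sketched roughly as follows. This is only a minimal illustration in plain Python, not part of BISER or SEDEF; the function names and the 10 kb default cutoff are assumptions taken from the question, and whether dropping short contigs actually improves runtime is not confirmed in this thread.

```python
def read_fasta(handle):
    """Yield (header, sequence) pairs from an open FASTA text handle."""
    header, chunks = None, []
    for line in handle:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_by_length(records, min_len=10_000):
    """Keep only records whose sequence is at least min_len bases long.

    min_len=10_000 mirrors the hypothetical 10 kb cutoff from the question.
    """
    for header, seq in records:
        if len(seq) >= min_len:
            yield header, seq


def write_fasta(records, handle, width=60):
    """Write records as FASTA, wrapping sequence lines at `width` bases."""
    for header, seq in records:
        handle.write(f">{header}\n")
        for i in range(0, len(seq), width):
            handle.write(seq[i:i + width] + "\n")
```

With real files this would be used as `write_fasta(filter_by_length(read_fasta(src)), dst)`, where `src` and `dst` are open handles for the input and output FASTA files. Case is preserved, so any existing soft-masking survives the filter.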

@inumanag
Contributor

Hi @life404

How are you running BISER? Are your contigs soft-masked with RepeatMasker?
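
One quick way to check the soft-masking question is to measure how much of each contig is lowercase, since soft-masked FASTA conventionally marks repeats in lowercase. The sketch below is a minimal, hypothetical helper (the names `softmasked_fraction` and `masking_report` are illustrative and not part of BISER or RepeatMasker); it assumes lowercase means masked and ignores `N`/`n` gap bases.

```python
def softmasked_fraction(seq):
    """Return the fraction of non-N bases that are lowercase (soft-masked)."""
    total = sum(1 for c in seq if c not in "Nn")
    masked = sum(1 for c in seq if c.islower() and c != "n")
    return masked / total if total else 0.0


def masking_report(lines):
    """Yield (name, length, masked_fraction) for each record in FASTA lines.

    `lines` is any iterable of text lines, such as an open file handle.
    """
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                seq = "".join(chunks)
                yield name, len(seq), softmasked_fraction(seq)
            # Use the first whitespace-delimited token as the contig name.
            name, chunks = line[1:].split()[0], []
        elif line:
            chunks.append(line)
    if name is not None:
        seq = "".join(chunks)
        yield name, len(seq), softmasked_fraction(seq)
```

A fraction of 0.0 across all contigs would suggest the assembly is not soft-masked at all.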
