Path id and clustering optimization #6

Merged
merged 8 commits into from
Jul 14, 2020

jonassibbesen
Owner

The following PR contains these major changes:

  • Path ids are now located during inference instead of when parsing the alignment paths. With this change, the ids are no longer all stored in memory at the same time, which decreases overall memory usage significantly.
  • Inference path clusters are now inferred from the paths by default rather than from the reads. Read-based clustering is only needed if multi-maps are used, which is currently not supported. Furthermore, the clustering is now multi-threaded.
  • Probabilities are now collapsed during matrix construction. This reduces peak memory for very large clusters.
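
To illustrate the last point, here is a minimal Python sketch of collapsing probabilities while the matrix is being built. It is not the code in this PR: the function name `collapse_read_probs`, the `precision` parameter, and the row layout (one row of per-path probabilities per read) are all assumptions made for illustration. The idea shown is that identical rows are merged into a single row with a count as they arrive, so the full uncollapsed matrix never has to exist in memory.

```python
from collections import Counter


def collapse_read_probs(read_probs, precision=6):
    """Merge identical probability rows while streaming over reads.

    read_probs: iterable of per-read rows, one probability per path
                (hypothetical layout, assumed for this sketch).
    precision:  number of decimals used when comparing rows
                (hypothetical parameter).
    Returns (rows, counts): the unique rows in first-seen order and
    how many reads produced each of them.
    """
    counts = Counter()
    for row in read_probs:
        # Round so that rows differing only by float noise collapse together.
        key = tuple(round(p, precision) for p in row)
        counts[key] += 1
    rows = list(counts.keys())
    return rows, [counts[r] for r in rows]


# Three reads over two paths; the first two reads have identical rows.
reads = iter([[0.5, 0.5], [0.5, 0.5], [1.0, 0.0]])
rows, counts = collapse_read_probs(reads)
```

Because the input is consumed as an iterator, only the unique rows and their counts are held at any time, which is where the peak memory saving for large clusters would come from under these assumptions.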
