Automatically generate and cache minimap2 indexes to eliminate redundant indexing overhead #39

bede · 2024-06-14T16:02:42Z

When initially implementing long read support, I was unable to demonstrate a significant reduction in execution time from using a prebuilt index versus recreating the index from scratch every time hostile clean is called. A prebuilt index was only marginally quicker and not worth the complexity of managing indexes. However, I recently retested this and observed that running hostile clean on a small long read FASTQ drops from ~45s to ~7s when a precomputed index is used.

This behaviour should first be characterised and verified on Linux and macOS. Assuming the performance benefits are replicated on both OSs, invisible (but suitably logged) index caching and reuse should be added unless a good reason not to do so becomes apparent.

This will dramatically reduce execution time when processing many long read samples, where the redundant indexing overhead is a nuisance.
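To make the proposal concrete, here is a minimal sketch of build-once, reuse-afterwards index caching. It is not hostile's implementation: the function names, the cache file naming scheme, the `map-ont` default preset, and the mtime-based staleness check are all assumptions made for illustration. The only minimap2 behaviour relied on is that `minimap2 -x PRESET -d INDEX.mmi REF.fa` writes an index to disk. The preset is part of the cached filename because minimap2 indexing parameters depend on it.

```python
import logging
import subprocess
from pathlib import Path


def index_path_for(reference: Path, cache_dir: Path, preset: str = "map-ont") -> Path:
    """Hypothetical naming scheme: one cached .mmi per reference and preset."""
    return cache_dir / f"{reference.name}.{preset}.mmi"


def build_index_command(reference: Path, index: Path, preset: str = "map-ont") -> list[str]:
    """Command line that asks minimap2 to write an index file for the reference."""
    return ["minimap2", "-x", preset, "-d", str(index), str(reference)]


def ensure_index(
    reference: Path,
    cache_dir: Path,
    preset: str = "map-ont",
    run=subprocess.run,  # injectable so the caching logic can be tested without minimap2
) -> tuple[Path, bool]:
    """Return (index_path, reused). Build the index only if it is missing or stale."""
    index = index_path_for(reference, cache_dir, preset)
    # Reuse the cached index when it exists and is at least as new as the reference.
    if index.exists() and index.stat().st_mtime >= reference.stat().st_mtime:
        logging.info("Reusing cached minimap2 index %s", index)
        return index, True
    cache_dir.mkdir(parents=True, exist_ok=True)
    logging.info("Building minimap2 index %s", index)
    run(build_index_command(reference, index, preset), check=True)
    return index, False
```

The returned path can then be passed to minimap2 in place of the FASTA reference. Logging both branches keeps the caching invisible to the user but still traceable, as suggested above.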

bede · 2024-12-16T18:14:39Z

Merged into main, pending release

bede · 2024-12-19T17:31:47Z

Released in 2.0.0

bede mentioned this issue Jul 16, 2024

Support accepting stdin instead of a specific filepath for single ended data #35

Closed

bede added the enhancement label Sep 9, 2024

bede closed this as completed Dec 19, 2024
