Skip to content

Pipe-A-line

Pre-release
Pre-release
Compare
Choose a tag to compare
@matthewfallan matthewfallan released this 06 Sep 16:50
· 964 commits to main since this release

What's new in 0.8.0?

  • Align now generates CRAM files with minimal headers instead of BAM files with full headers so that large FASTA files and large FASTQ files eventually require less storage space.
  • Align has been re-implemented as two shell pipelines instead of as a series of separate commands glued together with Python, to make it run faster and require less storage of temporary files.
  • A new function for parsing only the names of references in FASTA files (if the sequences are not needed) is based on grep and runs several times faster on large files than does the Python-based function for parsing both names and sequences.
  • The ambiguous nucleotide "N" is now supported in both reference and read sequences (previously, neither).
  • Unit tests have been updated to handle N in DNA and RNA sequences.

Full Changelog: v0.7.1...v0.8.0