Pipe-A-line
Pre-release
Pre-release
What's new in 0.8.0?
- Align now generates CRAM files with minimal headers instead of BAM files with full headers so that large FASTA files and large FASTQ files eventually require less storage space.
- Align has been re-implemented as two shell pipelines instead of as a series of separate commands glued together with Python, to make it run faster and require less storage of temporary files.
- A new function for parsing only the names of references in FASTA files (if the sequences are not needed) is based on grep and runs several times faster on large files than does the Python-based function for parsing both names and sequences.
- The ambiguous nucleotide "N" is now supported in both reference and read sequences (previously, neither).
- Unit tests have been updated to handle N in DNA and RNA sequences.
Full Changelog: v0.7.1...v0.8.0