Cactus 2.5.0 2023-04-03
Cactus 2.5.0 is available in the following forms:
- Docker Image:
quay.io/comparative-genomics-toolkit/cactus:v2.5.0
GPU-accelerated Docker Image:quay.io/comparative-genomics-toolkit/cactus:v2.5.0-gpu
Install instructions in README.md - Pre-compiled Binaries Linux Tarball: cactus-bin-v2.5.0.tar.gz
Install instructions in BIN-INSTALL.md - Pre-compiled Binaries For Older CPU Architectures Linux Tarball: cactus-bin-legacy-v2.5.0.tar.gz
Install instructions in BIN-INSTALL.md - Source Tarball: cactus-v2.5.0.tar.gz
Install instructions in README.md
WARNING: do not use the github automatically generated source files (Source code (zip)
or Source code (tar.gz)
), these are not correct.
The Docker images and binaries linked above are built using AVX2 extensions, and require a CPU that supports them, except the "Pre-compiled Binaries For Older CPU Architectures" which should be compatible with any 64-bit architecture (and, since version 2.3.1, support Cactus's pangenome pipeline).
Please subscribe to the cactus-announce low-volume mailing list to receive notice of Cactus release.
Release Notes
This Release greatly simplifies the interface for building pangenomes
- Introduction of
cactus-pangenome
command that, likecactus
for the progressive aligner, runs the whole pipeline in one shot. Its inputs are a list of fasta sequences and sample names and it outputs the pangenome graph and various indexes, vcf, etc. Intermediate outputs are exported at the end of each stage, so low-level commands can be used to repeat or continue the workflow. #
characters in fasta contig names no longer need to be cleaned out with special invocation ofcactus-preprocess
to avoid conflicts withvg
indexes. This now happens automatically within the pipeline.- The lower level pangenome commands are all still supported and documented. But
cactus-align-batch
is now deprecated. This means it is removed from the documentation (except when describing older results) and from continuous integration tests, and it will give a warning when used. Users should now just runcactus-align
instead ofcactus-align-batch
where applicable, using the former's--batch
option.cactus-align-batch
was a hack required to scale up before the cactus v2.0 rewrite but used nested Toil workflows and needed to go. - Documentation updated to focus on the simpler interface
GFAffix
updated to fix a rare crashcactus-graphmap-join
(andcactus-pangenome
) will not fail in the event a VCF indexing error (ex from chromosomes that are too long fortbi
). Instead it will give a warning and produce no index.- New
cactus-hal2maf
option--keepGapCausingDupes
is changed to--filterGapCausingDupes
and turned off by default. The underlying code has bugs that cause problems on certain datasets, and is not ready to be activated by default.