Tools to identify and compare CRISPR arrays.
Now published at the CRISPR Journal (https://doi.org/10.1089/crispr.2022.0080)
CCTK is a collection of tools written in Python3 that are focused on the comparison of CRISPR arrays that share spacers with one another. CCTK includes two main tools: CRISPRdiff and CRISPRtree which can be used to visualize and analyze the relationships between CRISPR arrays. In addition, CCTK includes two scripts to identify CRISPR arrays genome assemblies and a script to generate a network representation of array relationships. The flow of data between CCTK tools is shown below.
CCTK tools:
cctk blast
andcctk minced
Identify CRISPR arrays in assemblies using a method based on MinCED or BLAST.cctk network
Represent spacers shared between arrays as a network.cctk crisprdiff
Visualize alignment of homologous arrays.cctk crisprtree
Infer maximum parsimony tree explaining array relationships.cctk constrain
Assess whether CRISPR array relationships support phylogenetic relationships inferred based on other data (e.g. MLST)cctk spacerblast
Identify targets of CRISPR spacers in a BLAST database: assess whole length of imperfect spacer-protospacer matches and check for presence of protospacer adjacent motif (PAM).
Documentation can be found at Read the docs
CCTK is a collection of scripts that designed to be run from the command line. Installation is as simple as downloading this repository to your computer and running the scripts in the terminal.
CCTK has a number of dependencies to function. These are listed in the Dependencies section below.
Conda can be used to install CCTK and all of its dependencies easily.
The latest version of CCTK can be installed from Anaconda using:
conda install -c bioconda cctk
To download this repository using git:
git clone https://github.com/Alan-Collins/CRISPR_comparison_toolkit.git
N.B. some dependencies are only required for certain tools. If you are only using CRISPRtree and CRISPRdiff then you only need python3 with matplotlib and dendropy.