Skip to content

seqan/iGenVar

Repository files navigation

iGenVar

Build Status

The official repository for the iGenVar project.

iGenVar is intended to be a caller for all types of genetic variation: SNPs, indels and larger structural variations (insertions, deletions, inversions, translocations, CNVs, nested SVs). It uses both Illumina short reads and PacBio long reads for this purpose.

David Heller in the Vingron lab of the MPI-MG and Tim White in the Kehr lab at BIH have both developed an SV caller for long read sequencing data. Instead of competing with each other, we want to join forces and combine the two tools, SVIM and SVIRL, into one better and more versatile tool. On the other hand, there were some tool developments in the Reinert Lab (FU Berlin): Vaquita, a short read SV caller, SViper, a refinement tool and Vaquita-LR a further development of Vaquita for long reads. We want to combine these approaches and use the SeqAn3 library as a basis for this new tool.

Current status:

We can call insertions and deletions from long read data (SVIM methods implemented). For more information, see the release plan at the bottom of the page.

Installation

Instructions:

  1. clone this repository: git clone --recurse-submodules https://github.com/seqan/iGenVar.git or git clone https://github.com/seqan/iGenVar.git and fetch the seqan3 submodule after cloning: git submodule update --recursive --init
  2. create a build directory and visit it: mkdir build && cd build
  3. run cmake: cmake ../iGenVar
  4. build the application: make
  5. optional: build and run the tests: make test or ctest
  6. optional: build the api documentation: make doc
  7. execute the app: ./bin/iGenVar

(Built using the SeqAn3 App Template)

We created small examples, which you can use to test our app:

./bin/iGenVar -i ./test/data/paired_end_mini_example.sam -j ./test/data/single_end_mini_example.sam \
-o ./test/data/output.vcf --method cigar_string --method split_read --min_var_length 5

Release plan:

Dependencies

We support gcc/g++ 10 and higher.