An installation helper, perl wrapper and the R program Battenberg which detects subclonality and copy number in matched NGS data.
Master | Develop |
---|---|
This is only suitable for WGS analysis.
The Battenberg R code is maintained in a separate repository Wedge-Oxford/battenberg and this is where any questions or issues specific to the R code should be directed.
To prevent scientifi notation being used for genome coordinates, rlease ensure that your ~/.Rprofile
contains the following line:
options(scipen = 999)
There is a pre-built image containing this codebase on quay.io.
- dockstore-cgpwgs: Contains additional tools for WGS analysis.
This was primarily designed for use with dockstore.org but can be used as normal containers.
The docker images are know to work correctly after import into a singularity image.
The battenberg R files are installed automatically from the Battenberg GitHub repository found
here. The linked version is currently v2.2.5
.
Please install the following first:
Then execute:
setup.sh <install_to_folder> [X/lib/perl:Y/lib/perl]
cd Rsupport
./setupR.sh <install_to_folder>/R-libs
All of the items listed here use the same install method.
- Impute2 executables can be found here
- Any impute related data for download
- BWA Mapped, indexed, duplicate marked/removed bam files, for both a matched normal and tumour sample
- Reference.fasta and index
- A file containing a list of contigs in the reference .fai to ignore
Some required data files are not included in the distribution but a script is included to generate these for you:
- Directory containing the 1000 genomes allele and loci data:
- Generated using the included script
download_generate_bberg_ref_files.pl
- Generated using the included script
- Impute info file
impute_info.txt
- Generated using the included script
download_generate_bberg_ref_files.pl
- Generated using the included script
- Prob loci file probloci.txt
- Included:
files/probloci.txt.gz
- Included:
Additionally, the wgs_gc_correction_1000g files need to be downloaded. These can be obtained from the Battenberg R code site here.
- The chr X wgs_gc_correction_1000g will require renaming.
mv 1000_genomes_GC_corr_chr_23.txt.gz 1000_genomes_GC_corr_chr_X.txt.gz
For the most up to date usage instructions for the wrapper code please see the command line help:
battenberg.pl -h
Please check the wiki for common problems before raising any issues.
Copyright (c) 2014-2018 Genome Research Ltd.
Author: Cancer Genome Project <[email protected]>
This file is part of cgpBattenberg.
cgpBattenberg is free software: you can redistribute it and/or modify it under
the terms of the GNU Affero General Public License as published by the Free
Software Foundation; either version 3 of the License, or (at your option) any
later version.
This program is distributed in the hope that it will be useful, but WITHOUT
ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more
details.
You should have received a copy of the GNU Affero General Public License
along with this program. If not, see <http://www.gnu.org/licenses/>.
1. The usage of a range of years within a copyright statement contained within
this distribution should be interpreted as being equivalent to a list of years
including the first and last year specified and all consecutive years between
them. For example, a copyright statement that reads 'Copyright (c) 2005, 2007-
2009, 2011-2012' should be interpreted as being identical to a statement that
reads 'Copyright (c) 2005, 2007, 2008, 2009, 2011, 2012' and a copyright
statement that reads "Copyright (c) 2005-2012' should be interpreted as being
identical to a statement that reads 'Copyright (c) 2005, 2006, 2007, 2008,
2009, 2010, 2011, 2012'."