-
Notifications
You must be signed in to change notification settings - Fork 8
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
perf: ont downsampling normalisation (#489)
* wrote new script map_bin_amp_sampler.py for 'normalizing' reducing/coverage on each amplicon to a common cutoff and started modifying rule long_read.smk to do the reuired mapping and run the script * added write out and convert of picked reads from source fastq to out-fasta * change in- and output file-extensions for all rules downstream of cap_cov_amp from fastq to fasta; changed also in get_reads_by_stage * fixed several snakemake related bugs * multiple restorations to repair broken workflow * established custom script for trimming of primers work-title: map-trim * improved coverage-capping amp_covcap_sampler.py and amplicon primer-trimming map_trim.py scripts, including reverse strand swap for trimming * added improved logic for clipping of '-' strand aligning reads * use only one map-trim to remove all, reinstated porechop; multiple adaptations and fixes in assembly.smk, common.smk, long_read.smk, qc.smk, amp_covcap_sampler.py, map_trim.py; repaired workflow and changed naming patterns and formatting for ont; medaka rule -v; get_reads_by_stage, get trimmed reads, species diversity_before_se, seqtk.yaml * remove map-trim in order to reorganize * perform all clipping and downsampling with notramp * multiple renamings and adaptions * cleanup * adjusted output canu_correct * linting * get_artic_primer * increased notramp version; reroute stdout+stderr to log in canu_correct * restored get_artic_primers to get the bed file * get_output_dir comments * restore sample-table * envs version nums * envs conda-forge * resolving comments * updated primer paths * rmv amp_covcap_sampler.py * canu_correct: moved corr_gz from output to parameters * nanofilt accept de/compressed * change gzip to compressed * with corr_gz in output * take out bcf rules * orf * echo test * remove parenthesis * added gzip to nanofilt env and replace gunzip with gzip in rule nanofilt * add file to nanofilt env * set corMemory for testing to 6GB * corona-spades in long_read.smk * solved all issues with wrong input-files for ont de-novo assembly * cleanup and temp-notes * restored sample-table, removed echos, removed hard-coded path * resolve log issue * notramp env * undo changes in test config * undo changes in test primer * add missing version in nanofilt.yaml * add remvoed temp flags back in * rmv whitespace * update env file * add canu logging * update config * add temp * rmv hardcoded date path * add logs Co-authored-by: Thomas Battenfeld <[email protected]> Co-authored-by: Alexander Thomas <[email protected]> Co-authored-by: Thomas Battenfeld <[email protected]>
- Loading branch information
1 parent
f44942d
commit f502dd2
Showing
9 changed files
with
86 additions
and
1,343 deletions.
There are no files selected for viewing
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -3,3 +3,5 @@ channels: | |
- conda-forge | ||
dependencies: | ||
- nanofilt =2.8 | ||
- gzip =1.11 | ||
- file =5.39 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,7 @@ | ||
channels: | ||
- simakro | ||
- bioconda | ||
- conda-forge | ||
dependencies: | ||
- porechop =0.2 | ||
- python =3.9 | ||
- notramp =1.0.5 | ||
- minimap2 =2.22 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -2,4 +2,4 @@ channels: | |
- bioconda | ||
- conda-forge | ||
dependencies: | ||
- porechop =0.2 | ||
- seqtk =1.3 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters