Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2.3.2 patch release #703

Merged
merged 167 commits into from
Mar 16, 2021
Merged
Show file tree
Hide file tree
Changes from 161 commits
Commits
Show all changes
167 commits
Select commit Hold shift + click to select a range
b71767d
Merge pull request #10 from nf-core/dev
jfy133 Jan 14, 2021
a6e356e
Post-release verison bump to dev
jfy133 Jan 14, 2021
a548d90
update minimum Nextflow version
jfy133 Jan 14, 2021
5e86217
Update CI minimum version
jfy133 Jan 14, 2021
53f95f4
Merge pull request #662 from jfy133/dev
apeltzer Jan 14, 2021
97d1353
Merge pull request #663 from jfy133/min-nf-bump
apeltzer Jan 14, 2021
7d198d8
Document that `skip_collapse` turns on paired-end alignment.
charles-plessy Jan 15, 2021
87254a5
Fix file staging for print_nuclear_contamination
IdoBar Jan 19, 2021
c7a1ebc
Fix file staging for print_nuclear_contamination
IdoBar Jan 19, 2021
44869f1
Merge branch 'patch-3' of https://github.com/IdoBar/eager into patch-3
IdoBar Jan 19, 2021
0968111
Merge branch 'dev' into patch-3
apeltzer Jan 19, 2021
e551d76
Correct line spacing in CHANGELOG.md
IdoBar Jan 20, 2021
35c2fd0
Fix logo
jfy133 Jan 22, 2021
78bed39
Add ash to contribtuors for all thebug reports
jfy133 Jan 22, 2021
628e815
Remove ignore system for MultiQC
jfy133 Jan 24, 2021
4521597
Update CHANGELOG.md
jfy133 Jan 24, 2021
b6a17f6
Merge branch 'dev' into fix-mqc-retry
jfy133 Jan 25, 2021
309ab22
Allow qualimap to ocntinue if one BAM is empty
jfy133 Jan 25, 2021
e950066
Update CHANGELOG.md
apeltzer Jan 25, 2021
1f7e7a2
Merge pull request #670 from nf-core/fix-mqc-retry
jfy133 Jan 26, 2021
24f05c5
Merge pull request #669 from nf-core/readme-logo
jfy133 Jan 26, 2021
7cb739a
Update README.md
jfy133 Jan 26, 2021
8801b5e
Merge branch 'dev' into patch-5
jfy133 Jan 26, 2021
1a3e373
Update nextflow_schema.json
jfy133 Jan 26, 2021
361b445
Update nextflow_schema.json
jfy133 Jan 26, 2021
a837265
Update CHANGELOG.md
jfy133 Jan 26, 2021
1757dd2
Update README.md
apeltzer Jan 26, 2021
966f7e0
Correct file staging for nuclear contamination process
IdoBar Jan 26, 2021
01cf3e9
Merge pull request #667 from IdoBar/patch-3
TCLamnidis Jan 27, 2021
85dbda0
Merge pull request #665 from charles-plessy/patch-5
jfy133 Jan 28, 2021
178be78
Remove java Xms from Picard tools flags
jfy133 Jan 28, 2021
5319af2
fix double sqaure bracket inconsistencies
jfy133 Jan 28, 2021
bb646f0
Make bash evaluation more strict
jfy133 Jan 29, 2021
2a980c9
Update main.nf
jfy133 Jan 29, 2021
0468dfa
fix kraken db loading fromPath
Feb 1, 2021
ee88d1c
Merge branch 'dev' of github.com:nf-core/eager into dev
Feb 1, 2021
1fd116d
update changelog
Feb 1, 2021
f8fa322
change basenaming of database
Feb 1, 2021
6724770
renaming archive files in decomp kraken
Feb 1, 2021
fea3249
handle case when files are already in $dbdir
Feb 2, 2021
d89b4cc
transforms to value channel with .first()
Feb 3, 2021
cc3ff23
Merge pull request #673 from maxibor/dev
apeltzer Feb 3, 2021
132f8b5
Linting
jfy133 Feb 4, 2021
cbaea3d
Linting
jfy133 Feb 4, 2021
a1a12b8
typo fixes
jfy133 Feb 4, 2021
466fd3f
Merge branch 'java-tool-fixes' of github.com:nf-core/eager into java-…
jfy133 Feb 4, 2021
5669222
Add fix to #626
jfy133 Feb 4, 2021
bce3e37
Fix consistency
jfy133 Feb 4, 2021
ad5d609
Merge branch 'dev' into qualimap-empty-bam-fix
jfy133 Feb 4, 2021
0307cde
Work on converting AR logic from bash to NF
jfy133 Feb 5, 2021
694afec
Adding in the first bits on that one
apeltzer Feb 5, 2021
5f31576
Check conda channels
apeltzer Feb 5, 2021
20f7d57
Fixed CI, this should work alreadyd well
apeltzer Feb 5, 2021
a041256
No more help checks, this should be fine
apeltzer Feb 7, 2021
d0a60df
Adding in enable_conda for automatic conda channel chceking :wq!
apeltzer Feb 7, 2021
fc48d9b
Proper DOI
apeltzer Feb 7, 2021
3cd8e9c
Hopefully helps with summary_parsm
apeltzer Feb 7, 2021
f464b62
Add TODO
apeltzer Feb 7, 2021
c18a049
Fixed all issues - please check
apeltzer Feb 8, 2021
6d237ed
Add extra deps required for validation of schema
apeltzer Feb 11, 2021
4c784e8
Fix issue in schema + main
apeltzer Feb 11, 2021
a2ac8eb
Forgot a piece of code
apeltzer Feb 12, 2021
a7b546f
aR debugging for eva
jfy133 Feb 14, 2021
f33a66e
move eva debuggin
jfy133 Feb 14, 2021
43dc51e
Maybe compression issue?
jfy133 Feb 14, 2021
85daee7
See if preserve5p bash works at eva
jfy133 Feb 15, 2021
5c2de88
Update nextflow.config
jfy133 Feb 15, 2021
d00611f
Add docs about igenomes_ignore being turned on
jfy133 Feb 15, 2021
dbdd83b
Update CHANGELOG.md
jfy133 Feb 15, 2021
ebeff6f
Linting
jfy133 Feb 15, 2021
b6d4928
Finished adapterremoval bash removal
jfy133 Feb 15, 2021
a90c41b
Update changelog
jfy133 Feb 15, 2021
01658f1
Remove accidental copy-pasta typo
jfy133 Feb 15, 2021
ead4089
Sync print_nuc_contam with latest dev
jfy133 Feb 15, 2021
c4a6962
Merge pull request #684 from nf-core/igenomes-ignore
apeltzer Feb 15, 2021
c49cf6d
Switching over to newest new schema lib
apeltzer Feb 15, 2021
642cb00
Merge branch 'dev' into qualimap-empty-bam-fix
apeltzer Feb 16, 2021
081fa26
Merge pull request #678 from nf-core/qualimap-empty-bam-fix
apeltzer Feb 16, 2021
f773b5c
Merge branch 'dev' into empty-tsv-column
apeltzer Feb 16, 2021
0a732cf
Merge pull request #677 from nf-core/empty-tsv-column
apeltzer Feb 16, 2021
cf80f35
Adjusted to latest version in tools
apeltzer Feb 18, 2021
045734a
Fix for log_colours
apeltzer Feb 18, 2021
8fb2abc
Remove bash conditions from dedup
jfy133 Feb 19, 2021
522fcc3
Added exit code 140
jfy133 Feb 20, 2021
d4ccfd6
Remove dedup/markdup/sexdet and samtools filter bash
jfy133 Feb 22, 2021
268ed0f
Adding Kraken2 unique kmer counting
maxibor Feb 22, 2021
8d76dd7
update changelog
maxibor Feb 22, 2021
ee049bf
remove db channel from kraken compatibility process
maxibor Feb 22, 2021
63128b4
Adding Kraken2 unique kmer counting
maxibor Feb 22, 2021
b40b6c7
Merge branch 'dev' of github.com:maxibor/eager into dev
maxibor Feb 22, 2021
480f6ac
Merge pull request #686 from nf-core/add-exit-code
jfy133 Feb 25, 2021
35ac970
Merge branch 'dev' into java-tool-fixes
jfy133 Feb 25, 2021
82d4df3
Linting
jfy133 Feb 25, 2021
f73d74f
Adjust checks to only contain generic stuff
apeltzer Feb 25, 2021
b845ba6
Add in headers
apeltzer Feb 25, 2021
83b1e5d
Add in headers
apeltzer Feb 25, 2021
b19640c
Replace main.nf header equals to dashes to make merge conflicts easier
jfy133 Feb 25, 2021
eec51b8
Fix nextflow config and schema inconsistencies
jfy133 Feb 25, 2021
f9fd270
Readd previously deleted stuff
jfy133 Feb 25, 2021
1461dfd
Add previously removed CI test
jfy133 Feb 25, 2021
6936fc5
Make samtools filter output dir consistent
jfy133 Feb 26, 2021
2161fff
Fix AR tests
jfy133 Feb 26, 2021
8289659
Remove ARFixPrefix to PIGZ pipe
jfy133 Mar 1, 2021
d6beb69
Remove debugging code
jfy133 Mar 1, 2021
8a5b235
Add Xmx to ARFixPrefix
jfy133 Mar 1, 2021
c4a66fe
And undo...
jfy133 Mar 1, 2021
0f9f3f2
Try limiting ARFixPrefix memory as it's small
jfy133 Mar 1, 2021
49d01b8
Increase memory for AR FixPrefix
jfy133 Mar 1, 2021
14e62f8
Try different strategy
jfy133 Mar 1, 2021
196759e
Remove hardcoded ARFiXPrefix memory as issue was SGE/JAVA specific
jfy133 Mar 3, 2021
0e46356
Merge pull request #685 from nf-core/java-tool-fixes
jfy133 Mar 6, 2021
5ef6d81
Remove unnecesary picard -Xms flags
jfy133 Mar 6, 2021
3e470f5
Merge pull request #689 from nf-core/remove-xms
jfy133 Mar 6, 2021
8f916b6
Apply suggestions from code review
apeltzer Mar 7, 2021
10ade27
added default -doMajorMinor 1 to output angsd beagle format + correct…
alexandregilardet Mar 8, 2021
34ee577
Update environment.yml
apeltzer Mar 8, 2021
7f003a5
Update CHANGELOG.md
apeltzer Mar 8, 2021
069267a
Changed CHANGELOG.md for #690
alexandregilardet Mar 9, 2021
178b23a
Merge branch 'dev' into master
alexandregilardet Mar 9, 2021
c0002a9
Merge pull request #691 from nf-core/bump-mqc-1.10
apeltzer Mar 9, 2021
18f286e
Temporarily specify tbb library to get funtional bowtie2
jfy133 Mar 9, 2021
c156780
Fix version
jfy133 Mar 9, 2021
2185f79
corrected CHANGELOG.md dev section
alexandregilardet Mar 10, 2021
cb84882
corrected CHANGELOG.md dev section
alexandregilardet Mar 10, 2021
446fd49
Merge pull request #690 from alexandregilardet/master
jfy133 Mar 10, 2021
caeacf6
Merge pull request #692 from nf-core/jfy133-patch-1
apeltzer Mar 10, 2021
71ebc62
Add fix for broken colour chem. validation
jfy133 Mar 10, 2021
ab974e3
Update environment.yml
jfy133 Mar 11, 2021
9d8507a
Update CHANGELOG.md
jfy133 Mar 11, 2021
aed30d9
Merge pull request #696 from nf-core/mqc-hops-fix
jfy133 Mar 12, 2021
a17acad
Merge pull request #697 from nf-core/dev
jfy133 Mar 12, 2021
fa8e153
Guess we're good now
apeltzer Mar 12, 2021
f1c6c5b
Merge branch 'add-fancy-help-json-for-james' of https://github.com/nf…
apeltzer Mar 12, 2021
00f5782
Should be fine now
apeltzer Mar 12, 2021
06ef70d
Drop that trimmed too
apeltzer Mar 12, 2021
a5e32db
Merge pull request #680 from nf-core/add-fancy-help-json-for-james
jfy133 Mar 12, 2021
a819990
Merge pull request #694 from nf-core/colour-chem-validation
jfy133 Mar 12, 2021
ba58a69
added myself as contributor
alexandregilardet Mar 12, 2021
91b33ce
Fix tutorials due to [nextflow bug](https://github.com/nextflow-io/ne…
jfy133 Mar 12, 2021
3f5f659
Add `--qualitymax` to AdapterRemoval
jfy133 Mar 12, 2021
89d8684
Remove accidently duplicated qualitymax param in AR command
jfy133 Mar 12, 2021
c0b4796
Fix new param JSON schema type
jfy133 Mar 12, 2021
03f08d8
Merge pull request #699 from alexandregilardet/master
jfy133 Mar 12, 2021
0cd8e42
Merge pull request #698 from nf-core/docs-fix
jfy133 Mar 12, 2021
75f4205
Merge pull request #700 from nf-core/qualitymax
jfy133 Mar 12, 2021
d1015bb
Implement fix for https://github.com/nf-core/eager/issues/653
jfy133 Mar 12, 2021
5a4621b
Adjust to newest lib template
apeltzer Mar 12, 2021
c8e93b8
update changelog
jfy133 Mar 12, 2021
49e8ba9
Update documentation for sex det
jfy133 Mar 12, 2021
6f60c63
Markdown linting
jfy133 Mar 12, 2021
e083831
Merge branch 'dev' into sex-det-collision
jfy133 Mar 12, 2021
d840a0b
Second markdown linting
jfy133 Mar 12, 2021
43958a0
Merge branch 'dev' of github.com:nf-core/eager into dev
maxibor Mar 12, 2021
51681fb
update kraken process for multiqc updated kraken module
maxibor Mar 12, 2021
2a25cf7
make markdownlint happy
maxibor Mar 12, 2021
631e592
Merge pull request #701 from nf-core/sex-det-collision
jfy133 Mar 12, 2021
8133006
Merge pull request #687 from maxibor/dev
apeltzer Mar 12, 2021
05a0c22
Merge pull request #11 from nf-core/dev
jfy133 Mar 15, 2021
25308f3
Bump version for patch release
jfy133 Mar 15, 2021
eedbb9f
Update CHANGELOG.md
jfy133 Mar 15, 2021
de38b07
Merge pull request #702 from jfy133/dev
jfy133 Mar 15, 2021
db1e0f9
Update docs/usage.md
jfy133 Mar 16, 2021
65c26a2
Update nextflow_schema.json
jfy133 Mar 16, 2021
40285c5
Update nextflow_schema.json
jfy133 Mar 16, 2021
e3addc3
Updates from Code review including re-adding incorrectly removed defa…
jfy133 Mar 16, 2021
a8b4f9d
Merge branch 'dev' of github.com:nf-core/eager into dev
jfy133 Mar 16, 2021
ab63956
Add new citation
jfy133 Mar 16, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ jobs:
strategy:
matrix:
# Nextflow versions: check pipeline minimum and current latest
nxf_ver: ['20.04.0', '']
nxf_ver: ['20.07.1', '']
steps:
- name: Check out pipeline code
uses: actions/checkout@v2
Expand All @@ -34,13 +34,13 @@ jobs:

- name: Build new docker image
if: env.MATCHED_FILES
run: docker build --no-cache . -t nfcore/eager:2.3.1
run: docker build --no-cache . -t nfcore/eager:2.3.2

- name: Pull docker image
if: ${{ !env.MATCHED_FILES }}
run: |
docker pull nfcore/eager:dev
docker tag nfcore/eager:dev nfcore/eager:2.3.1
docker tag nfcore/eager:dev nfcore/eager:2.3.2

- name: Install Nextflow
env:
Expand Down
34 changes: 34 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,40 @@
The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/)
and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.html).

## [2.3.2] - 2021-03-16

### `Added`

- [#687](https://github.com/nf-core/eager/pull/687) - Adds Kraken2 unique kmer counting report
- [#676](https://github.com/nf-core/eager/issues/676) - Refactor help message / summary message formatting to automatic versions using nf-core library
- [#682](https://github.com/nf-core/eager/issues/682) - Add AdapterRemoval `--qualitymax` flag to allow FASTQ Phred score range max more than 41

### `Fixed`

- [#666](https://github.com/nf-core/eager/issues/666) - Fixed input file staging for `print_nuclear_contamination`
- [#631](https://github.com/nf-core/eager/issues/631) - Update minimum Nextflow version to 20.07.1, due to unfortunate bug in Nextflow 20.04.1 causing eager to crash if patch pulled
- Made MultiQC crash behaviour stricter when dealing with large datasets, as reported by @ashildv
- [#652](https://github.com/nf-core/eager/issues/652) - Added note to documentation that when using `--skip_collapse` this will use _paired-end_ alignment mode with mappers when using PE data
- [#626](https://github.com/nf-core/eager/issues/626) - Add additional checks to ensure pipeline will give useful error if cells of a TSV column are empty
- Added note to documentation that when using `--skip_collapse` this will use _paired-end_ alignment mode with mappers when using PE data
- [#673](https://github.com/nf-core/eager/pull/673) - Fix Kraken database loading when loading from directory instead of compressed file
- [#688](https://github.com/nf-core/eager/issues/668) - Allow pipeline to complete, even if Qualimap crashes due to an empty or corrupt BAM file for one sample/library
- [#683](https://github.com/nf-core/eager/pull/683) - Sets `--igenomes_ignore` to true by default, as rarely used by users currently and makes resolving configs less complex
- Added exit code `140` to re-tryable exit code list to account for certain scheduler wall-time limit fails
- [#672](https://github.com/nf-core/eager/issues/672) - Removed java parameter from picard tools which could cause memory issues
- [#679](https://github.com/nf-core/eager/issues/679) - Refactor within-process bash conditions to groovy/nextflow, due to incompatibility with some servers environments
- [#690](https://github.com/nf-core/eager/pull/690) - Fixed ANGSD output mode for beagle by setting `-doMajorMinor 1` as default in that case
- [#693](https://github.com/nf-core/eager/issues/693) - Fixed broken TSV input validation for the Colour Chemistry column
- [#695](https://github.com/nf-core/eager/issues/695) - Fixed incorrect `-profile` order in tutorials (originally written reversed due to [nextflow bug](https://github.com/nextflow-io/nextflow/issues/1792))
- [#653](https://github.com/nf-core/eager/issues/653) - Fixed file collision errors with sexdeterrmine for two same-named libraries with different strandedness

### `Dependencies`

- Bumped MultiQC to 1.10 for improved functionality
- Bumped HOPS to 0.35 for MultiQC 1.10 compatibility

### `Deprecated`

## [2.3.1] - 2021-01-14

### `Added`
Expand Down
4 changes: 2 additions & 2 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -7,10 +7,10 @@ COPY environment.yml /
RUN conda env create --quiet -f /environment.yml && conda clean -a

# Add conda installation dir to PATH (instead of doing 'conda activate')
ENV PATH /opt/conda/envs/nf-core-eager-2.3.1/bin:$PATH
ENV PATH /opt/conda/envs/nf-core-eager-2.3.2/bin:$PATH

# Dump the details of the installed packages to a file for posterity
RUN conda env export --name nf-core-eager-2.3.1 > nf-core-eager-2.3.1.yml
RUN conda env export --name nf-core-eager-2.3.2 > nf-core-eager-2.3.2.yml

# Instruct R processes to use these empty files instead of clashing with a local version
RUN touch .Rprofile
Expand Down
7 changes: 5 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,10 @@
# ![nf-core/eager](docs/images/nf-core-eager_logo.png)
# ![nf-core/eager](docs/images/nf-core_eager_logo.png)

**A fully reproducible and state-of-the-art ancient DNA analysis pipeline**.

[![GitHub Actions CI Status](https://github.com/nf-core/eager/workflows/nf-core%20CI/badge.svg)](https://github.com/nf-core/eager/actions)
[![GitHub Actions Linting Status](https://github.com/nf-core/eager/workflows/nf-core%20linting/badge.svg)](https://github.com/nf-core/eager/actions)
[![Nextflow](https://img.shields.io/badge/nextflow-%E2%89%A520.04.0-brightgreen.svg)](https://www.nextflow.io/)
[![Nextflow](https://img.shields.io/badge/nextflow-%E2%89%A520.07.1-brightgreen.svg)](https://www.nextflow.io/)
[![nf-core](https://img.shields.io/badge/nf--core-pipeline-brightgreen.svg)](https://nf-co.re/)
[![DOI](https://zenodo.org/badge/135918251.svg)](https://zenodo.org/badge/latestdoi/135918251)

Expand Down Expand Up @@ -158,7 +158,10 @@ of this pipeline:

Those who have provided conceptual guidance, suggestions, bug reports etc.

* [Alexandre Gilardet](https://github.com/alexandregilardet)
* Arielle Munters
* [Charles Plessy](https://github.com/charles-plessy)
* [Åshild Vågene](https://github.com/ashildv)
* [Hester van Schalkwyk](https://github.com/hesterjvs)
* [Ido Bar](https://github.com/IdoBar)
* [Irina Velsko](https://github.com/ivelsko)
Expand Down
50 changes: 36 additions & 14 deletions bin/kraken_parse.py
Original file line number Diff line number Diff line change
Expand Up @@ -19,18 +19,24 @@ def _get_args():
default=50,
help="Minimum number of hits on clade to report it. Default = 50")
parser.add_argument(
'-o',
dest="output",
'-or',
dest="readout",
default=None,
help="Output file. Default = <basename>.kraken_parsed.csv")
help="Read count output file. Default = <basename>.read_kraken_parsed.csv")
parser.add_argument(
'-ok',
dest="kmerout",
default=None,
help="Kmer Output file. Default = <basename>.kmer_kraken_parsed.csv")

args = parser.parse_args()

infile = args.krakenReport
countlim = int(args.count)
outfile = args.output
readout = args.readout
kmerout = args.kmerout

return(infile, countlim, outfile)
return(infile, countlim, readout, kmerout)


def _get_basename(file_name):
Expand All @@ -51,14 +57,23 @@ def parse_kraken(infile, countlim):

'''
with open(infile, 'r') as f:
resdict = {}
read_dict = {}
kmer_dict = {}
csvreader = csv.reader(f, delimiter='\t')
for line in csvreader:
reads = int(line[1])
if reads >= countlim:
taxid = line[4]
resdict[taxid] = reads
return(resdict)
taxid = line[6]
kmer = line[3]
unique_kmer = line[4]
try:
kmer_duplicity = float(kmer)/float(unique_kmer)
except ZeroDivisionError:
kmer_duplicity = 0
read_dict[taxid] = reads
kmer_dict[taxid] = kmer_duplicity

return(read_dict, kmer_dict)


def write_output(resdict, infile, outfile):
Expand All @@ -70,10 +85,17 @@ def write_output(resdict, infile, outfile):


if __name__ == '__main__':
INFILE, COUNTLIM, outfile = _get_args()
INFILE, COUNTLIM, readout, kmerout = _get_args()

if not outfile:
outfile = _get_basename(INFILE)+".kraken_parsed.csv"
if not readout:
read_outfile = _get_basename(INFILE)+".read_kraken_parsed.csv"
else:
read_outfile = readout
if not kmerout:
kmer_outfile = _get_basename(INFILE)+".kmer_kraken_parsed.csv"
else:
kmer_outfile = kmerout

tmp_dict = parse_kraken(infile=INFILE, countlim=COUNTLIM)
write_output(resdict=tmp_dict, infile=INFILE, outfile=outfile)
read_dict, kmer_dict = parse_kraken(infile=INFILE, countlim=COUNTLIM)
write_output(resdict=read_dict, infile=INFILE, outfile=read_outfile)
write_output(resdict=kmer_dict, infile=INFILE, outfile=kmer_outfile)
33 changes: 21 additions & 12 deletions bin/merge_kraken_res.py
Original file line number Diff line number Diff line change
Expand Up @@ -15,21 +15,29 @@ def _get_args():
formatter_class=argparse.RawDescriptionHelpFormatter,
description='Merging csv count files in one table')
parser.add_argument(
'-o',
dest="output",
default="kraken_count_table.csv",
help="Output file. Default = kraken_count_table.csv")
'-or',
dest="readout",
default="kraken_read_count_table.csv",
help="Read count output file. Default = kraken_read_count_table.csv")
parser.add_argument(
'-ok',
dest="kmerout",
default="kraken_kmer_unicity_table.csv",
help="Kmer unicity output file. Default = kraken_kmer_unicity_table.csv")

args = parser.parse_args()

outfile = args.output
readout = args.readout
kmerout = args.kmerout

return(outfile)
return(readout, kmerout)


def get_csv():
tmp = [i for i in os.listdir() if ".csv" in i]
return(tmp)
kmer = [i for i in tmp if '.kmer_' in i]
read = [i for i in tmp if '.read_' in i]
return(read, kmer)


def _get_basename(file_name):
Expand All @@ -54,8 +62,9 @@ def write_csv(pd_dataframe, outfile):


if __name__ == "__main__":
OUTFILE = _get_args()
all_csv = get_csv()
resdf = merge_csv(all_csv)
write_csv(resdf, OUTFILE)
print(resdf)
READOUT, KMEROUT = _get_args()
reads, kmers = get_csv()
read_df = merge_csv(reads)
kmer_df = merge_csv(kmers)
write_csv(read_df, READOUT)
write_csv(kmer_df, KMEROUT)
18 changes: 7 additions & 11 deletions conf/base.config
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@ process {
memory = { check_max( 7.GB * task.attempt, 'memory' ) }
time = { check_max( 24.h * task.attempt, 'time' ) }

errorStrategy = { task.exitStatus in [143,137,104,134,139] ? 'retry' : 'finish' }
errorStrategy = { task.exitStatus in [143,137,104,134,139, 140] ? 'retry' : 'finish' }
maxRetries = 3
maxErrors = '-1'

Expand Down Expand Up @@ -74,38 +74,34 @@ process {
}

withName:qualimap{
errorStrategy = { task.exitStatus in [1,143,137,104,134,139] ? 'retry' : 'finish' }
errorStrategy = { task.exitStatus in [1,143,137,104,134,139, 140] ? 'retry' : task.exitStatus in [255] ? 'ignore' : 'finish' }
}

withName:preseq {
errorStrategy = 'ignore'
}

withName:damageprofiler {
errorStrategy = { task.exitStatus in [1,143,137,104,134,139] ? 'retry' : 'finish' }
errorStrategy = { task.exitStatus in [1,143,137,104,134,139, 140] ? 'retry' : 'finish' }
}

// Add 1 retry for certain java tools as not enough heap space java errors gives exit code 1
withName: dedup {
errorStrategy = { task.exitStatus in [1,143,137,104,134,139] ? 'retry' : 'finish' }
errorStrategy = { task.exitStatus in [1,143,137,104,134,139, 140] ? 'retry' : 'finish' }
}

withName: markduplicates {
errorStrategy = { task.exitStatus in [143,137] ? 'retry' : 'finish' }
errorStrategy = { task.exitStatus in [143,137, 140] ? 'retry' : 'finish' }
}

// Add 1 retry as not enough heapspace java error gives exit code 1
withName: malt {
errorStrategy = { task.exitStatus in [1,143,137,104,134,139] ? 'retry' : 'finish' }
errorStrategy = { task.exitStatus in [1,143,137,104,134,139, 140] ? 'retry' : 'finish' }
}

// other process specific exit statuses
withName: nuclear_contamination {
errorStrategy = { task.exitStatus in [143,137,104,134,139] ? 'ignore' : 'retry' }
}

withName: multiqc {
errorStrategy = { task.exitStatus in [143,137] ? 'retry' : 'ignore' }
errorStrategy = { task.exitStatus in [143,137,104,134,139, 140] ? 'ignore' : 'retry' }
}

}
Expand Down
Binary file modified docs/images/tutorials/profiles/config_profile_inheritence.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
18 changes: 9 additions & 9 deletions docs/images/tutorials/profiles/config_profile_inheritence.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
10 changes: 9 additions & 1 deletion docs/output.md
Original file line number Diff line number Diff line change
Expand Up @@ -551,6 +551,8 @@ Note that many of the statistics from this module are displayed in the General S

You will receive output for each *sample*. This means you will statistics of deduplicated values of all types of libraries combined in a single value (i.e. non-UDG treated, full-UDG, paired-end, single-end all together).

:warning: If your library has no reads mapping to the reference, this will result in an empty BAM file. Qualimap will therefore not produce any output even if a BAM exists!

#### Coverage Histogram

This plot shows on the Y axis the range of fold coverages that the bases of the reference genome are possibly covered by. The Y axis shows the number of bases that were covered at the given fold coverage depth as indicated on the Y axis.
Expand Down Expand Up @@ -598,6 +600,8 @@ Sex.DetERRmine calculates the coverage of your mapped reads on the X and Y chrom

When a bedfile of specific sites is provided, Sex.DetERRmine additionally calculates error bars around each relative coverage estimate. For this estimate to be trustworthy, the sites included in the bedfile should be spaced apart enough that a single sequencing read cannot overlap multiple sites. Hence, when a bedfile has not been provided, this error should be ignored. When a suitable bedfile is provided, each observation of a covered site is independent, and the error around the coverage is equal to the binomial error estimate. This error is then propagated during the calculation of relative coverage for the X and Y chromosomes.

> Note that in nf-core/eager this will be run on single- and double-stranded variants of the same library _separately_. This can also help assess for differential contamination between libraries.

#### Relative Coverage

Theoretically, males are expected to cluster around (0.5, 0.5) in the produced scatter plot, while females are expected to cluster around (1.0, 0.0). In practice, when analysing ancient DNA, these relative coverage on both axes is slightly lower than expected, and individuals can cluster around (0.45, 0.45) and (0.85, 0.05). As the number of covered sites for an individual gets smaller, the confidence on the estimate becomes lower, because it is increasingly more likely to be affected by randomness in the preservation and sequencing of ancient DNA.
Expand Down Expand Up @@ -667,7 +671,11 @@ Each module has it's own output directory which sit alongside the `MultiQC/` dir
- `metagenomic_complexity_filter` - this contains the output from filtering of input reads to metagenomic classification of low-sequence complexity reads as performed by `bbduk`. This will include the filtered FASTQ files (`*_lowcomplexityremoved.fq.gz`) and also the run-time log (`_bbduk.stats`) for each sample. **Note:** there are no sections in the MultiQC report for this module, therefore you must check the `._bbduk.stats` files to get summary statistics of the filtering.
- `metagenomic_classification/` - this contains the output for a given metagenomic classifier.
- Running MALT will contain RMA6 files that can be loaded into MEGAN6 or MaltExtract for phylogenetic visualisation of read taxonomic assignments and aDNA characteristics respectively. Additional a `malt.log` file is provided which gives additional information such as run-time, memory usage and per-sample statistics of numbers of alignments with taxonomic assignment etc. This will also include gzip SAM files if requested.
- Running kraken will contain the Kraken output and report files, as well as a merged Taxon count table.
- Running kraken will contain the Kraken output and report files, as well as a merged Taxon count table. You will also get a Kraken kmer duplication table, in a [KrakenUniq](https://genomebiology.biomedcentral.com/articles/10.1186/s13059-018-1568-0) fashion. This is very useful to check for breadth of coverage and detect read stacking. A small number of aligned reads (low coverage) and a kmer duplication >1 is usually a sign of read stacking, usually indicative of a false positive hit (e.g. from over-amplified libraries). *Kmer duplication is defined as: number of kmers / number of unique kmers*. You will find two kraken reports formats available:
- the `*.kreport` which is the old report format, without distinct minimizer count information, used by some tools such as [Pavian](https://github.com/fbreitwieser/pavian)
- the `*.kraken2_report` which is the new kraken report format, with the distinct minimizer count information.

Finally, the `*.kraken.out` file are the direct output of Kraken2
- `maltextract/` - this contains a `results` directory in which contains the output from MaltExtract - typically one folder for each filter type, an error and a log file. The characteristics of each node (e.g. damage, read lengths, edit distances - each in different txt formats) can be seen in each sub-folder of the filter folders. Output can be visualised either with the [HOPS postprocessing script](https://github.com/rhuebler/HOPS) or [MEx-IPA](https://github.com/jfy133/MEx-IPA)
- `consensus_sequence/` - this contains three FASTA files from VCF2Genome of a consensus sequence based on the reference FASTA with each sample's unique modifications. The main FASTA is a standard file with bases not passing the specified thresholds as Ns. The two other FASTAS (`_refmod.fasta.gz`) and (`_uncertainity.fasta.gz`) are IUPAC uncertainty codes (rather than Ns) and a special number-based uncertainty system used for other downstream tools, respectively.
- `librarymerged_bams/` - these contain the final BAM files that would go into genotyping (if genotyping is turned on). This means the files will contain all libraries of a given sample (including trimmed non-UDG or half-UDG treated libraries, if BAM trimming turned on)
Loading