Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added sra-tools for RNASeq data download #103

Merged
merged 3 commits into from
Nov 19, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
9. Now skipping FastQC by default [#98](https://github.com/Plant-Food-Research-Open/genepal/issues/98)
10. Added an HTML report [#44](https://github.com/Plant-Food-Research-Open/genepal/issues/44)
11. Added content type as text/html for the MultiQC and genepal reports
12. Added sra-tools for RNASeq data download [#102](https://github.com/Plant-Food-Research-Open/genepal/issues/102)

### `Fixed`

Expand Down
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@
- [fasta_validator](https://github.com/linsalrob/fasta_validator): Validate genome FASTA
- [RepeatModeler](https://github.com/Dfam-consortium/RepeatModeler) or [EDTA](https://github.com/oushujun/EDTA): Create TE library
- [RepeatMasker](https://github.com/rmhubley/RepeatMasker): Soft mask the genome fasta
- [sra-tools](https://github.com/ncbi/sra-tools): RNASeq data download from SRA
- [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc), [fastp](https://github.com/OpenGene/fastp), [SortMeRNA](https://github.com/sortmerna/sortmerna): QC, trim and filter RNASeq evidence
- [STAR](https://github.com/alexdobin/STAR): RNASeq alignment
- Directly provided BAM files should be `--outSAMstrandField intronMotif` compliant
Expand Down
4 changes: 2 additions & 2 deletions assets/schema_rna.json
Original file line number Diff line number Diff line change
Expand Up @@ -14,8 +14,8 @@
},
"file_1": {
"type": "string",
"pattern": "^\\S+\\.(f(ast)?q\\.gz|bam)$",
"errorMessage": "FASTQ file for reads 1 or BAM file containing mapped reads must be provided, cannot contain spaces and must have extension '.fq.gz', '.fastq.gz' or '.bam'"
"pattern": "^SRR\\w+$|^\\S+\\.(f(ast)?q\\.gz|bam)$",
"errorMessage": "SRA ID that begins with SRR or FASTQ file for reads 1 or BAM file containing mapped reads must be provided. Files cannot contain spaces and must have extension '.fq.gz', '.fastq.gz' or '.bam'"
},
"file_2": {
"errorMessage": "FASTQ file for reads 2 cannot contain spaces and must have extension '.fq.gz' or '.fastq.gz'",
Expand Down
Binary file modified docs/img/genepal.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 1 addition & 1 deletion docs/output.md
Original file line number Diff line number Diff line change
Expand Up @@ -188,7 +188,7 @@ The final annotation files are saved in GFF3 format validated with [GENOMETOOLS]

</details>

The completeness of the annotations is checked with [BUSCO](https://gitlab.com/ezlab/busco). TO provide a comparative baseline, the completeness of the genomes is also checked. Moreover, the canonical/non-canonical splicing of the introns is also assessed by the pipeline.
The completeness of the annotations is checked with [BUSCO](https://gitlab.com/ezlab/busco). To provide a comparative baseline, the completeness of the genomes is also checked. Moreover, the canonical/non-canonical splicing of the introns is also assessed by the pipeline.

### Reports

Expand Down
2 changes: 1 addition & 1 deletion docs/usage.md
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,7 @@ With these two parameters, the pipeline has sufficient inputs to execute the [BR
RNASeq evidence must be provided through a samplesheet in CSV format which has the following columns,

- `sample:` A sample identifier. The `sample` identifiers have to be the same when you have re-sequenced the same sample more than once e.g. to increase sequencing depth. The pipeline will concatenate the raw reads before performing any downstream analysis.
- `file_1:` A FASTQ or BAM file
- `file_1:` A SRA ID for paired-end reads or FASTQ or BAM file
- `file_2:` A FASTQ file if `file_1` is also a FASTQ file and provides paired samples.
- `target_assemblies:` A semicolon `;` separated list of assembly tags from the [assemblysheet input](#assemblysheet-input). If `file_1` points to a BAM file, only a single assembly can be listed under `target_assemblies` for that sample. FASTQ data from `file_1` and `file_2` is aligned against each target assembly. BAM data from `file_1` is considered already aligned against the target assembly and is directly fed to BRAKER.

Expand Down
6 changes: 3 additions & 3 deletions main.nf
Original file line number Diff line number Diff line change
Expand Up @@ -36,8 +36,8 @@ workflow PLANTFOODRESEARCHOPEN_GENEPAL {
ch_braker_annotation
ch_braker_ex_asm_str
ch_benchmark_gff
ch_rna_sra
ch_rna_fq
ch_rna_bam
ch_rna_bam_by_assembly
ch_sortmerna_fastas
ch_ext_prot_fastas
Expand All @@ -58,8 +58,8 @@ workflow PLANTFOODRESEARCHOPEN_GENEPAL {
ch_braker_annotation,
ch_braker_ex_asm_str,
ch_benchmark_gff,
ch_rna_sra,
ch_rna_fq,
ch_rna_bam,
ch_rna_bam_by_assembly,
ch_sortmerna_fastas,
ch_ext_prot_fastas,
Expand Down Expand Up @@ -105,8 +105,8 @@ workflow {
PIPELINE_INITIALISATION.out.braker_annotation,
PIPELINE_INITIALISATION.out.braker_ex_asm_str,
PIPELINE_INITIALISATION.out.benchmark_gff,
PIPELINE_INITIALISATION.out.rna_sra,
PIPELINE_INITIALISATION.out.rna_fq,
PIPELINE_INITIALISATION.out.rna_bam,
PIPELINE_INITIALISATION.out.rna_bam_by_assembly,
PIPELINE_INITIALISATION.out.sortmerna_fastas,
PIPELINE_INITIALISATION.out.ext_prot_fastas,
Expand Down
20 changes: 20 additions & 0 deletions modules.json
Original file line number Diff line number Diff line change
Expand Up @@ -150,6 +150,11 @@
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["modules"]
},
"custom/sratoolsncbisettings": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["fastq_download_prefetch_fasterqdump_sratools"]
},
"diamond/makedb": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
Expand Down Expand Up @@ -235,6 +240,16 @@
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["modules"]
},
"sratools/fasterqdump": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["fastq_download_prefetch_fasterqdump_sratools"]
},
"sratools/prefetch": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["fastq_download_prefetch_fasterqdump_sratools"]
},
"star/align": {
"branch": "master",
"git_sha": "3aa0aec1d52d492fe241919f0c6100ebf0074082",
Expand Down Expand Up @@ -264,6 +279,11 @@
},
"subworkflows": {
"nf-core": {
"fastq_download_prefetch_fasterqdump_sratools": {
"branch": "master",
"git_sha": "1fc29f92e439d5631fdf34b8ac4687297d70f5ec",
"installed_by": ["subworkflows"]
},
"fastq_fastqc_umitools_fastp": {
"branch": "master",
"git_sha": "4026bab16a91b1b0b18d80ff465819ca725f33fd",
Expand Down
5 changes: 5 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/environment.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

23 changes: 23 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/main.nf

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

32 changes: 32 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/meta.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

42 changes: 42 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/tests/main.nf.test

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

14 changes: 14 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/tests/nextflow.config

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

2 changes: 2 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/tests/tags.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 6 additions & 0 deletions modules/nf-core/sratools/fasterqdump/environment.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading
Loading