Skip to content

Commit

Permalink
Merge pull request #103 from Plant-Food-Research-Open/feat/sratools
Browse files Browse the repository at this point in the history
Added sra-tools for RNASeq data download
  • Loading branch information
GallVp authored Nov 19, 2024
2 parents 727a66b + bfc15fe commit 1e05bc6
Show file tree
Hide file tree
Showing 40 changed files with 1,349 additions and 32 deletions.
1 change: 1 addition & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
9. Now skipping FastQC by default [#98](https://github.com/Plant-Food-Research-Open/genepal/issues/98)
10. Added an HTML report [#44](https://github.com/Plant-Food-Research-Open/genepal/issues/44)
11. Added content type as text/html for the MultiQC and genepal reports
12. Added sra-tools for RNASeq data download [#102](https://github.com/Plant-Food-Research-Open/genepal/issues/102)

### `Fixed`

Expand Down
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@
- [fasta_validator](https://github.com/linsalrob/fasta_validator): Validate genome FASTA
- [RepeatModeler](https://github.com/Dfam-consortium/RepeatModeler) or [EDTA](https://github.com/oushujun/EDTA): Create TE library
- [RepeatMasker](https://github.com/rmhubley/RepeatMasker): Soft mask the genome fasta
- [sra-tools](https://github.com/ncbi/sra-tools): RNASeq data download from SRA
- [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc), [fastp](https://github.com/OpenGene/fastp), [SortMeRNA](https://github.com/sortmerna/sortmerna): QC, trim and filter RNASeq evidence
- [STAR](https://github.com/alexdobin/STAR): RNASeq alignment
- Directly provided BAM files should be `--outSAMstrandField intronMotif` compliant
Expand Down
4 changes: 2 additions & 2 deletions assets/schema_rna.json
Original file line number Diff line number Diff line change
Expand Up @@ -14,8 +14,8 @@
},
"file_1": {
"type": "string",
"pattern": "^\\S+\\.(f(ast)?q\\.gz|bam)$",
"errorMessage": "FASTQ file for reads 1 or BAM file containing mapped reads must be provided, cannot contain spaces and must have extension '.fq.gz', '.fastq.gz' or '.bam'"
"pattern": "^SRR\\w+$|^\\S+\\.(f(ast)?q\\.gz|bam)$",
"errorMessage": "SRA ID that begins with SRR or FASTQ file for reads 1 or BAM file containing mapped reads must be provided. Files cannot contain spaces and must have extension '.fq.gz', '.fastq.gz' or '.bam'"
},
"file_2": {
"errorMessage": "FASTQ file for reads 2 cannot contain spaces and must have extension '.fq.gz' or '.fastq.gz'",
Expand Down
Binary file modified docs/img/genepal.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 1 addition & 1 deletion docs/output.md
Original file line number Diff line number Diff line change
Expand Up @@ -188,7 +188,7 @@ The final annotation files are saved in GFF3 format validated with [GENOMETOOLS]

</details>

The completeness of the annotations is checked with [BUSCO](https://gitlab.com/ezlab/busco). TO provide a comparative baseline, the completeness of the genomes is also checked. Moreover, the canonical/non-canonical splicing of the introns is also assessed by the pipeline.
The completeness of the annotations is checked with [BUSCO](https://gitlab.com/ezlab/busco). To provide a comparative baseline, the completeness of the genomes is also checked. Moreover, the canonical/non-canonical splicing of the introns is also assessed by the pipeline.

### Reports

Expand Down
2 changes: 1 addition & 1 deletion docs/usage.md
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,7 @@ With these two parameters, the pipeline has sufficient inputs to execute the [BR
RNASeq evidence must be provided through a samplesheet in CSV format which has the following columns,

- `sample:` A sample identifier. The `sample` identifiers have to be the same when you have re-sequenced the same sample more than once e.g. to increase sequencing depth. The pipeline will concatenate the raw reads before performing any downstream analysis.
- `file_1:` A FASTQ or BAM file
- `file_1:` A SRA ID for paired-end reads or FASTQ or BAM file
- `file_2:` A FASTQ file if `file_1` is also a FASTQ file and provides paired samples.
- `target_assemblies:` A semicolon `;` separated list of assembly tags from the [assemblysheet input](#assemblysheet-input). If `file_1` points to a BAM file, only a single assembly can be listed under `target_assemblies` for that sample. FASTQ data from `file_1` and `file_2` is aligned against each target assembly. BAM data from `file_1` is considered already aligned against the target assembly and is directly fed to BRAKER.

Expand Down
6 changes: 3 additions & 3 deletions main.nf
Original file line number Diff line number Diff line change
Expand Up @@ -36,8 +36,8 @@ workflow PLANTFOODRESEARCHOPEN_GENEPAL {
ch_braker_annotation
ch_braker_ex_asm_str
ch_benchmark_gff
ch_rna_sra
ch_rna_fq
ch_rna_bam
ch_rna_bam_by_assembly
ch_sortmerna_fastas
ch_ext_prot_fastas
Expand All @@ -58,8 +58,8 @@ workflow PLANTFOODRESEARCHOPEN_GENEPAL {
ch_braker_annotation,
ch_braker_ex_asm_str,
ch_benchmark_gff,
ch_rna_sra,
ch_rna_fq,
ch_rna_bam,
ch_rna_bam_by_assembly,
ch_sortmerna_fastas,
ch_ext_prot_fastas,
Expand Down Expand Up @@ -105,8 +105,8 @@ workflow {
PIPELINE_INITIALISATION.out.braker_annotation,
PIPELINE_INITIALISATION.out.braker_ex_asm_str,
PIPELINE_INITIALISATION.out.benchmark_gff,
PIPELINE_INITIALISATION.out.rna_sra,
PIPELINE_INITIALISATION.out.rna_fq,
PIPELINE_INITIALISATION.out.rna_bam,
PIPELINE_INITIALISATION.out.rna_bam_by_assembly,
PIPELINE_INITIALISATION.out.sortmerna_fastas,
PIPELINE_INITIALISATION.out.ext_prot_fastas,
Expand Down
20 changes: 20 additions & 0 deletions modules.json
Original file line number Diff line number Diff line change
Expand Up @@ -150,6 +150,11 @@
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["modules"]
},
"custom/sratoolsncbisettings": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["fastq_download_prefetch_fasterqdump_sratools"]
},
"diamond/makedb": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
Expand Down Expand Up @@ -235,6 +240,16 @@
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["modules"]
},
"sratools/fasterqdump": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["fastq_download_prefetch_fasterqdump_sratools"]
},
"sratools/prefetch": {
"branch": "master",
"git_sha": "666652151335353eef2fcd58880bcef5bc2928e1",
"installed_by": ["fastq_download_prefetch_fasterqdump_sratools"]
},
"star/align": {
"branch": "master",
"git_sha": "3aa0aec1d52d492fe241919f0c6100ebf0074082",
Expand Down Expand Up @@ -264,6 +279,11 @@
},
"subworkflows": {
"nf-core": {
"fastq_download_prefetch_fasterqdump_sratools": {
"branch": "master",
"git_sha": "1fc29f92e439d5631fdf34b8ac4687297d70f5ec",
"installed_by": ["subworkflows"]
},
"fastq_fastqc_umitools_fastp": {
"branch": "master",
"git_sha": "4026bab16a91b1b0b18d80ff465819ca725f33fd",
Expand Down
5 changes: 5 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/environment.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

23 changes: 23 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/main.nf

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

32 changes: 32 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/meta.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

42 changes: 42 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/tests/main.nf.test

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

14 changes: 14 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/tests/nextflow.config

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

2 changes: 2 additions & 0 deletions modules/nf-core/custom/sratoolsncbisettings/tests/tags.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 6 additions & 0 deletions modules/nf-core/sratools/fasterqdump/environment.yml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading

0 comments on commit 1e05bc6

Please sign in to comment.