Delete lib directory and replace with utils_* subworkflows #1197

drpatelh · 2024-01-20T18:22:54Z

Delete lib/ directory and replace with utils_* subworkflows installed from nf-core/modules
Split out PREPARE_GENOME subworkflow to run outside of main rnaseq workflow
Remove CUSTOM_DUMPSOFTWAREVERSIONS to use native MultiQC software version reporting
Replace local MultiQC module with the one from nf-core/modules
Remove test_cache profile as this will be superseded by using test data from S3 whilst porting to nf-test

github-actions · 2024-01-20T18:24:46Z

`nf-core lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 3734375

+| ✅ 142 tests passed       |+
#| ❔  10 tests were ignored |#
!| ❗   5 tests had warnings |!

❗ Test warnings:

files_exist - File not found: assets/multiqc_config.yml
files_exist - File not found: .github/workflows/awstest.yml
files_exist - File not found: .github/workflows/awsfulltest.yml
files_exist - File not found: lib/WorkflowRnaseq.groovy
pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline

❔ Tests ignored:

files_exist - File is ignored: lib/nfcore_external_java_deps.jar
files_exist - File is ignored: lib/NfcoreTemplate.groovy
files_exist - File is ignored: lib/Utils.groovy
files_exist - File is ignored: lib/WorkflowMain.groovy
files_unchanged - File ignored due to lint config: assets/email_template.html
files_unchanged - File ignored due to lint config: assets/email_template.txt
files_unchanged - File does not exist: lib/nfcore_external_java_deps.jar
files_unchanged - File does not exist: lib/NfcoreTemplate.groovy
actions_awstest - 'awstest.yml' workflow not found: /home/runner/work/rnaseq/rnaseq/.github/workflows/awstest.yml
multiqc_config - 'assets/multiqc_config.yml' not found

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-rnaseq_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-rnaseq_logo_light.png
files_exist - File found: docs/images/nf-core-rnaseq_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: main.nf
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: modules.json
files_exist - File found: pyproject.toml
files_exist - File not found check: Singularity
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: docs/images/nf-core-rnaseq_logo.png
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: .travis.yml
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: params.validationShowHiddenParams
nextflow_config - Config variable found: params.validationSchemaIgnoreParams
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: 3.15.0dev
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-rnaseq_logo_light.png matches the template
files_unchanged - docs/images/nf-core-rnaseq_logo_light.png matches the template
files_unchanged - docs/images/nf-core-rnaseq_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
files_unchanged - pyproject.toml matches the template
actions_ci - '.github/workflows/ci.yml' is triggered on expected events
actions_ci - '.github/workflows/ci.yml' checks minimum NF version
readme - README Nextflow minimum version badge matched config. Badge: 23.04.0, Config: 23.04.0
readme - README Zenodo placeholder was replaced with DOI.
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (284 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
actions_schema_validation - Workflow validation passed: clean-up.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
actions_schema_validation - Workflow validation passed: fix-linting.yml
actions_schema_validation - Workflow validation passed: cloud_tests_small.yml
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: release-announcements.yml
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: cloud_tests_full.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'

Run details

nf-core/tools version 2.11.1
Run at 2024-01-21 11:13:50

maxulysse

I LOOOOOVE IT

maxulysse · 2024-01-22T09:57:06Z

Just please not that there is still an issue with the monochrome_logs params:

> nextflow run drpatelh/nf-core-rnaseq -r remove_lib -profile test,docker --outdir results
N E X T F L O W  ~  version 23.04.3
Pulling drpatelh/nf-core-rnaseq ...
 downloaded from https://github.com/drpatelh/nf-core-rnaseq.git
Launching `https://github.com/drpatelh/nf-core-rnaseq` [pedantic_bassi] DSL2 - revision: 37343750f1 [remove_lib]
WARN: Access to undefined parameter `monochromeLogs` -- Initialise it to a default value eg. `params.monochromeLogs = some_value`

drpatelh · 2024-01-22T10:02:45Z

Just please not that there is still an issue with the monochrome_logs params:

Yep, I think this is actually coming from the nf-validation plugin because I see the same issue with nf-core/fetchngs. Can you create an issue there please? I started trying to look at the nf-validation code base to find the issue but got distracted.

adamrtalbot

Future thoughts for as we update this.

adamrtalbot · 2024-01-22T11:15:11Z

subworkflows/local/prepare_genome/main.nf

 workflow PREPARE_GENOME {
    take:
-    fasta                //      file: /path/to/genome.fasta
-    gtf                  //      file: /path/to/genome.gtf
-    gff                  //      file: /path/to/genome.gff
-    additional_fasta     //      file: /path/to/additional.fasta
-    transcript_fasta     //      file: /path/to/transcript.fasta
-    gene_bed             //      file: /path/to/gene.bed
-    splicesites          //      file: /path/to/splicesites.txt
-    bbsplit_fasta_list   //      file: /path/to/bbsplit_fasta_list.txt
-    star_index           // directory: /path/to/star/index/
-    rsem_index           // directory: /path/to/rsem/index/
-    salmon_index         // directory: /path/to/salmon/index/
-    kallisto_index       // directory: /path/to/kallisto/index/
-    hisat2_index         // directory: /path/to/hisat2/index/
-    bbsplit_index        // directory: /path/to/rsem/index/
-    gencode              //   boolean: whether the genome is from GENCODE
-    is_aws_igenome       //   boolean: whether the genome files are from AWS iGenomes
-    biotype              //    string: if additional fasta file is provided biotype value to use when appending entries to GTF file
-    prepare_tool_indices //      list: tools to prepare indices for
-    filter_gtf           //   boolean: whether to filter GTF file
+    fasta                    //      file: /path/to/genome.fasta
+    gtf                      //      file: /path/to/genome.gtf
+    gff                      //      file: /path/to/genome.gff
+    additional_fasta         //      file: /path/to/additional.fasta
+    transcript_fasta         //      file: /path/to/transcript.fasta
+    gene_bed                 //      file: /path/to/gene.bed
+    splicesites              //      file: /path/to/splicesites.txt
+    bbsplit_fasta_list       //      file: /path/to/bbsplit_fasta_list.txt
+    star_index               // directory: /path/to/star/index/
+    rsem_index               // directory: /path/to/rsem/index/
+    salmon_index             // directory: /path/to/salmon/index/
+    kallisto_index           // directory: /path/to/kallisto/index/
+    hisat2_index             // directory: /path/to/hisat2/index/
+    bbsplit_index            // directory: /path/to/rsem/index/
+    gencode                  //   boolean: whether the genome is from GENCODE
+    featurecounts_group_type //    string: The attribute type used to group feature types in the GTF file when generating the biotype plot with featureCounts
+    aligner                  //    string: Specifies the alignment algorithm to use - available options are 'star_salmon', 'star_rsem' and 'hisat2'
+    pseudo_aligner           //    string: Specifies the pseudo aligner to use - available options are 'salmon'. Runs in addition to '--aligner'
+    skip_gtf_filter          //   boolean: Skip filtering of GTF for valid scaffolds and/ or transcript IDs
+    skip_bbsplit             //   boolean: Skip BBSplit for removal of non-reference genome reads
+    skip_alignment           //   boolean: Skip all of the alignment-based processes within the pipeline
+    skip_pseudo_alignment    //   boolean: Skip all of the pseudoalignment-based processes within the pipeline


Looking at this, I think this subworkflow is doing waaayyyyy too much.

adamrtalbot · 2024-01-22T11:18:56Z

subworkflows/local/prepare_genome/main.nf

+        // Determine whether to filter the GTF or not
+        def filter_gtf = 
+            ((
+                // Condition 1: Alignment is required and aligner is set
+                !skip_alignment && aligner
+            ) || 
+            (
+                // Condition 2: Pseudoalignment is required and pseudoaligner is set
+                !skip_pseudo_alignment && pseudo_aligner
+            ) || 
+            (
+                // Condition 3: Transcript FASTA file is not provided
+                !transcript_fasta
+            )) &&
+            (
+                // Condition 4: --skip_gtf_filter is not provided
+                !skip_gtf_filter
+            )


Way too much stuff going on here. We should be able to simplify this by clarifying what filter_gtf means. What are we filtering from the GTF? Why?

adamrtalbot · 2024-01-22T11:22:17Z

subworkflows/local/utils_nfcore_rnaseq_pipeline/main.nf

+    //
+    // Print version and exit if required and dump pipeline parameters to JSON file
+    //
+    UTILS_NEXTFLOW_PIPELINE (


Still not happy with the name UTILS_NEXTFLOW_PIPELINE, NEXTFLOW_PIPELINE_UTILITIES makes more sense but it still doesn't say what the workflow actually does, making it hard to read.

If we didn't know what the subworkflow did, what would we think this was doing?

adamrtalbot · 2024-01-22T11:22:29Z

subworkflows/local/utils_nfcore_rnaseq_pipeline/main.nf

+    def pre_help_text = nfCoreLogo(params.monochrome_logs)
+    def post_help_text = '\n' + workflowCitation() + '\n' + dashedLine(params.monochrome_logs)
+    def String workflow_command = "nextflow run ${workflow.manifest.name} -profile <docker/singularity/.../institute> --input samplesheet.csv --genome GRCh37 --outdir <OUTDIR>"
+    UTILS_NFVALIDATION_PLUGIN (


Same problem here.

adamrtalbot · 2024-01-22T11:26:40Z

subworkflows/local/utils_nfcore_rnaseq_pipeline/main.nf

+========================================================================================
+*/
+
+workflow PIPELINE_INITIALISATION {


This name is slightly better, but a function name is a verb so saying what it is doing is more clear for the developer.

adamrtalbot · 2024-01-22T11:28:29Z

subworkflows/local/utils_nfcore_rnaseq_pipeline/main.nf

+    //
+    // Custom validation for pipeline parameters
+    //
+    validateInputParameters()


I looked at this and without looking at the code knew exactly what it was doing. A+.

drpatelh added 4 commits January 20, 2024 13:44

Move main workflow into it's own directory

cbdfbc6

Remove nfcore_external_java_deps.jar

91c5742

Install utils subworkflows from nf-core modules

c85a4cb

Delete lib directory and replace with utils_* subworkflows

15924c4

drpatelh added 2 commits January 20, 2024 18:41

Fix linting

433b455

Add prepare_tool_indices logic to rnaseq workflow

b0fc8cb

drpatelh marked this pull request as draft January 20, 2024 19:27

drpatelh changed the base branch from nf-test to config_refactor January 21, 2024 09:39

drpatelh added 2 commits January 21, 2024 09:55

Move rrna-db-defaults.txt to workflow assets

2c53607

Replace local MultiQC module with one from nf-core/modules

82f0eea

drpatelh marked this pull request as ready for review January 21, 2024 10:57

Remove test_cache profile

3734375

maxulysse approved these changes Jan 22, 2024

View reviewed changes

drpatelh merged commit 9308fd2 into nf-core:config_refactor Jan 22, 2024
29 checks passed

adamrtalbot reviewed Jan 22, 2024

View reviewed changes

adamrtalbot mentioned this pull request Feb 5, 2024

Remove lib directory and modules.config #1206

Merged

This was referenced Mar 11, 2024

Replace local MULTIQC with nf-core version #1168

Closed

MultiQC module redundancy #1205

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Delete lib directory and replace with utils_* subworkflows #1197

Delete lib directory and replace with utils_* subworkflows #1197

drpatelh commented Jan 20, 2024 •

edited

Loading

github-actions bot commented Jan 20, 2024 •

edited

Loading

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

maxulysse left a comment

maxulysse commented Jan 22, 2024

drpatelh commented Jan 22, 2024 •

edited

Loading

adamrtalbot left a comment

adamrtalbot Jan 22, 2024

adamrtalbot Jan 22, 2024

adamrtalbot Jan 22, 2024

adamrtalbot Jan 22, 2024

adamrtalbot Jan 22, 2024

adamrtalbot Jan 22, 2024

Delete lib directory and replace with utils_* subworkflows #1197

Delete lib directory and replace with utils_* subworkflows #1197

Conversation

drpatelh commented Jan 20, 2024 • edited Loading

github-actions bot commented Jan 20, 2024 • edited Loading

nf-core lint overall result: Passed ✅ ⚠️

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

maxulysse left a comment

Choose a reason for hiding this comment

maxulysse commented Jan 22, 2024

drpatelh commented Jan 22, 2024 • edited Loading

adamrtalbot left a comment

Choose a reason for hiding this comment

adamrtalbot Jan 22, 2024

Choose a reason for hiding this comment

adamrtalbot Jan 22, 2024

Choose a reason for hiding this comment

adamrtalbot Jan 22, 2024

Choose a reason for hiding this comment

adamrtalbot Jan 22, 2024

Choose a reason for hiding this comment

adamrtalbot Jan 22, 2024

Choose a reason for hiding this comment

adamrtalbot Jan 22, 2024

Choose a reason for hiding this comment

drpatelh commented Jan 20, 2024 •

edited

Loading

github-actions bot commented Jan 20, 2024 •

edited

Loading

`nf-core lint` overall result: Passed ✅ ⚠️

drpatelh commented Jan 22, 2024 •

edited

Loading