Update: Include sample_name IRIDA-Next input column #26

sgsutcliffe · 2024-09-16T15:58:25Z

Modified the template for input samplesheet.csv file to include the sample_name column in addition to sample in-line with changes to IRIDA-Next update as seen with the speciesabundance pipeline and staramrnf. What this means is that the output files and the sample name will be changed to sample_name if a sample_name is called. If snvphylnfc is being locally then the sample_name can be left blank.

Made a few changes:
- sample_name special characters will be replaced with "_"
- If no sample_name is supplied in the column sample will be used
- To avoid repeat values for sample_name all sample_name values will be suffixed with sample
- Tests to check that the variety of different sample_names work with the

PR checklist

github-actions · 2024-09-16T15:59:47Z

`nf-core lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit 1419462

+| ✅ 143 tests passed       |+
#| ❔  27 tests were ignored |#
!| ❗   3 tests had warnings |!

❗ Test warnings:

nextflow_config - Config manifest.version should end in dev: 2.1.1
schema_lint - Schema $id should be https://raw.githubusercontent.com/phac-nml/snvphylnfc/master/nextflow_schema.json
Found https://raw.githubusercontent.com/phac-nml/snvphylnfc/main/nextflow_schema.json
nfcore_yml - nf-core version not set in .nf-core.yml

❔ Tests ignored:

files_exist - File is ignored: assets/nf-core-snvphylnfc_logo_light.png
files_exist - File is ignored: docs/images/nf-core-snvphylnfc_logo_light.png
files_exist - File is ignored: docs/images/nf-core-snvphylnfc_logo_dark.png
files_exist - File is ignored: .github/workflows/awstest.yml
files_exist - File is ignored: .github/workflows/awsfulltest.yml
files_exist - File is ignored: lib/Utils.groovy
files_exist - File is ignored: lib/WorkflowMain.groovy
files_exist - File is ignored: lib/NfcoreTemplate.groovy
files_exist - File is ignored: lib/WorkflowSnvphylnfc.groovy
nextflow_config - Config variable ignored: manifest.name
nextflow_config - Config variable ignored: manifest.homePage
files_unchanged - File ignored due to lint config: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_unchanged - File ignored due to lint config: .github/CONTRIBUTING.md
files_unchanged - File ignored due to lint config: .github/ISSUE_TEMPLATE/bug_report.yml
files_unchanged - File ignored due to lint config: .github/PULL_REQUEST_TEMPLATE.md
files_unchanged - File ignored due to lint config: .github/workflows/branch.yml
files_unchanged - File ignored due to lint config: .github/workflows/linting.yml
files_unchanged - File ignored due to lint config: assets/email_template.html
files_unchanged - File ignored due to lint config: assets/email_template.txt
files_unchanged - File ignored due to lint config: assets/sendmail_template.txt
files_unchanged - File does not exist: assets/nf-core-snvphylnfc_logo_light.png
files_unchanged - File does not exist: docs/images/nf-core-snvphylnfc_logo_light.png
files_unchanged - File does not exist: docs/images/nf-core-snvphylnfc_logo_dark.png
files_unchanged - File ignored due to lint config: docs/README.md
actions_awstest - 'awstest.yml' workflow not found: /home/runner/work/snvphylnfc/snvphylnfc/.github/workflows/awstest.yml
actions_awsfulltest - actions_awsfulltest
pipeline_name_conventions - pipeline_name_conventions

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: main.nf
files_exist - File found: assets/multiqc_config.yml
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: modules.json
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: docs/images/nf-core-snvphylnfc_logo.png
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: Singularity
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: params.validationShowHiddenParams
nextflow_config - Config variable found: params.validationSchemaIgnoreParams
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.metadata_1_header= metadata_1
nextflow_config - Config default value correct: params.metadata_2_header= metadata_2
nextflow_config - Config default value correct: params.metadata_3_header= metadata_3
nextflow_config - Config default value correct: params.metadata_4_header= metadata_4
nextflow_config - Config default value correct: params.metadata_5_header= metadata_5
nextflow_config - Config default value correct: params.metadata_6_header= metadata_6
nextflow_config - Config default value correct: params.metadata_7_header= metadata_7
nextflow_config - Config default value correct: params.metadata_8_header= metadata_8
nextflow_config - Config default value correct: params.min_coverage_depth= 15
nextflow_config - Config default value correct: params.min_mapping_percent_cov= 80
nextflow_config - Config default value correct: params.min_mean_mapping_quality= 30
nextflow_config - Config default value correct: params.window_size= 500
nextflow_config - Config default value correct: params.density_threshold= 2
nextflow_config - Config default value correct: params.snv_abundance_ratio= 0.75
nextflow_config - Config default value correct: params.min_repeat_length= 150
nextflow_config - Config default value correct: params.min_repeat_pid= 90
nextflow_config - Config default value correct: params.max_cpus= 4
nextflow_config - Config default value correct: params.max_memory= 2.GB
nextflow_config - Config default value correct: params.max_time= 1.h
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.validate_params= true
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
actions_ci - '.github/workflows/ci.yml' is triggered on expected events
actions_ci - '.github/workflows/ci.yml' checks minimum NF version
readme - README Zenodo placeholder was replaced with DOI.
pipeline_todos - No TODO strings found
template_strings - Did not find any Jinja template strings (119 files)
schema_lint - Schema lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: branch.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
multiqc_config - assets/multiqc_config.yml found and not ignored.
multiqc_config - assets/multiqc_config.yml contains report_section_order
multiqc_config - assets/multiqc_config.yml contains export_plots
multiqc_config - assets/multiqc_config.yml contains report_comment
multiqc_config - assets/multiqc_config.yml follows the ordering scheme of the minimally required plugins.
multiqc_config - assets/multiqc_config.yml contains 'export_plots: true'.
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'
base_config - conf/base.config found and not ignored.
base_config - CUSTOM_DUMPSOFTWAREVERSIONS found in conf/base.config and Nextflow scripts.
modules_config - conf/modules.config found and not ignored.
modules_config - CUSTOM_DUMPSOFTWAREVERSIONS found in conf/modules.config and Nextflow scripts.
modules_config - CONSOLIDATE_BCFS found in conf/modules.config and Nextflow scripts.
nfcore_yml - Repository type in .nf-core.yml is valid: pipeline

Run details

nf-core/tools version 2.14.1
Run at 2024-09-23 21:35:44

sgsutcliffe · 2024-09-17T13:32:13Z

Tested it in IRIDA-Next locally. Looks good!

sgsutcliffe · 2024-09-17T20:17:54Z

An issue that has arisen due to these changes is that when we modify the sample_name to allow it correspond to allowable characters for the pipeline we break the expected outcome of the function select_reference. In addition now it is possible may use the sample column still to select the reference fasta files.

CHANGELOG.md

README.md

workflows/snvphylnfc.nf

Approved the wrong PR

kylacochrane

Just the one change in the CHANGELOG 😄
Great job Steven!

kylacochrane · 2024-09-19T22:01:18Z

CHANGELOG.md

+- Modified the template for input csv file to include a `sample_name` column in addition to `sample` in-line with changes to [IRIDA-Next update] as seen with the [speciesabundance pipeline]
+  - `sample_name` special characters will be replaced with `"_"`
+  - If no `sample_name` is supplied in the column `sample` will be used
+  - To avoid repeat values for `sample_name` all `sample_name` values will be suffixed with the index of the `input` samplesheet.csv


Just checking on this - is the plan to use the index or append sample_name?

There is always one place in documentation where I forget to update things to the newest version! Thanks for catching this!

kylacochrane · 2024-09-19T22:03:08Z

README.md

+`sample` is a unique identifier, designed to be used internally or in IRIDA-Next, or when `sample_name` is not provided.
+
+`sample_name`, allows more flexibility in naming output files or sample identification. Unlike `sample`, `sample_name` is not required to contain unique values. `Nextflow` requires unique sample names, and therefore in the instance of repeat `sample_names`, `sample` will be suffixed to any `sample_name`. Non-alphanumeric characters (excluding `_`,`-`,`.`) will be replaced with `"_"`.
+


I think this description was much needed!!!
Just one comment on how it slightly differs from the CHANGELOG.md where index was suggested as the suffix should there be repeat sample_names 😄

kylacochrane · 2024-09-19T22:04:54Z

workflows/snvphylnfc.nf

@@ -243,7 +261,8 @@ def select_reference(refgenome, reference_sample_id, sample_assemblies) {
        log.debug "Selecting reference genome ${reference_genome} from '--refgenome'."
    }
    else if (reference_sample_id) {
-        reference_genome = sample_assemblies.filter { it[0] == reference_sample_id && it[1] != null}
+        // Check each meta category (meta.id, meta.id_alt, meta.irida_id) for a match to params.reference_sample_id
+        reference_genome = sample_assemblies.filter { (it[0].id == reference_sample_id || it[0].irida_id == reference_sample_id || it[0].id_alt == reference_sample_id) && it[1] != null}
                                            .ifEmpty { error("The provided reference sample ID (${reference_sample_id}) is either missing or has no associated reference assembly.") }


apetkau

Thanks so much for implementing this Steven, everyone else for their comments 😄

I ran this in IRIDA Next and it all works for me. No other comments.

kylacochrane

This looks great Steven!

sgsutcliffe · 2024-09-23T20:42:00Z

Added in a change suggested by @emarinier to remove the necessity of adding in meta.id_alt to the schema.

This reverts commit 6fdd3ab.

This reverts commit 305bf21.

This reverts commit 4170990.

This reverts commit 1fc6261.

This reverts commit 66a2b48.

sgsutcliffe · 2024-09-23T21:35:49Z

Rather than troubleshooting why the change without meta.id_alt was not working I decided to go with the original solution.

Adding the sample_name column to the samplesheet

73699fe

sgsutcliffe changed the base branch from main to dev September 16, 2024 15:59

phac-nml deleted a comment from github-actions bot Sep 16, 2024

sgsutcliffe added 3 commits September 16, 2024 13:17

Rename test data to match sample_name

445373e

Changed names - fix rename typo

dc8f4bc

Fixed the nf-tests failing

3a3efc3

phac-nml deleted a comment from github-actions bot Sep 16, 2024

sgsutcliffe requested review from apetkau, emarinier and kylacochrane September 16, 2024 17:46

sgsutcliffe added 2 commits September 17, 2024 16:22

Changes to how select_reference function selects the reference assembly

c33fed7

Fix formatting

d6ca94a

emarinier requested changes Sep 18, 2024

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

README.md Show resolved Hide resolved

workflows/snvphylnfc.nf Outdated Show resolved Hide resolved

kylacochrane previously approved these changes Sep 18, 2024

View reviewed changes

kylacochrane self-requested a review September 18, 2024 19:00

sgsutcliffe added 3 commits September 18, 2024 15:39

Reworded sample vs sample_name section

38aa13c

Fixed typo

992021b

Formatting

499509e

Fixed comment block

1b3dee5

kylacochrane requested changes Sep 19, 2024

View reviewed changes

Updated the wording on the changes

6fdd3ab

apetkau approved these changes Sep 23, 2024

View reviewed changes

kylacochrane approved these changes Sep 23, 2024

View reviewed changes

Remove necessity of meta.id_alt key

66a2b48

emarinier approved these changes Sep 23, 2024

View reviewed changes

sgsutcliffe added 7 commits September 23, 2024 17:00

Reads metadata fix

1fc6261

formatting fix

4170990

Revert "Updated the wording on the changes"

305bf21

This reverts commit 6fdd3ab.

Reapply "Updated the wording on the changes"

b98ac43

This reverts commit 305bf21.

Revert "formatting fix"

d676e97

This reverts commit 4170990.

Revert "Reads metadata fix"

d23da98

This reverts commit 1fc6261.

Revert "Remove necessity of meta.id_alt key"

1419462

This reverts commit 66a2b48.

sgsutcliffe merged commit d6d8796 into dev Sep 23, 2024
4 checks passed

sgsutcliffe deleted the add-sample-name branch September 23, 2024 21:42

sgsutcliffe mentioned this pull request Oct 18, 2024

Release 2.2.0 #27

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update: Include sample_name IRIDA-Next input column #26

Update: Include sample_name IRIDA-Next input column #26

sgsutcliffe commented Sep 16, 2024 •

edited

Loading

github-actions bot commented Sep 16, 2024 •

edited

Loading

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

sgsutcliffe commented Sep 17, 2024

sgsutcliffe commented Sep 17, 2024

kylacochrane left a comment

kylacochrane Sep 19, 2024

sgsutcliffe Sep 20, 2024

kylacochrane Sep 19, 2024

kylacochrane Sep 19, 2024

apetkau left a comment

kylacochrane left a comment

sgsutcliffe commented Sep 23, 2024

sgsutcliffe commented Sep 23, 2024

		`sample` is a unique identifier, designed to be used internally or in IRIDA-Next, or when `sample_name` is not provided.

		`sample_name`, allows more flexibility in naming output files or sample identification. Unlike `sample`, `sample_name` is not required to contain unique values. `Nextflow` requires unique sample names, and therefore in the instance of repeat `sample_names`, `sample` will be suffixed to any `sample_name`. Non-alphanumeric characters (excluding `_`,`-`,`.`) will be replaced with `"_"`.

Update: Include sample_name IRIDA-Next input column #26

Update: Include sample_name IRIDA-Next input column #26

Conversation

sgsutcliffe commented Sep 16, 2024 • edited Loading

PR checklist

github-actions bot commented Sep 16, 2024 • edited Loading

nf-core lint overall result: Passed ✅ ⚠️

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

sgsutcliffe commented Sep 17, 2024

sgsutcliffe commented Sep 17, 2024

kylacochrane left a comment

Choose a reason for hiding this comment

kylacochrane Sep 19, 2024

Choose a reason for hiding this comment

sgsutcliffe Sep 20, 2024

Choose a reason for hiding this comment

kylacochrane Sep 19, 2024

Choose a reason for hiding this comment

kylacochrane Sep 19, 2024

Choose a reason for hiding this comment

apetkau left a comment

Choose a reason for hiding this comment

kylacochrane left a comment

Choose a reason for hiding this comment

sgsutcliffe commented Sep 23, 2024

sgsutcliffe commented Sep 23, 2024

sgsutcliffe commented Sep 16, 2024 •

edited

Loading

github-actions bot commented Sep 16, 2024 •

edited

Loading

`nf-core lint` overall result: Passed ✅ ⚠️