Add in IRIDA Next updates Pipeline Best Practices #7

DarianHole · 2024-12-23T21:21:37Z

`Added`

nf-schema plugin and associated functions
- Schemas
- Param summary, param help, version
- samplesheetToList
params.input <CSV> to allow input samplesheets
iridanext plugin
nf-prov plugin
Required nf-core files
CI tests and linting

`Changed`

Final quality metrics output is a CSV now to work with IRIDA next
Logic for input data
Logic for skipping specific modules
- Allowed to skip el_gato ST
- Allowed to skip el_gato allele plotting
All process publishDir now in the modules.conf file
Container for allele plotting
Adjusted default warn and fail parameters for quality module based on testing
- min_reads to 60,000 from 150,000

`Updated`

Usage and README docs for the input adjustments

Issues Addressed

Solves Samplesheet input and validation #5
Solves Add parameter schema #6
Solves Test data and nf-test #4
Solves Issue with PLOT_PYSAMSTATS_TSV when running with docker #9

apetkau

This is amazing work. Thanks so much Darian 😄 . You've done a great job with this.

I have some in-line comments for you. Some of these are more meant for later PRs, or just recommendations or comments for your information for later. Please let me know if you have any questions.

.github/workflows/branch.yml

CHANGELOG.md

assets/samplesheet.csv

assets/schema_input.json

apetkau · 2025-01-15T20:24:40Z

assets/schema_input.json

+                "type": "string",
+                "pattern": "^\\S+$",
+                "errorMessage": "Sample name must be provided and cannot contain spaces",
+                "meta": ["id"]


Just as a note, there are some additional changes we will need to make for identifiers if you wish to use the sample names in IRIDA Next. But, I would recommend we leave this for a later PR, and even a later release to make things simpler.

As the pipeline is right now, this will work in IRIDA Next, but will use the IRIDA Next assigned identifiers (like INXT_1234) as the sample name instead of user-provided names (like 08-5578).

nextflow_schema.json

apetkau · 2025-01-15T21:04:15Z

nextflow_schema.json

+                },
+                "prepped_schema": {
+                    "type": "string",
+                    "default": "data/SeqSphere_1521_schema",


This can be for later, but I would recommend a bit of change in logic here for integration with IRIDA Next. Specifically, I'd recommend leaving this to a default of empty and in the pipeline, if empty, using the default data/SeqSphere_1521_schema that comes with the pipeline code.

That way, in IRIDA Next, we can always override it later on if it needs to be updated, but otherwise will default to the database distributed with the pipeline.

nextflow_schema.json

tests/main.nf.test

apetkau · 2025-01-15T21:08:38Z

tests/main.nf.test

+    }
+
+    //--- Test 4
+    test("Stub Run Test") {


Thanks so much for including this test 😄

In general, I recommend not using stub runs for testing, but instead creating very small inputs and databases and running the full pipeline including all underlying dependency software.

However, doing that all requires time. This is a good first step and later on a full test can be added in here.

Yeah, echoing this. Testing the stub run only tests that the stub part works, not that the pipeline works under normal circumstances.

Yeah I thought that it may be a good spot to put it as a test but makes sense not to. I've changed it to actual data

README.md

bin/quast_analyzer.py

conf/modules.config

nextflow_schema.json

emarinier · 2025-01-16T17:19:09Z

tests/main.nf.test

+    }
+
+    //--- Test 4
+    test("Stub Run Test") {


Yeah, echoing this. Testing the stub run only tests that the stub part works, not that the pipeline works under normal circumstances.

workflows/legiovue.nf

DarianHole added 2 commits December 23, 2024 12:30

add in the nf-core template basics

50237f1

add in nf-schema validation

fd31e7e

DarianHole changed the title ~~Add in IRIDA Next updates and work towards best practices~~ WIP: Add in IRIDA Next updates and work towards best practices Dec 23, 2024

DarianHole marked this pull request as draft December 23, 2024 21:25

DarianHole added 20 commits December 27, 2024 15:10

plug in input param fully

83f59f1

add in when statements to modules

9c813b3

adjust nf-core lint tests

fa67f23

remove trailing whitespaces from markdown files

f5c9eb9

add in stubs for stubrun

af187b1

add in ci test profile and data

afeb9cc

move to test only with docker for now

a6c190e

fix up tests

47890aa

adjust test resources, add in files that cant be skipped

5b74fee

update prettier ignore to not fail hopefully

6d0f4a5

initial add in of input tests, adjust test configs

1d0bf8c

add in simple nf-tests, doc updates

ee6a618

run prettier, fix tests to run with docker

da53277

finalize stub test, update version

f30c373

see if nf-tests work now

4353b81

add in irida-next config and outputs

2fc0a3f

testing if adding params works for github ci

830cecf

test resource updates

e5aa75b

fix up test issues

2c79540

adjust plotting to be optparse and using mulled container

848736d

DarianHole changed the base branch from main to dev January 10, 2025 14:31

DarianHole added 5 commits January 13, 2025 10:05

adjust chewbbaca to be found by iridanext

cf6f969

move publishDir to modules.conf, add in nf-prov

b2569a1

add in verion tracking

f8ba970

fix trimmomatic stub test yml

72c8c3b

adjust qc thresholds based on testing

6d20815

DarianHole added 2 commits January 15, 2025 08:58

add more to nf-test stub test

d5d1c1f

run prettier

7f6eaaf

DarianHole marked this pull request as ready for review January 15, 2025 15:01

DarianHole changed the title ~~WIP: Add in IRIDA Next updates and work towards best practices~~ Add in IRIDA Next updates Pipeline Best Practices Jan 15, 2025

DarianHole requested review from apetkau, emarinier and sgsutcliffe January 15, 2025 15:13

fix slight readme typos

76830c1

apetkau requested changes Jan 15, 2025

View reviewed changes

emarinier requested changes Jan 16, 2025

View reviewed changes

DarianHole added 4 commits January 16, 2025 15:00

address comments, hide and add parameters for more user freedom

aa58ba1

add in kraken2 database to tests

5279594

place skip lint tests into right spot

3128e72

fix up test

2bac2da

DarianHole added this to the v0.2.0 milestone Jan 20, 2025

DarianHole added 2 commits January 20, 2025 13:07

adjust some module names, adjust tests

7be3627

fix trailing whitespace issue

cc23da5

DarianHole requested review from apetkau and emarinier January 20, 2025 20:36

emarinier approved these changes Jan 22, 2025

View reviewed changes

DarianHole merged commit 7f26c64 into dev Jan 23, 2025
6 checks passed

DarianHole deleted the add/iridanext-updates branch January 24, 2025 15:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add in IRIDA Next updates Pipeline Best Practices #7

Add in IRIDA Next updates Pipeline Best Practices #7

DarianHole commented Dec 23, 2024 •

edited

Loading

apetkau left a comment

apetkau Jan 15, 2025

apetkau Jan 15, 2025

apetkau Jan 15, 2025

emarinier Jan 16, 2025

DarianHole Jan 20, 2025

emarinier Jan 16, 2025

Add in IRIDA Next updates Pipeline Best Practices #7

Add in IRIDA Next updates Pipeline Best Practices #7

Conversation

DarianHole commented Dec 23, 2024 • edited Loading

Added

Changed

Updated

Issues Addressed

apetkau left a comment

Choose a reason for hiding this comment

apetkau Jan 15, 2025

Choose a reason for hiding this comment

apetkau Jan 15, 2025

Choose a reason for hiding this comment

apetkau Jan 15, 2025

Choose a reason for hiding this comment

emarinier Jan 16, 2025

Choose a reason for hiding this comment

DarianHole Jan 20, 2025

Choose a reason for hiding this comment

emarinier Jan 16, 2025

Choose a reason for hiding this comment

DarianHole commented Dec 23, 2024 •

edited

Loading

`Added`

`Changed`

`Updated`