Workflow outputs (second preview) #30

bentsherman · 2024-11-04T16:38:22Z

Second iteration based on #27

This PR takes a different approach to workflow outputs by defining a samples target instead of having a target for each tool. The difference is harder to see here since we only publish the fastqc logs for each sample, but it is more clear for nf-core pipelines like fetchngs and rnaseq.

I originally treated each tool as an output target, but then I realized that it makes more sense to have a single output target that joins all results pertaining to a sample. This makes it much easier for a downstream pipeline to consume the outputs, because now instead of joining multiple index files, a downstream pipeline need only take a subset from the one index file.

It also publishes the salmon output, which is not essential, but demonstrates the idea of a "sample" being all results associated with a particular sample id.

The multiqc target could also be called summary, to denote that it contains the all of the summary results.

Signed-off-by: Ben Sherman <[email protected]>

Signed-off-by: Paolo Di Tommaso <[email protected]>

Signed-off-by: Dr Marco Claudio De La Pierre <[email protected]>

Signed-off-by: Ben Sherman <[email protected]>

…f into workflow-output-dsl Signed-off-by: Paolo Di Tommaso <[email protected]>

Signed-off-by: Paolo Di Tommaso <[email protected]>

Signed-off-by: Ben Sherman <[email protected]>

bentsherman · 2024-11-04T16:43:20Z

main.nf

+
+output {
+  samples {
+    path { id, _quant, _fastqc -> "${workflow.outputDir}/${id}" }


this is a bug in the workflow output DSL, it is not resolving the dynamic name against the base output directory

bentsherman · 2024-11-04T16:46:02Z

main.nf

+  samples_ch = RNASEQ.out.quant
+    | join(RNASEQ.out.fastqc)


this change is the important bit -- we are joining the metadata, fastqc logs, and quant results for each sample into a single channel, and publishing that channel

then the path target directive is used to control the output directory structure

Signed-off-by: Ben Sherman <[email protected]>

bentsherman · 2024-11-04T16:52:48Z

The output directory looks like this:

$ find results/ | sort
results/
results/ggal_gut
results/ggal_gut/fastqc
results/ggal_gut/quant
results/index.json
results/multiqc_report.html

And the index file looks like this:

[
    {
        "id": "ggal_gut",
        "quant": "/home/bent/projects/nextflow-io_rnaseq-nf/results/quant",
        "fastqc": "/home/bent/projects/nextflow-io_rnaseq-nf/results/fastqc"
    }
]

bentsherman and others added 12 commits April 12, 2024 08:08

Replace publishDir with workflow output definition

2d653c3

Signed-off-by: Ben Sherman <[email protected]>

wip

0c08ad8

Signed-off-by: Paolo Di Tommaso <[email protected]>

publish path redirection in modules

4ca46a3

Signed-off-by: Dr Marco Claudio De La Pierre <[email protected]>

Move publish redirects to workflows

772d179

Signed-off-by: Ben Sherman <[email protected]>

Merge branch 'workflow-output-dsl' of github.com:nextflow-io/rnaseq-n…

f8957dd

…f into workflow-output-dsl Signed-off-by: Paolo Di Tommaso <[email protected]>

Merge branch 'master' into workflow-output-dsl

f54b6c8

Signed-off-by: Paolo Di Tommaso <[email protected]>

Remove workflow publish redirection

bb44e64

Signed-off-by: Paolo Di Tommaso <[email protected]>

Remove publish from workflow

aeb5525

Signed-off-by: Paolo Di Tommaso <[email protected]>

Add index file

267981b

Signed-off-by: Ben Sherman <[email protected]>

Update to second preview

642dbad

Signed-off-by: Ben Sherman <[email protected]>

Refactor output directory structure

b0c2f87

Signed-off-by: Ben Sherman <[email protected]>

Merge branch 'master' into workflow-outputs-2

e514be5

Signed-off-by: Ben Sherman <[email protected]>

bentsherman commented Nov 4, 2024

View reviewed changes

refactor multiqc output target

50d7f94

Signed-off-by: Ben Sherman <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workflow outputs (second preview) #30

Workflow outputs (second preview) #30

bentsherman commented Nov 4, 2024

bentsherman Nov 4, 2024

bentsherman Nov 4, 2024

bentsherman commented Nov 4, 2024

Workflow outputs (second preview) #30

Are you sure you want to change the base?

Workflow outputs (second preview) #30

Conversation

bentsherman commented Nov 4, 2024

bentsherman Nov 4, 2024

Choose a reason for hiding this comment

bentsherman Nov 4, 2024

Choose a reason for hiding this comment

bentsherman commented Nov 4, 2024