Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Long due nextflowmodules update #76

Open
wants to merge 10 commits into
base: develop
Choose a base branch
from

Conversation

ffmmulder
Copy link
Contributor

Added some new tools/modules (such as AGAT, Deeplexicon,edgeR,DESeq2)
Updated module versions for all tools (to be used in v2 of NF-IAP/RNAseq pipelines)
Updated GATK tmpdir usage where not updated yet
Fixed/improved formatting where not updated yet
Other small (bug)fixes

@rernst rernst self-requested a review August 14, 2023 09:17
Copy link
Member

@rernst rernst left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work @ffmmulder

Some more general comments:

  • Double check all val input/outputs.
  • Remove old sge configs
  • Check code style, especially for long lines. Also try not to combine short and long arguments in one process, for example -i vs --input.
  • Double check choice between tuple(input1, input2) input and using multiple input channels. Did you make a choice on purpose?
input 1
input 2

shell = ['/bin/bash', '-euo', 'pipefail']

input:
tuple sample_id, file(bcf_file)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Make sure to add val: tuple(val(sample_id), file(bcf_file)).

Comment on lines 9 to 10
input:
tuple sample_id, file(bcf_file)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you add an 'output extension' input/parameter you could in theory also use the same function to convert vcf to bcf.

shell = ['/bin/bash', '-euo', 'pipefail']

input:
tuple(val(sample_id), val(rg_id), path(fastq))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

rg_id not used?

tag {"BWA_MEM2 BWASW ${sample_id} - ${rg_id}"}
label 'BWA_MEM2_2_2_1'
label 'BWA_MEM2_2_2_1_BWASW'
container = 'blcdsdockerregistry/bwa-mem2_samtools-1.12:2.2.1'
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do you need an container with samtools? Maybe just use bwa biocontainer?

tag {"BWA_MEM2 Index $fasta"}
label 'BWA_MEM2_2_2_1'
label 'BWA_MEM2_2_2_1_Index'
container = 'library://blcdsdockerregistry/bwa-mem2_samtools-1.12:2.2.1'
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See comment on BWASW.

tuple(val(run_id), val(interval), path(gvcf), path(gvcftbi), path(interval_file))

output:
tuple(val(run_id), val(interval), path("${run_id}.${interval}.${ext_vcf}"),path("${run_id}.${interval}.${ext_index}"),path(interval_file), emit : genotyped_vcfs)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Split on multiple lines?

tuple(val(sample_id), path(bam), path(bai), path(interval_file))

output:
tuple(val(sample_id), val(int_tag) ,path("${sample_id}.${int_tag}.${ext_vcf}"), path("${sample_id}.${int_tag}.${ext_index}"), path(interval_file), emit: htcaller_vcfs)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Split on multiple lines?

tuple(val(sample_id), path(bam_file), path(bai_file))

output:
tuple (val(sample_id), val(sample_id), path("${sample_id}.RG.bam"), path("${sample_id}.RG.bai"), emit: readgroup_bams)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Double sample_id

tuple(val(sample_id), path(bam_file), path(bai_file))

output:
tuple(val(sample_id), val(sample_id), path("${sample_id}.RG.bam"), path("${sample_id}.RG.bai"), emit: readgroup_bams)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Double sample_id

tag {"Sambamba MergeBams ${sample_id}"}
label 'Sambamba_0_8_2'
label 'Sambamba_0_8_2_MergeBams'
clusterOptions = workflow.profile == "sge" ? "-l h_vmem=${params.mem}" : ""
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See previous comment on this

@rernst
Copy link
Member

rernst commented Oct 25, 2024

@ffmmulder What is the status of this PR?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants