-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Long due nextflowmodules update #76
base: develop
Are you sure you want to change the base?
Conversation
Release/v0.7.1
Release/v0.7.2
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice work @ffmmulder
Some more general comments:
- Double check all
val
input/outputs. - Remove old sge configs
- Check code style, especially for long lines. Also try not to combine short and long arguments in one process, for example
-i
vs--input
. - Double check choice between
tuple(input1, input2)
input and using multiple input channels. Did you make a choice on purpose?
input 1
input 2
BCFtools/1.17/MergeVCF.nf
Outdated
shell = ['/bin/bash', '-euo', 'pipefail'] | ||
|
||
input: | ||
tuple sample_id, file(bcf_file) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Make sure to add val: tuple(val(sample_id), file(bcf_file))
.
BCFtools/1.17/View.nf
Outdated
input: | ||
tuple sample_id, file(bcf_file) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If you add an 'output extension' input/parameter you could in theory also use the same function to convert vcf to bcf.
shell = ['/bin/bash', '-euo', 'pipefail'] | ||
|
||
input: | ||
tuple(val(sample_id), val(rg_id), path(fastq)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rg_id not used?
tag {"BWA_MEM2 BWASW ${sample_id} - ${rg_id}"} | ||
label 'BWA_MEM2_2_2_1' | ||
label 'BWA_MEM2_2_2_1_BWASW' | ||
container = 'blcdsdockerregistry/bwa-mem2_samtools-1.12:2.2.1' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do you need an container with samtools? Maybe just use bwa biocontainer?
tag {"BWA_MEM2 Index $fasta"} | ||
label 'BWA_MEM2_2_2_1' | ||
label 'BWA_MEM2_2_2_1_Index' | ||
container = 'library://blcdsdockerregistry/bwa-mem2_samtools-1.12:2.2.1' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See comment on BWASW.
GATK/4.3.0.0/GenotypeGVCFs.nf
Outdated
tuple(val(run_id), val(interval), path(gvcf), path(gvcftbi), path(interval_file)) | ||
|
||
output: | ||
tuple(val(run_id), val(interval), path("${run_id}.${interval}.${ext_vcf}"),path("${run_id}.${interval}.${ext_index}"),path(interval_file), emit : genotyped_vcfs) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Split on multiple lines?
GATK/4.3.0.0/HaplotypeCaller.nf
Outdated
tuple(val(sample_id), path(bam), path(bai), path(interval_file)) | ||
|
||
output: | ||
tuple(val(sample_id), val(int_tag) ,path("${sample_id}.${int_tag}.${ext_vcf}"), path("${sample_id}.${int_tag}.${ext_index}"), path(interval_file), emit: htcaller_vcfs) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Split on multiple lines?
tuple(val(sample_id), path(bam_file), path(bai_file)) | ||
|
||
output: | ||
tuple (val(sample_id), val(sample_id), path("${sample_id}.RG.bam"), path("${sample_id}.RG.bai"), emit: readgroup_bams) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Double sample_id
tuple(val(sample_id), path(bam_file), path(bai_file)) | ||
|
||
output: | ||
tuple(val(sample_id), val(sample_id), path("${sample_id}.RG.bam"), path("${sample_id}.RG.bai"), emit: readgroup_bams) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Double sample_id
Sambamba/0.8.2/MergeBams.nf
Outdated
tag {"Sambamba MergeBams ${sample_id}"} | ||
label 'Sambamba_0_8_2' | ||
label 'Sambamba_0_8_2_MergeBams' | ||
clusterOptions = workflow.profile == "sge" ? "-l h_vmem=${params.mem}" : "" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See previous comment on this
@ffmmulder What is the status of this PR? |
Added some new tools/modules (such as AGAT, Deeplexicon,edgeR,DESeq2)
Updated module versions for all tools (to be used in v2 of NF-IAP/RNAseq pipelines)
Updated GATK tmpdir usage where not updated yet
Fixed/improved formatting where not updated yet
Other small (bug)fixes