-
Notifications
You must be signed in to change notification settings - Fork 127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ERROR ~ Error executing process > 'NFCORE_SMRNASEQ:SMRNASEQ:CONTAMINANT_FILTER:MAP_RRNA (1)' #295
Comments
Ok to add to the original post, after I removed contamination filtering (because when I first removed just the rRNA, the tRNA step also failed in the same way) I got this error: [nf-core/smrnaseq] Pipeline completed with errors- Caused by: Command executed: seqcluster collapse -f CD_SPZ_REP2.fastp.fastq.gz -m 1 --min_size 15 -o collapsed cat <<-END_VERSIONS > versions.yml Command exit status: Command output: Command error: Work dir: Tip: you can try to figure out what's wrong by changing to the process work dir and showing the script file named -- Check '.nextflow.log' file for details The command was the same as before just without: |
do you think you could share that file(CD_SPZ_REP2.fastp.fastq.gz) with me? is it failing for all the samples or only that one? |
Hi, yes I will share the file with you, but upon looking at it, because I ran the command a few times, I have a few files with the same name. Some of them are like ~300Mb and look normal and some are few. There are also same files for other samples. Also I think it gives the same error with different files when you run it a few times. I will delete all the temp files, and run the command only once and share the file that is mentioned in the error. |
OK so I reran the command, turns out the other files are just links. The command now fails on different sample: Caused by: Command executed: seqcluster collapse -f CD_SPZ_REP1.fastp.fastq.gz -m 1 --min_size 15 -o collapsed cat <<-END_VERSIONS > versions.yml Command exit status: Command output: Command error: Work dir: Tip: view the complete command output by changing to the process work dir and entering the command -- Check '.nextflow.log' file for details This is the link to the file CD_SPZ_REP1.fastp.fastq.gz which is now mentioned in the error log, on google drive: |
Can the problem be the fact that the data has Phred64 quality scores? |
The original error regarding CONTAMINANT_FILTER is due the script is looking for a bowtie1 index and the module smrnaseq/modules/local/bowtie_contaminants.nf have built bowtie2 indexes. I have changed the line 25 of smrnaseq/modules/local/bowtie_map_contaminants.nf And also deleted the line 33 ( ${args} \), because it is an empty variable that is causing conflict with the output writing. Hope that helps. |
Thank you for looking into my problem, however after editing the bowtie_map_contaminants.nf file as you suggested, i get a new error: Caused by: Command executed: INDEX=find -L ./ -name "*.3.bt2" | sed 's/.3.bt2//' extracting number of reads from bowtie logsawk -v type=rRNA 'BEGIN{tot=0} {if(NR==4 || NR == 5){tot += $1}} END {print """type"": "tot }' CD_SPZ_REP2.contaminant_bowtie.log | tr -d , > filtered.CD_SPZ_REP2_rRNA.stats cat <<-END_VERSIONS > versions.yml Command exit status: Command output: Command error: Work dir: Tip: view the complete command output by changing to the process work dir and entering the command -- Check '.nextflow.log' file for details It seams that the program now thinks -L is a command and not a parameter of find function? |
upon inspection i saw that there were quotation marks missing so the line 25 should be: -[nf-core/smrnaseq] Pipeline completed with errors- Caused by: Command executed: INDEX= extracting number of reads from bowtie logsawk -v type=rRNA 'BEGIN{tot=0} {if(NR==4 || NR == 5){tot += $1}} END {print """type"": "tot }' CD_SPZ_REP2.contaminant_bowtie.log | tr -d , > filtered.CD_SPZ_REP2_rRNA.stats cat <<-END_VERSIONS > versions.yml Command exit status: Command output: Work dir: Tip: view the complete command output by changing to the process work dir and entering the command -- Check '.nextflow.log' file for details |
There is also a backtick missing, try this: In fact, there are some other modifications that can be made. I follow this pull request (#294) and fixed the contamination_filter step. |
See #303 for hopefully permanent fixes now |
Description of the bug
I am trying to run the smRNAseq pipeline but keep getting error during mapping rRNA fasta file for contaminant filtering
Command used and terminal output
Relevant files
nextflow.log
System information
Nextflow version: version 23.10.0
Hardware: Desktop
Executor: local
Container engine: docker
Version of nf-core/smrnaseq: v2.2.3
The text was updated successfully, but these errors were encountered: