-
Notifications
You must be signed in to change notification settings - Fork 127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MIRTOP_STATS IndexError #477
Comments
Hi! Thank you for reporting this bug. I think I know where the issue is and I am working on a solution. Just to confirm, could you please run the same command but without the |
Hi @atrigila ! I run the command as you asked and the pipeline completed successfully, but Mirtrace and Mirtop didn't run |
The mirtop step requires both the
smrnaseq/subworkflows/local/mirna_quant.nf Lines 98 to 105 in 5901bea
This same behavior can be reproduced in older versions of the pipeline ( I will update documentation to clearly state that |
@anastasiaprime the issue should be solved now, but let us know if you have any additional questions. Just take into account that when using MirgeneDB inputs |
Hello, happy holidays to all!! I'm going through the same problem; however, I'm not using the mirGeneDB information, I'm using the flag I'm using the smrnaseq v2.4.0 and nextflow version 24.10.1 build 5930. Command line: The error returned by the pipeline:
|
Hi @epfarias , I think I was able to replicate your error using public data: Error:
The error is not exactly the same. We will have a look into this and add any updates in this ticket. |
@epfarias please try running the latest dev version to see if you still encounter the error: |
Hi @atrigila, thank you for the agility in the response, but unfortunately, the error remains. I've checked the data to see if it has an issue, but it is ok. |
I opened a new branch with a small change in the singularity container. This does not raise the error anymore for me with public data:
|
Hey Anabella, Happy New Year!! So, I tested the first option and the pipeline worked completely. However, when I used my data the error happened again, I also inserted the
Error report :
|
Hi, thank you for testing the suggested options.
|
Catching on this as well. Can you share the top lines of SRR7419066_mirtop.gff? |
Hey @lpantano , I cleaned my work directory, right after the message from @atrigila , because I was focused on understanding the data, once the problem could be with my samples. I reran the pipeline with the same samples to show you the .gff file. I've attached the print of the view from nano with the sample SRR7419086. The command as follows was used, to reproduce the error I've used an amount of 8 samples from the original 360 samples.
I do believe there is some issue with the data I'm using, maybe a problem with the barcode, because this data was initially multiplexed to do the sequencing, and the researchers uploaded the data demultiplexed without the barcode information or other additional information. I don't know, still trying to figure out what could have happened. It follows the screenshot right before the error: |
ok, thanks. I see the GFF is empty so there is a problem at this step: |
So, the the The .log files are also attached. |
the content of the run.log shows there is no miRNA sequence detected. Can you run this command on the bam file:
|
Unfortunately, another error, does not find/load the index for this bam file. It is my first time working with demultiplexed data, and I don't know what to do to solve this issue to preprocess this data. I've read some articles about what to do with multiplexed/demultiplexed data, and they said the same thing: it is necessary the barcode information associated with the multiplexing process. However, this information was not released by the data publisher. They described very superficially the cDNA library construction and the alignment process; I looked at the data structure to see if something was wrong or abnormal, except the high amount of polyA tail but the fastp must deal with this, and I did not identify anything. Follows the screenshot of the fastq file for the SRR7419086 sample, with the first 12 reads. If you could help me, and give me some insights into what it could be, I'd be very happy. |
Thanks for the extra information. You can do As well, you can do a grep of a common miRNA, for instance, let7-c or let7-b, with the raw fastq and the trimmed fastq to see where the mature sequence of the miRNA falls in the reads you have when you start and after trimming. |
Description of the bug
Hello!
I'm trying to process my small rnaseq data using only R1 reads and always get the same error. What could it be? nextflow.log
I use smrnaseq v2.4.0, nextflow version 24.04.4
I tried dev version, but had the same error.
Command line:
nextflow run nf-core/smrnaseq -profile docker --input samplesheet_1.csv --outdir Results_R1_test --fasta /mnt/cephfs8_rw/oncology/refseqs/Homo_sapiens/NCBI/GRCh38/Sequence/WholeGenomeFasta/genome.fa --mirgenedb true --mirgenedb_species Hsa --mirgenedb_mature /mnt/cephfs8_rw/oncology/refseqs/Homo_sapiens/miRNA/hsa.fas --mirgenedb_hairpin /mnt/cephfs8_rw/oncology/refseqs/Homo_sapiens/miRNA/hsa-pre.fas --mirgenedb_gff /mnt/cephfs8_rw/oncology/refseqs/Homo_sapiens/miRNA/hsa.gff --mirtrace_species hsa -c config
Config only for resources (max_cpus, max_memory)
Error:
`ERROR ~ Error executing process > 'NFCORE_SMRNASEQ:MIRNA_QUANT:BAM_STATS_MIRNA_MIRTOP:MIRTOP_STATS (770902000404_S5)'
Caused by:
Process
NFCORE_SMRNASEQ:MIRNA_QUANT:BAM_STATS_MIRNA_MIRTOP:MIRTOP_STATS (770902000404_S5)
terminated with an error exit status (1)Command executed:
mirtop
stats
--out stats
770902000404_S5_mirtop.gff
cat <<-END_VERSIONS > versions.yml$(echo $ (mirtop --version 2>&1) | sed 's/^.*mirtop //')
"NFCORE_SMRNASEQ:MIRNA_QUANT:BAM_STATS_MIRNA_MIRTOP:MIRTOP_STATS":
mirtop:
END_VERSIONS
Command exit status:
1
Command output:
['stats', '--out', 'stats', '770902000404_S5_mirtop.gff']
Command error:
WARNING: Your kernel does not support swap limit capabilities or the cgroup is not mounted. Memory limited without swap.
/opt/conda/lib/python3.12/site-packages/Bio/pairwise2.py:278: BiopythonDeprecationWarning: Bio.pairwise2 has been deprecated, and we intend to remove it in a
future release of Biopython. As an alternative, please consider using Bio.Align.PairwiseAligner as a replacement, and contact the Biopython developers if you
still need the Bio.pairwise2 module.
warnings.warn(
10/15/2024 09:29:02 INFO Run stats.
10/15/2024 09:29:02 INFO Reading: 770902000404_S5_mirtop.gff
Traceback (most recent call last):
File "/opt/conda/bin/mirtop", line 10, in
sys.exit(main())
^^^^^^
File "/opt/conda/lib/python3.12/site-packages/mirtop/command_line.py", line 34, in main
['stats', '--out', 'stats', '770902000404_S5_mirtop.gff']
stats(kwargs["args"])
File "/opt/conda/lib/python3.12/site-packages/mirtop/gff/stats.py", line 38, in stats
out.append(_calc_stats(fn))
^^^^^^^^^^^^^^^
File "/opt/conda/lib/python3.12/site-packages/mirtop/gff/stats.py", line 82, in _calc_stats
df = _summary(lines)
^^^^^^^^^^^^^^^
File "/opt/conda/lib/python3.12/site-packages/mirtop/gff/stats.py", line 130, in _summary
df_sum = _add_missing(df_sum)
^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/lib/python3.12/site-packages/mirtop/gff/stats.py", line 110, in _add_missing
df2 = pd.DataFrame({'category': category, 'sample': df['sample'].iat[0], 'counts': 0}, index=[0])
~~~~~~~~~~~~~~~~^^^
File "/opt/conda/lib/python3.12/site-packages/pandas/core/indexing.py", line 2527, in getitem
return self.obj._get_value(*key, takeable=self._takeable)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/lib/python3.12/site-packages/pandas/core/series.py", line 1234, in _get_value
return self._values[label]
~~~~~~~~~~~~^^^^^^^
IndexError: index 0 is out of bounds for axis 0 with size 0
Work dir:
/mnt/cephfs8_rw/oncology/miRNA/220118_VH00195_67_AAAHV32M5_fastq4/work/39/2e514730d2d4ad416afc2023156668
Tip: you can replicate the issue by changing to the process work dir and entering the command
bash .command.run
-- Check '.nextflow.log' file for details
ERROR ~ Pipeline failed. Please refer to troubleshooting docs: https://nf-co.re/docs/usage/troubleshooting
-- Check '.nextflow.log' file for details
`
Command used and terminal output
No response
Relevant files
No response
System information
No response
The text was updated successfully, but these errors were encountered: