Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error during rMATS analysis in the nextflow #442

Open
Motoufiq opened this issue Oct 23, 2024 · 5 comments
Open

Error during rMATS analysis in the nextflow #442

Motoufiq opened this issue Oct 23, 2024 · 5 comments

Comments

@Motoufiq
Copy link

Motoufiq commented Oct 23, 2024

Hi,

I am running a nextflow pipeline and rMATS is part of the pipeline. All the steps run well except the rMATS step. How should I overcome this errror? See the attached log below for reference.
splicing.log.txt
rmats_pairs.txt

@EricKutschera
Copy link
Contributor

It looks like this is the error:

 Error in `$<-.data.frame`(`*tmp*`, "Event_type", value = "CA") :
    replacement has 1 row, data has 0
  Calls: get_postprocessing_table -> $<- -> $<-.data.frame

That error happens after rMATS completes. However it looks like most of the reads were filtered out by rMATS:

USED: 182296943
NOT_PAIRED: 5149055
NOT_NH_1: 256246142
NOT_EXPECTED_CIGAR: 21947137
NOT_EXPECTED_READ_LENGTH: 0
NOT_EXPECTED_STRAND: 0
EXON_NOT_MATCHED_TO_ANNOTATION: 688568009
JUNCTION_NOT_MATCHED_TO_ANNOTATION: 592092171
CLIPPED: 0
total: 1746299457

Only 10% (182296943/1746299457) of reads were USED, while 73% ((688568009 + 592092171) / 1746299457) were filtered for NOT_MATCHED_TO_ANNOTATION

I'm not familiar with the pipeline you are running, but my guess is that either the alignment step in the pipeline used different reference files than what was given to rMATS, or your reads just don't align well

@Motoufiq
Copy link
Author

@EricKutschera thank you. It was something to do with the setting true or false paired_stats. My samples were unpaired.

@Motoufiq
Copy link
Author

@EricKutschera one question related the parameter setting in the rMATS analysis. If samples are paired then I should --paired-stats is true, if unpaired, then I should set false. Is my understanding correct?

@EricKutschera
Copy link
Contributor

If your samples are paired then you can run with --paired-stats, but it's also fine to run without --paired-stats. The default statistical model can compare any two groups of samples. The paired statistical model can use the pairing information to potentially give better results: #356 (comment)

@Motoufiq
Copy link
Author

@EricKutschera thank you. My samples are unpaired so I will set --paired-stats as false

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants