Few questions regarding rMATS #400

tanya-lasagne · 2024-05-07T03:24:18Z

Hello rMATS community,
I'm new to alternative splicing (AS) analysis and seeking guidance on a few topics. Any help would be greatly appreciated!

When running a paired analysis with the --paired-stats flag, should I focus on candidate genes with low p-values? I'm currently comparing soybeans grown under optimal conditions vs. deviated conditions.
Does rMATS calculate the percentage or likelihood of a splicing variant occurring within the specified conditions? Or perhaps provide any metrics for the frequency or abundance of these variants?

Thank you!

EricKutschera · 2024-05-07T19:11:59Z

Selecting events with a low p-value is reasonable. This post has some suggestions about cutoffs: #320

The IncLevel columns (IncLevel1, IncLevel2) are PSI values (Percent Spliced In). The IncLevel is the proportion of the inclusion isoform found for each event and the inclusion isoform is shown in the README: https://github.com/Xinglab/rmats-turbo/tree/v4.3.0?tab=readme-ov-file#output

The columns like IJC_SAMPLE_1 and SJC_SAMPLE_1 give the supporting read counts for each isoform

tanya-lasagne · 2024-05-08T04:22:21Z

Thank you Eric!
Last question, I also noticed that some FDR and p-values are calculated as 0, should I omit those or do you think rMATS is detecting extreme significance in those cases?

EricKutschera · 2024-05-08T13:22:39Z

Zero values for FDR or p-value should be interpreted as very significant. The software has some numerical limits and very small values become zero. Here's a related post: https://groups.google.com/g/rmats-user-group/c/TW534af62fg/m/tZXBs0Y4BAAJ

tanya-lasagne · 2024-05-17T05:38:31Z

Hi Eric,
Thanks again for your assistance! I have a few follow-up questions:

1)When analyzing skipped exon events using rMATS, how can I precisely identify which exon is skipped (e.g., exon 3) within each gene (e.g., gene A)? Are there specific output files or columns that indicate this information?

2)Does rMATS produce gene expression data, such as read counts or FPKM values, for the splicing variants detected? I aim to integrate this data into a gene network inference software to predict gene-to-gene interactions and understand how splicing variants affect gene regulation. If rMATS doesn't provide this information, are there other tools or pipelines you recommend for obtaining splicing variant expression data?

EricKutschera · 2024-05-17T12:51:26Z

In files like SE.MATS.JC.txt the columns exonStart_0base and exonEnd give the coordinates of the exon being skipped:
https://github.com/Xinglab/rmats-turbo/tree/v4.3.0?tab=readme-ov-file#output

rMATS doesn't output gene expression. It outputs the counts of reads that support the inclusion and skipping isoform of each event in columns like IJC_SAMPLE_1. Potentially kallisto could output what you are looking for: https://pachterlab.github.io/kallisto/manual

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Few questions regarding rMATS #400

Few questions regarding rMATS #400

tanya-lasagne commented May 7, 2024

EricKutschera commented May 7, 2024

tanya-lasagne commented May 8, 2024

EricKutschera commented May 8, 2024

tanya-lasagne commented May 17, 2024 •

edited

Loading

EricKutschera commented May 17, 2024

Few questions regarding rMATS #400

Few questions regarding rMATS #400

Comments

tanya-lasagne commented May 7, 2024

EricKutschera commented May 7, 2024

tanya-lasagne commented May 8, 2024

EricKutschera commented May 8, 2024

tanya-lasagne commented May 17, 2024 • edited Loading

EricKutschera commented May 17, 2024

tanya-lasagne commented May 17, 2024 •

edited

Loading