Output read counts file? #65

imillercrews · 2021-01-27T17:45:49Z

I noticed that if using multiple bam files it generates a table '_data_per_samples.txt' of TPM values for each gene/transcript for each sample. In order to replace FeatureCounts in a workflow, was wondering if it be useful to have a similar table of read counts generated?

That way you could use the TPM table to identify genes/transcripts that had low TPM across samples (e.g. you only want to analyze genes that had a TPM of 2 or greater in 75% of your samples), filter out those genes from the read counts table and then go onto some differential gene analysis directly. Wouldn't that save a step of having to merge '_gene.out' files across each sample if you wanted read counts?

Maybe a simple table such as:

Gene_Id
Chr
sample1_Count_Reads
sample2_Count_Reads
sample3_Count_Reads

r78v10a07 added the question Further information is requested label Feb 6, 2021

r78v10a07 self-assigned this Feb 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output read counts file? #65

Output read counts file? #65

imillercrews commented Jan 27, 2021

Output read counts file? #65

Output read counts file? #65

Comments

imillercrews commented Jan 27, 2021