Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Output read counts file? #65

Open
imillercrews opened this issue Jan 27, 2021 · 0 comments
Open

Output read counts file? #65

imillercrews opened this issue Jan 27, 2021 · 0 comments
Assignees
Labels
question Further information is requested

Comments

@imillercrews
Copy link

I noticed that if using multiple bam files it generates a table '_data_per_samples.txt' of TPM values for each gene/transcript for each sample. In order to replace FeatureCounts in a workflow, was wondering if it be useful to have a similar table of read counts generated?

That way you could use the TPM table to identify genes/transcripts that had low TPM across samples (e.g. you only want to analyze genes that had a TPM of 2 or greater in 75% of your samples), filter out those genes from the read counts table and then go onto some differential gene analysis directly. Wouldn't that save a step of having to merge '_gene.out' files across each sample if you wanted read counts?

Maybe a simple table such as:

Gene_Id
Chr
sample1_Count_Reads
sample2_Count_Reads
sample3_Count_Reads

@r78v10a07 r78v10a07 added the question Further information is requested label Feb 6, 2021
@r78v10a07 r78v10a07 self-assigned this Feb 6, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants