Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Run Mapping alone for RNASeq data using bcbio #3239

Closed
Fazulur opened this issue May 27, 2020 · 4 comments
Closed

Run Mapping alone for RNASeq data using bcbio #3239

Fazulur opened this issue May 27, 2020 · 4 comments

Comments

@Fazulur
Copy link

Fazulur commented May 27, 2020

Dear Bcbio team,

I found the below issue related to run only alignment which works with wgs variantcalling pipeline.
#1030

But i didn't found the same for RNASeq data. Could you please let me know what changes to be done to yaml template in order to run only alignment & skip further steps like QC, generating counts using salmon & featureCounts etc for RNA_Seq data.

Thanks In Advance
Fazulur Rehaman

@roryk
Copy link
Collaborator

roryk commented May 27, 2020

Hi Fazulur,

We don't currently have an option to just run the alignments, but we could add it. What's your use case so I can think about if there is a better way to go about it?

@Fazulur
Copy link
Author

Fazulur commented May 28, 2020

Hi @roryk,

Thanks for your quick response.

Sometimes, We will be receiving data for same sample in batches from different flowcells. We are merging fastq from same flowcell using bcbio prepare_samples.py program.

We would like to do only alignment If we get new fastq files for same sampleid in next batch (instead of running enitrie bcbio RNA_Seq pipeline on new data), then merge previous bam with current bam file using samtools merge and submit the merged bam to bcbio RNA_Seq pipeline by modifying aligner: false in yaml config.

Thanks In Advance
Fazulur Rehaman

@naumenko-sa naumenko-sa mentioned this issue Jun 3, 2020
90 tasks
@naumenko-sa
Copy link
Contributor

Hi @Fazulur !

I have added your request to the bcbio priorities list.
Unfortunately, it seems it won't be addressed soon, sorry,
unless you want to contribute and test the solution yourself.

In the meanwhile, I'd warn you against creating hidden batch effects in your RNA-seq data.
See "Design aspects of RNA­seq" in https://peerj.com/preprints/27283/ and in the other sources.

Sergey

@Fazulur
Copy link
Author

Fazulur commented Jun 3, 2020

Hi @naumenko-sa,

Thanks a lot for update on my post and thanks for article on batch effects.

Thanks & Regards
Fazulur Rehaman

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants