CH #101

evanbiederstedt · 2019-04-02T22:54:46Z

Clonal Hematopoiesis, but it's too tricky to spell. :)

Here's how we are going to do this:

Rough idea: we swap the TN labels, do unmatched variant calling on the normal, then genotype the tumor. Some CH mutations will be present in the tumor because of blood contamination and unmatched calling ensures we don’t miss those

Once there are analysis-ready TN pairs, we execute CH as follows:

CH is running Mutect2 and Strelka2, but with the normal-tumor labels switched.

For filtering artifacts/false positives (from Clinical Bioinformatics):

gnomAD
genotyping values from panel of blood samples: 300 CH-free and young patients
more of a work in progress, filtering SNPs with LOH in tumor using FACETS data (this will only be important for high VAF variants in blood

We will ask them for input, and possible they could share code.

kpjonsson · 2019-04-09T15:35:36Z

This will add a lot of runtime due to Mutect, so the question is if we think only using Strelka2 would suffice. Might require some investigation.

evanbiederstedt · 2019-04-09T15:46:45Z

Might require some investigation.

Precisely. Once implemented, we can investigate

evanbiederstedt · 2019-04-19T22:18:21Z

I think this should be the last thing we try.

I don't think CH has yet been really defined for WES/WGS data here, and would require a good deal of iteration to find the optimal solution.

--- There are some concerns (based on anecdotes from Ryan Ptashkin) that MuTect2 and Strelka2 filter out low VAF calls, which would be necessary for CH works. "Since the CH variants tend to live at the lower end of VAF range (increasing with Tx and /or age) I’d be concerned about high FDR at lower VAFs with Mutect2, but that is just limited data that i have seen". That's possible. One option (mentioned by Ryan) was to run Vardict! Many for benchmarking, but I think that's a bit of a bad idea. I'd rather entertain LoFreq2: https://github.com/CSB5/lofreq

There's also a question about whether we should take the union of these calls, or the intersection?

--- It appears no one has tried TN paired variant calling for this analysis. (I guess I misunderstood.) Calling against a matched tumor is bad, as there are plenty of tumor samples that have blood/lymphocyte infiltration within, i.e. contamination.

--- The best approach would be to call blood normal vs. a curated pooled normal of young patients without any hematological malignancies. We do not yet have a curated "normal" though. In order to create this, "use data from the youngest patients that you have, but check that they didnt have an active heme malignancy at time of sequencing and could even genotype for the most common blood mutations to exclude samples with obvious somatic mutations in blood".

I'm guessing what we'll end up doing here for the first official release is try to generate the analysis outputs necessary in order to converge on the best solution.

evanbiederstedt · 2019-04-23T05:41:26Z

Update: @ahmetz is working on a PoN for WES data

evanbiederstedt · 2019-04-25T15:51:23Z

I would try this caller:
https://hub.docker.com/r/lethalfang/lofreq

There's even a Dockerfile here: https://hub.docker.com/r/seandavi/lofreq/dockerfile

Vardict is going to be a pain, but it's possible: https://hub.docker.com/r/marghoob/vardict

evanbiederstedt added the enhancement New feature or request label Apr 2, 2019

evanbiederstedt added the backburner probably won't address in a near future label Apr 19, 2019

evanbiederstedt assigned taylorb-msk Apr 19, 2019

This was referenced Apr 19, 2019

PoN for somatic SNVs/indels? #126

Open

Creat PoN for CH #170

Open

evanbiederstedt added the postRelease label Aug 14, 2019

gongyixiao removed the postRelease label Dec 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CH #101

CH #101

evanbiederstedt commented Apr 2, 2019 •

edited

Loading

kpjonsson commented Apr 9, 2019

evanbiederstedt commented Apr 9, 2019

evanbiederstedt commented Apr 19, 2019

evanbiederstedt commented Apr 23, 2019

evanbiederstedt commented Apr 25, 2019

CH #101

CH #101

Comments

evanbiederstedt commented Apr 2, 2019 • edited Loading

kpjonsson commented Apr 9, 2019

evanbiederstedt commented Apr 9, 2019

evanbiederstedt commented Apr 19, 2019

evanbiederstedt commented Apr 23, 2019

evanbiederstedt commented Apr 25, 2019

evanbiederstedt commented Apr 2, 2019 •

edited

Loading