-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BENCHMARK] dataset comparison igenvar only #201
[BENCHMARK] dataset comparison igenvar only #201
Conversation
Codecov Report
@@ Coverage Diff @@
## master #201 +/- ##
=======================================
Coverage 98.35% 98.35%
=======================================
Files 18 18
Lines 850 850
=======================================
Hits 836 836
Misses 14 14 Continue to review full report at Codecov.
|
test/benchmark/caller_comparison_iGenVar_only/workflow/rules/eval.smk
Outdated
Show resolved
Hide resolved
test/benchmark/caller_comparison_iGenVar_only/workflow/rules/eval.smk
Outdated
Show resolved
Hide resolved
test/benchmark/caller_comparison_iGenVar_only/workflow/scripts/plot_all_results.R
Outdated
Show resolved
Hide resolved
b77c70a
to
4e6971d
Compare
825406f
to
4faafee
Compare
41fbde1
to
24890f5
Compare
This is interesting! It makes sense that pacbio CCS reads are much easier to call SVs from: they have long read lengths and relatively high accuracy. However, it is a bit concerning that just illumina mate-pair reads result in such low accuracy. I'd be curious to see how this looks specifically with something like Deletions, since deletions can be detected somewhat robustly just via read-depth. Maybe if we can find out if there is a specific variant which is bringing the whole curve down, we can better understand what the issue there is. |
Signed-off-by: Lydia Buntrock <[email protected]>
Signed-off-by: Lydia Buntrock <[email protected]>
Signed-off-by: Lydia Buntrock <[email protected]>
… but with DUP as INS Signed-off-by: Lydia Buntrock <[email protected]>
Signed-off-by: Lydia Buntrock <[email protected]>
24890f5
to
9de92e5
Compare
UPDATE: look at the new plots below.
In this plot you can see the results of iGenVar with 2 short read and 3 long read sets and their combinations.
Some of the example BAM files (all HG002) are aligned to different references:
Since the truth set does not contain DUPs, I created another plot where all DUPs are interpreted as INS.
The important question now is, are most sets really that bad, or does iGenVar simply not find SVs. This I want to find out in comparison with other callers.