-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Example stats #96
Comments
Also... As we briefly discussed with Sergey and Johan, having fragment-level stats like in hiclib and other pipelines (e.g. dangling ends, self-circles etc) would be very helpful for troubleshooting failed experiments. |
Let's collect all of the Please, @golobor , @Phlya , @nvictus review the list, prioritize and let's go from there |
I couldn't find these in the referenced posts, but it would also be nice to have: |
I personally think that adding new kinds of stats in principle is more important, and the different saving/printing options can be implemented later. Also, I don't think having optical dups is important (can we really do anything about them when preparing libraries? I doubt it...), but maybe I misunderstand something. |
Also note, that the fragment-level stats would require matching pairs with fragments... But perhaps with both inputs sorted and indexed it won't be very expensive? |
I guess it should be possible to address @gfudenberg 's point (a) quite easily, since these counts are all already present in the output of stats - https://github.com/mirnylab/pairsamtools/issues/68. Although, perhaps, the bins can be optimized a bit to make more smooth curves? But is having plots in the output in the plans? As an html/pdf report with different things, or just a folder with individual pngs/pdfs? Should their generation be part of stats, or a separate job, which just takes the output of stats? |
I know that DCIC was working QC for pairs for a while, the results are
here: https://github.com/4dn-dcic/pairsqc
…On 20 May 2018 at 11:41, Ilya Flyamer ***@***.***> wrote:
I guess it should be possible to address @gfudenberg
<https://github.com/gfudenberg> 's point (a) quite easily, since these
counts are all already present in the output of stats -
mirnylab/pairsamtools#68
<https://github.com/mirnylab/pairsamtools/issues/68>. Although, perhaps,
the bins can be optimized a bit to make more smooth curves?
But is having plots in the output in the plans? As an html/pdf report with
different things, or just a folder with individual pngs/pdfs? Should their
generation be part of stats, or a separate job, which just takes the output
of stats?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#96 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AA3uCgIQk-OYylmYXG6f5S8-V0SQMnghks5t0Y6SgaJpZM4TjL69>
.
|
Yeah, I've seen that and even tried to install once without success. But considering there is |
Hi guys,
Would be great to add some examples of stats for good and bad experiments with explanations which step in the protocol might have failed, and how to understand this. I am currently troubleshooting why my recent Hi-C have not been working well, and with the coded pair type annotation this part of it is a little more complicated than I expected.
Also, more of a pairsamtools issue, but related: W pair type is still called C there in the docs (it's the same thing, right? I seem to remember it mentioned at some point).
The text was updated successfully, but these errors were encountered: