-
Notifications
You must be signed in to change notification settings - Fork 355
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FeatureRequest: Support for Smoove/Lumpy population SV calling #2652
Comments
Wim; |
One (major) upside of (often) (re-)doing the Smoove&Lumpy population SV calling (from single sample Lumpy VCF) is that is results in a (up-to-date) square SV table for the expanding sets of samples that we work with. Just merging the Smoove&Lumpy batch SV VCF files would results in a non-square (i.e. Swiss cheese) SV table. Like described here also for small variants under batch analysis: We don't have much public 'truth' data that we can use for testing. I am curious about the sensitivity and specificity of the population calling versus all together at once mode. Also it makes more sense to me do this validation in Human, since there probably is more 'truth' data, and the validation has more value for a bigger set of the bcbio users. We managed for now to run Smoove&Lumpy outside of bcbio. |
Hi,
Would it be possible to add Smoove/Lumpy population SV calling to Bcbio?
Bcbio currently support Smoove/Lumpy SV calling for small (n < ~ 40) sets of samples.
https://github.com/bcbio/bcbio-nextgen/blob/2e4c888b4c092572961d30d5f2f5068f7387e043/bcbio/structural/lumpy.py#27
For more than 40 samples the Smoove github documentation recommends to run Smoove in a 2 level map reduce way:
https://github.com/brentp/smoove (section population calling )
This should scale up to thousands of WGrS samples I gather from an issue on the Smoove github page. I hope the sensitivity ans specificity is also still good compared to joint SV variant calling and genotyping.
Thank you.
The text was updated successfully, but these errors were encountered: