CRAM support #619
Comments
My presumption was only partly correct. Anything that uses |
I just read part of the motivation for leaving this out of the .crai file: it would slow down indexing. Maybe we can argue that this information is quite relevant for speeding up other downstream computations? Alternatively, we could implement a two-pass process where we first count reads, then compute scaling factors, and then apply them. But this would require quite some work. |
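For illustration, the two-pass idea could look roughly like the sketch below. It is not deepTools code: reads are plain `(contig, position, is_mapped)` tuples rather than htslib alignments, all function names are hypothetical, and scaling every sample to a fixed target depth is just one assumed scheme for the scaling factor.

```python
from typing import Callable, Dict, Iterable, Tuple

# Stand-in for an alignment record: (contig, start position, is_mapped).
# Real code would stream alignments from the CRAM/BAM file instead.
Read = Tuple[str, int, bool]


def count_mapped(reads: Iterable[Read]) -> int:
    """Pass 1: stream through the file once and count mapped reads."""
    return sum(1 for _contig, _pos, mapped in reads if mapped)


def scale_factor(mapped: int, target: float = 1_000_000) -> float:
    """Factor that brings a sample to `target` mapped reads (assumed scheme)."""
    if mapped == 0:
        raise ValueError("no mapped reads; cannot compute a scaling factor")
    return target / mapped


def scaled_coverage(
    reads: Iterable[Read], factor: float, bin_size: int = 50
) -> Dict[Tuple[str, int], float]:
    """Pass 2: bin mapped reads by start position, weighting each by `factor`."""
    bins: Dict[Tuple[str, int], float] = {}
    for contig, pos, mapped in reads:
        if not mapped:
            continue  # unmapped reads do not contribute to coverage
        key = (contig, pos // bin_size)
        bins[key] = bins.get(key, 0.0) + factor
    return bins


def two_pass(
    open_reads: Callable[[], Iterable[Read]],
    target: float = 1_000_000,
    bin_size: int = 50,
) -> Dict[Tuple[str, int], float]:
    """Open the input twice: once to count, once to compute scaled coverage."""
    factor = scale_factor(count_mapped(open_reads()), target)
    return scaled_coverage(open_reads(), factor, bin_size)
```

The cost is that the input has to be read twice, which is exactly what a mapped-read count stored in the index would avoid.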
I don't foresee a change to the CRAM index format at this point; that ship has sailed. I suspect we'll need to check |
Xref: samtools/hts-specs#137 So maybe some day we won't need two passes. |
For what it's worth, I need to modify pysam for this, since we need to be able to fetch unmapped reads at the end of the file. |
Support for fetching reads with no contig has just been added to pysam, so once the next release is out we can finally start supporting CRAM files (I still need to rewrite a bunch of stuff to actually support this, but it's at least possible now). |
This is mostly implemented, but annoying to test since all of the nose tests need to be run in the |
I've gotten around needing a fasta file by not decoding the sequence (I made a PR to add that to pysam), so now testing can be done in any directory. However, it turns out that htslib has a bug wherein not decoding some fields in a CRAM file can (in certain cases) affect whether a given alignment is returned by |
Relatedly, if the htslib issue is resolved and new versions of it and pysam are released, then deepTools can support CRAM in Galaxy as well as on the command line, where I expect users to be able to handle decoding issues themselves. |
With htslib 1.7 the issues there should be resolved. Once a new pysam is released the CRAM branch can be merged. |
pysam 0.14 is now in bioconda, so this should be mergeable. I've made not decoding read sequences the default, so it should be possible to support CRAM files in Galaxy. |
This is now merged into |
I presume that this already works, but it should be briefly tested and then mentioned in the documentation.