Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Calculate MD tag #8

Open
holtgrewe opened this issue Mar 23, 2017 · 5 comments
Open

Calculate MD tag #8

holtgrewe opened this issue Mar 23, 2017 · 5 comments

Comments

@holtgrewe
Copy link

It would be useful to implement calculation of the MD tag just like BWA does it. This allows to perform simple variant calling and estimating error rates from the BAM file alone without a reference.

@ocxtal
Copy link
Owner

ocxtal commented Mar 26, 2017

Sorry for my late reply😥

I'm currently working on adding several sam tags, including MD and SA (supplementary alignments). The code still has some bugs, so please wait for a few days😭.

(I expect I can finish up the tests for this branch by the middle of this week...)

Thanks,

Hajime Suzuki

@holtgrewe
Copy link
Author

Sounds on the na12787 sequel data set on DNA nexus it seems that minialign produces shorter aligned fragments than ngm lr and bwa pacbio. Do you have an idea on possible reasons?

@ocxtal
Copy link
Owner

ocxtal commented Mar 27, 2017

It is confirmed that minialign tends to split alignment over short indels. If such indel resides around the end of a read, the shorter one might be removed out from the resulting alignments. It is the most possible situation.

I want to examine the situation in my hand, could you tell me the sample url and the parameters? (Is this? http://www.pacb.com/blog/identifying-structural-variants-na12878-low-fold-coverage-sequencing-pacbio-sequel-system/ )

Thanks,

Hajime Suzuki

@ocxtal
Copy link
Owner

ocxtal commented Mar 27, 2017

Though it is my personal impression, I think the NGM-LR currently performs the best on detecting indels ranging from 20 to 100 bases. It seems the performance of minialign is between NGM-LR and BWA-MEM, but much closer to the latter. I am now struggling to rescue such alignments with an additional supplementary alignment collection algorithm. It is different from the NGM-LR's convex gap-penalty function approach but I'm sure it will also works well on SV detection.

@ocxtal
Copy link
Owner

ocxtal commented Apr 3, 2017

I've just added MD tag option in the release 0.5.0 (enabled with -TMD flag; passing -TNM,MD result in equivalent output with samtools calmd). Please try it out😄
https://github.com/ocxtal/minialign/tree/minialign-0.5.0

Hajime Suzuki

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants