-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
-m and -M options don't make a difference #82
Comments
We pushed a fix for this issue which should correct the behavior of
If you are able to build bowtie from source, please let me know if this commit fixes this issue for you as well. |
This change has been included in our latest version. |
It seems like the new Bowtie release (1.2.3) incorrectly reports the number of "reads processed" in the '-M 1' mode by counting twice the multi-mapping reads (in your example 1000 original reads become reported as 1022 reads processed) and hence all % estimates are also changing. Apart from that reporting bug (or feature), I don't find apparent differences in the actual alignments produced by Bowtie_1.2.3 versus Bowtie_1.2.2 using '-M 1' mode. |
Thank you for picking up on this. It helped me realize that I never committed my complete changes for this issue. With all the changes in place here are the new outputs from the above commands:
I think this should be inline with your expectations. [EDIT: updated output with the results of including --sam-nohead and -S flags] |
There's still one bug that needs fixing: the reads processed for |
Ok I fixed the issue and changed the summary a little bit so that it, in my opinion, makes more sense.
Let me know your thoughts on this. |
Thanks a lot for the quick feedback. The modified reporting looks good to me, except that the suppressed alignments are lost from the SAM file, which may cause problems in some cases. Ideally all input reads should be reported in the SAM file by default. On a different note, probably not worth opening a special issue, I noticed that while older Bowtie versions delete all non-ACGTN bases before sequence alignment, since version 1.2.2 such bases are all converted to "A" during alignment and appear as "A" in the SAM files. This becomes relevant when, for example, one uses Bowtie "off label" to align miRNA datasets from miRBase (containing Us instead of Ts) to newly assembled genomes, resulting in spurious and misleading alignments of modified sequences. It would be good having some documentation on that issue and maybe also implement different rules, e.g. conversion of Us into Ts and other atypical bases into Ns (instead of As) plus giving some warning messages to make people aware of the modifications. |
I have mapped reads with the following commands:
bowtie --phred33-quals -q -n 2 -l 10 --best --strata -y -S -k 1 -m 1 --al aligned.fastq -un unaligned.fastq Reference Query.fq > Mapping.sam
bowtie --phred33-quals -q -n 2 -l 10 --best --strata -y -S -k 1 -M 1 --al aligned.fastq -un unaligned.fastq Reference Query.fq > Mapping.sam
The first command should report only unique mappings while the second command should report also reads that mapped at different locations (but only one of these locations).
However, If I count the number of reads that aligned in the aligned.fastq files, I get the exact same result for both mappings. What is here wrong?
The text was updated successfully, but these errors were encountered: