-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Start codon handling in 1.7.x #266
Comments
@nerdstrike have any code or examples we can test around? This generally helps 'speed' things along. |
Output from 1.2.3, 1.6.924,
Output from 1.7.2,
It's a niche problem, but occurs if you're fishing for start codons and don't know you're in a masked region or out of phase. Also applies to NTG. It could be, yes, but the answer should be "maybe" at best. |
|
Yep. I’ve seen all of those, including CTG. I’m sure you have as well ;-)
On Feb 15, 2018, at 1:00 AM, Torsten Seemann <[email protected]<mailto:[email protected]>> wrote:
NTG = maybe
p=0.25 for eukaryotes
p>0.25 for bacteria
:-)
|
Not just bacteria http://www.jbc.org/content/285/7/4595.full Curious about the incidence of CTG in bacteria with experimental confirmation. Citation? |
Been a while but I think something from either Mycobacterium or Streptomyces.
Not just bacteria
http://www.jbc.org/content/285/7/4595.full
http://genome.cshlp.org/content/28/1/25.full
Curious about the incidence of CTG in bacteria with experimental confirmation. Citation?
|
Whoops. Didn't realize Github would let me edit your reply when I meant to edit mine. My colleague told me that CTG starts are known in phage T4. |
I suppose I should clarify that my issue arises on the default codon table, but could nonetheless arise on alternate tables too. |
…dons (#266) In the case of ambiguous codons this method should only return true if all possible codons are start codons (same as is_ter_codon). This was the behaviour in BioPerl 1.2.3 but since at least BioPerl 1.6.924 it returns true if at least one codon is a start codon. This change fixes that regression and makes behaviour consistent with is_ter_codon.
The following fix was applied to 1.7 releases to deal with NNN mapping to a stop codon.
260ebb9
Ensembl also encounters unexpected behaviour around the handling of start codons, where Bio::Tools::CodonTable::is_start_codon('NNN') returns true. The cause is the same, i.e. the inversion of default behaviour, and so the solution should be pretty similar I hope!
I believe this also applies to 'NN' and other non-triplet equivalents too.
The text was updated successfully, but these errors were encountered: