-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Dayhoff encoding of amino acids #689
Conversation
yay!!
|
This PR is proving complicated because it adds a new molecule type which touches basically everything... |
Tests are passing on my machine (details below)
On Travis, I'm seeing a
Should I be worried about this? |
@olgabot I'm also seeing this segfault under Linux, I think there is something awry in the (BTW, awesome PR!) |
The logic of getting an amino acid reside had to get rewritten with the addition of
aww thanks! 😊 |
@olgabot I sent a PR to your branch, https://github.com/czbiohub/sourmash/pull/3 It keeps the logic from the previous translation unchanged, and avoid adding the UPDATE: I'm wrong, sorry. you need to check the residue, oops. I'll fix it. |
Cool now I'm getting a fun error because I added dayhoff conversion to
|
Whoo tests passing on my machine! aka it's not really working yet because it only works on one computer :) FYI it's now possible to run a subset of tests that use a particular fixture using
|
PR is failing checks because the coverage isn't sufficient. I've hunted through what I can to add coverage. If it's not enough, could you let me know which additional functions/methods/classes I should add tests for? |
Arggghh I'm getting an error in the docker image which uses Python 3.7:
But it runs just fine using the latest build, but on Python 3.6.4:
Yet somehow the Travis Build on 3.7 runs just fine? Is there anything clearly wrong in the Dockerfile ? |
Thanks @olgabot, this is amazing! I'll cut a new release with all the merged PRs from this weekend. |
Exciting! I'm also thinking about adding hydrohpobic-polar (HP) encoding
which is even more lossy.
---
Olga Botvinnik, PhD
olgabotvinnik.com <http://www.olgabotvinnik.com>
…On Sun, Aug 25, 2019 at 8:01 PM Luiz Irber ***@***.***> wrote:
Thanks @olgabot <https://github.com/olgabot>, this is amazing! I'll cut a
new release with all the merged PRs from this weekend.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#689>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAGE24A2BWGBNDEIR6G5L73QGNBPHANCNFSM4H4LGXLQ>
.
|
The Dayhoff encoding of amino acids was created by Margaret Oakley Dayhoff, arguably the first bioinformatician, and uses a lossy encoding of amino acid sequences. I'm interested in adding this feature to compare sequences across large evolutionary distances without penalizing minor residue changes.
Dayhoff table from
Peris, P., López, D., & Campos, M. (2008).
IgTM: An algorithm to predict transmembrane domains and topology in
proteins. BMC Bioinformatics, 9(1), 1029–11.
http://doi.org/10.1186/1471-2105-9-367
Original source:
Dayhoff M. O., Schwartz R. M., Orcutt B. C. (1978).
A model of evolutionary change in proteins,
in Atlas of Protein Sequence and Structure,
ed Dayhoff M. O., editor.
(Washington, DC: National Biomedical Research Foundation; ), 345–352.
make test
Did it pass the tests?make coverage
Is the new code covered?without a major version increment. Changing file formats also requires a
major version number increment.
changes were made?
Yes, I'll remove the irrelevant 10x barcode commits..-- rebased and force-pushed