forked from recluze/deepseq
Showing 1 changed file with 5 additions and 0 deletions.
@@ -2,12 +2,17 @@

Accurate annotation of protein functions is important for a profound understanding of molecular biology. A large number of proteins remain uncharacterized because supporting information is sparse. For many uncharacterized proteins, the only information available is the amino acid sequence. In this paper, we propose DeepSeq, a deep learning architecture that uses only protein sequence information to predict a protein's associated functions. The prediction process does not require handcrafted features; rather, the architecture automatically extracts representations from the input sequence data. Our experiments with DeepSeq indicate significant improvements in prediction accuracy compared with other sequence-based methods. Our deep learning model achieves an overall validation accuracy of 86.72%, with an F1 score of 71.13%. Moreover, using the automatically learned features and without any changes to DeepSeq, we successfully solved a different problem, namely protein function localization, with no human intervention. Finally, we discuss how the same architecture can be used to solve even more complicated problems, such as prediction of 2D and 3D structure and of protein-protein interactions.

[Figure: Deep Learning for Protein Function Prediction]
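
The abstract says the model works from the raw amino acid sequence alone, with no handcrafted features. As a rough illustration of what "sequence as input" can look like, the sketch below maps each residue to an integer index and pads to a fixed length. This README does not describe DeepSeq's actual preprocessing, so the function name `encode_sequence`, the reserved `PAD` and `UNKNOWN` indices, and the padding scheme are all assumptions made for this example.

```python
# Illustrative only: not taken from the DeepSeq code base.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter residue codes
PAD, UNKNOWN = 0, 1                   # hypothetical reserved indices

# Residue letter -> integer index, starting after the reserved indices.
INDEX = {aa: i + 2 for i, aa in enumerate(AMINO_ACIDS)}


def encode_sequence(sequence, max_len):
    """Map residues to integer indices, then truncate or right-pad to max_len."""
    ids = [INDEX.get(residue, UNKNOWN) for residue in sequence.upper()]
    ids = ids[:max_len]
    return ids + [PAD] * (max_len - len(ids))
```

Integer-encoded sequences of this kind are a common input to an embedding layer, which then learns a representation per residue. Whether DeepSeq uses this exact encoding is not stated here.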
# Authors:

- Nauman ([email protected], [email protected], recluze.wordpress.com): send queries about ML here.
- Hafeez ur Rehman ([email protected]): send queries about bioinformatics here.

A preprint of the related publication is available here: http://www.biorxiv.org/content/early/2017/07/25/168120

[Figure: Domain localization]

# Important points:
- Requires Python 2.7
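
Since the code targets Python 2.7, a small guard can report a mismatched interpreter before anything else runs. The helper below is a hypothetical sketch and is not part of the repository; `REQUIRED` and `is_supported` are names chosen for this example.

```python
import sys

REQUIRED = (2, 7)  # taken from the note above; the check itself is an assumption


def is_supported(version_info=None):
    """Return True when the interpreter's (major, minor) version equals REQUIRED."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) == REQUIRED
```

A script could call `is_supported()` at startup and exit with a clear message when it returns `False`.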