Code for the Cell fitness is an omniphenotype
paper.
Code to generate data for each figure was written in python, R, Perl, PHP, or MySQL. The underlying data can be accessed as described in the methods.
-
figure3A.sql has multiple MySQL commands that are needed to generate the data shown in Figure 3A.
-
Any gene cluster can be input into the figure3B.py script to yield the data that can be converted into the volcano plot as shown in Figure 3B.
-
Figure 3B statistics (enrichment analysis) was performed using SciPy. The code is available via here
- For the common disease restricted analysis in Figure S1, preceeding this analysis, we dumped all of PubMed into a MySQL database table, which we then queried for the co-occurrence of
{gene} AND {condition}
.
- Guzzle, which is a PHP library to make HTTP requests, is required to run Figure 5 code.