Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Team TaoFuFa #5

Open
wants to merge 77 commits into
base: master
Choose a base branch
from
Open

Team TaoFuFa #5

wants to merge 77 commits into from

Conversation

Skyquek
Copy link

@Skyquek Skyquek commented Apr 14, 2020

In this project, we use genetic algorithm and deep learning to discover existing or new drugs that could bind to COVID-19 main protease (6LU7) and has a logP value that is lower than 5.

@Skyquek
Copy link
Author

Skyquek commented Apr 14, 2020

Conversion

How to use

* You may want to finish reading the repo before starting.

Initial generation

  1. Run 'Initial Network.ipynb' to get generation 0 SMILES.

For every generation (including initial)

  1. Run 'Evaluation and Refinement-localGA.ipynb'

Sharding the sdf files

  1. In PyRx, load the sdf files and minimize the molecules, then export it to pdbqt file format.
  2. Copy out the .pdbqt files to a seperate folder and run the sharding script (/scripts/folderSplitter/shard.ps1).
  3. Copy everything from the /scripts/binding folder to each generated folder.
  4. The folder is ready for distribution.

Computing the binding affinity

  1. For each folder, run PowerShell in it and run the /scripts/binding/binding.ps1 file.
  2. The validation results should be in the output folder.

Consolidation

  1. When processing is done, consolidate all the files in each output folder, then copy files in/scripts/conversion to it.
  2. Run PowerShell in it and run the /scripts/conversion/conversion.ps1 file.
  3. The compute results would be consolidated in the results.csv file of the same folder.

A generation is complete, you may go to 2 to obtain the next gen.

Post processing (optional, but recommended to do for last gen)

  1. Run Final Results.ipynb to visualize the data and filter up the best molecule.
  2. A file will be created in generations/master_results_table_final.csv. This file can be validated by bioinformatics.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants