Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integration of RF model to predict efficacy of immune checkpoint blockade (BioModels) #1355

Merged
merged 6 commits into from
Nov 7, 2023

Conversation

anuprulez
Copy link
Contributor

@anuprulez anuprulez commented Nov 7, 2023

The PR adds a tool that uses an RF model, downloaded via a Docker container, to predict the efficacy of immune checkpoint blockage on the provided test data. The RF model is available via BioModels (https://www.ebi.ac.uk/biomodels/BIOMD0000001066). The Docker container available in BioModels (https://www.ebi.ac.uk/biomodels/BIOMD0000001066#Files) for prediction has been customised to reduce its uncompressed size (by half, from over 4 GBs to less than 2). The newly developed container is linked in the tool (docker.io/anupkumar/immune-checkpoint:latest) and also stored on Docker Hub (https://hub.docker.com/repository/docker/anupkumar/biomd0000001066/general). The script used to create this customised container is available on GitHub (https://github.com/anuprulez/biomodelsml-docker/tree/master/rf_immune_checkpoints_blockage).

test file(s):
test_data.xlsx
test_data.csv

Use one of these files to test the tool on Galaxy server.

Tool's UI (on a local Galaxy instance):

local_instance

ping @bgruening

<container type="docker">docker.io/anupkumar/immune-checkpoint:latest</container>
</requirements>
<command><![CDATA[
cp /home/\$NB_USER/forest16.onnx `pwd` && python '$biom_script'
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
cp /home/\$NB_USER/forest16.onnx `pwd` && python '$biom_script'
cp /home/\$NB_USER/forest16.onnx ./ &&
python '$biom_script'

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I fixed the permissions in the container, let's see. Not sure if we need to create a new user just like the GPU based container.

</test>
<test>
<param name="input_file" value="test_data.csv" ftype="csv" />
<output name="output_file" file="pred_data.tabular" compare="sim_size" delta="5" />
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sim_size is not recommendade, can we use assert_contents here, number of columns, number of rows https://docs.galaxyproject.org/en/latest/dev/schema.html#id112

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, sound good!

@@ -0,0 +1,89 @@
<tool id="bio_ml_tool" name="Random Forest model" version="0.0.1" profile="22.05">
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
<tool id="bio_ml_tool" name="Random Forest model" version="0.0.1" profile="22.05">
<tool id="biomodel_BIOMD0000001066" name="Random Forest model" version="1" profile="22.05">

I have taken those from https://www.ebi.ac.uk/biomodels/BIOMD0000001066#History

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sounds good, fixed it.

<tool id="bio_ml_tool" name="Random Forest model" version="0.0.1" profile="22.05">
<description>to predict efficacy of immune checkpoint blockade across multiple cancer patient cohorts</description>
<requirements>
<container type="docker">docker.io/anupkumar/immune-checkpoint:latest</container>
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe also rename the container to fit the tool_id?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks, fixed!

@bgruening bgruening merged commit ce5f617 into bgruening:master Nov 7, 2023
10 checks passed
@anuprulez anuprulez deleted the biomodelsml-immune-checkpoint branch November 7, 2023 16:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants