Feature: Adapt the grt123 model #4

reubano · 2017-08-01T22:23:21Z

Overview

Currently, there is just a placeholder in the algorithm that classifies nodules in scans. Nodules are areas of interest that might be cancerous. We need to adapt the Data Science Bowl (DSB) algorithms to predict P(cancer) given an iterator of nodule centroids for an image.

The top DSB algorithm (grt123) was written to run on a GPU for Python2. It would be nice to integrate this algorithm into the current structure and update it to run on Python3 (potentially on a CPU as well).

Expected Behavior

Given the grt123 model trained to perform this task, a DICOM image, and an iterator of nodule centroids, yield the P(cancer) for each nodule.

Design doc reference: Detect and select
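The expected behavior above can be sketched as a generator. This is only a minimal illustration of the shape of the interface, not the project's actual code: the names `predict`, `classify_nodule`, `dummy_classifier`, the `(x, y, z)` centroid layout, and the result dictionary keys are all assumptions made for this example.

```python
from typing import Callable, Dict, Iterable, Iterator, Tuple

# Hypothetical centroid layout: (x, y, z) voxel coordinates.
Centroid = Tuple[int, int, int]


def predict(
    classify_nodule: Callable[[object, Centroid], float],
    image: object,
    centroids: Iterable[Centroid],
) -> Iterator[Dict[str, object]]:
    """Lazily yield P(cancer) for each nodule centroid of one image."""
    for centroid in centroids:
        # One model call per nodule, so the caller can stop early.
        p_cancer = float(classify_nodule(image, centroid))
        yield {"centroid": centroid, "p_cancer": p_cancer}


def dummy_classifier(image: object, centroid: Centroid) -> float:
    """Placeholder standing in for the trained grt123 model."""
    return 0.5


if __name__ == "__main__":
    nodules = iter([(10, 20, 30), (40, 50, 60)])
    for result in predict(dummy_classifier, image=None, centroids=nodules):
        print(result)
```

Because the function is a generator, nothing is computed until the caller iterates, which fits the "iterator in, results out" wording of the requirement.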

Technical details

The majority of the Python3 and CPU conversion has been completed and is available in the conversion branch.
The forked model is available here (reads source DICOM images from S3).
One area that definitely needs review is the Py2/Py3 floor/true division conversions. Some calculations explicitly converted numbers to floats, and in those cases it was apparent that true division was desired. However, the remaining floor division calculations should be checked to ensure that true division is not the appropriate operation.
If you get a UnicodeDecodeError while trying to load the serialized Torch model, use the torch_loader function in the utils module instead.
When running on the CPU, it isn't necessary to perform that much work: just enough to obtain a plausible result in a reasonable amount of time.
This feature should be implemented in the prediction/classify/trained_model/predict method.
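To make the floor/true division point concrete, here is a small sketch of the difference a reviewer would look for. The two helper functions are hypothetical and do not come from the grt123 code; they only illustrate when each operator is the likely intent.

```python
def center_index(length: int) -> int:
    """Index arithmetic: floor division keeps the result an int."""
    return length // 2


def normalized(value: int, max_value: int) -> float:
    """Ratio arithmetic: true division keeps the fractional part."""
    return value / max_value


if __name__ == "__main__":
    print(7 // 2)    # 3
    print(7 / 2)     # 3.5 in Python 3 (Python 2 returned 3 for two ints)
    print(-7 // 2)   # -4, floor division rounds toward negative infinity
    print(center_index(7), normalized(7, 2))
```

As a rule of thumb, values used as array indices, shapes, or strides want `//`, while scale factors and spacings want `/`. In Python 2 code, `from __future__ import division` gives `/` its Python 3 meaning, which can help when comparing results between the two versions.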

Acceptance criteria

the integrated model produces results similar to the forked model

NOTE: All PRs must follow the standard PR checklist.

Serhiy-Shekhovtsov · 2017-09-07T07:48:31Z

I'd like to work on this one.

P.S. btw, is it a good idea to tell when you are working on something?

Serhiy-Shekhovtsov · 2017-09-17T07:36:46Z

Hi @reubano! I've fixed a few issues in the pre-processing part and managed to get it running successfully and giving the same result as the original code. These changes have been pushed now.
Now I am looking into the detection part, but I've run into this issue:

Torch: not enough memory: you tried to allocate 21GB

Does it really take this much memory or is it an issue with environment \ code?

reubano · 2017-09-18T19:16:11Z

@Serhiy-Shekhovtsov

Hi @reubano! I've fixed a few issues in the pre-processing part and managed to get it running successfully and giving the same result as the original code.

That's great! Nice job!

Does it really take this much memory or is it an issue with environment \ code?

I'm afraid I don't know the answer to that one. Do you get the same error when running the base repo?

dchansen · 2017-09-19T14:00:28Z

I have the full model adapted for Python3 in the pull request at #122

The nodule identification part is extremely memory hungry. Be sure to enable the volatile setting on the Variables and reduce the chunk size in the SplitComb.
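A rough sketch of why a smaller chunk size helps: if the volume is processed in overlapping cubic chunks, peak memory scales with the cube of the chunk edge. The code below is not the SplitComb API; `iter_chunks`, `chunk_mebibytes`, and the numbers used (a 300-voxel cube, side lengths 144 and 72, margin 32, float32 values) are assumptions chosen for illustration.

```python
from itertools import product
from typing import Iterator, Tuple

Shape = Tuple[int, int, int]


def iter_chunks(shape: Shape, side_len: int, margin: int) -> Iterator[Tuple[slice, ...]]:
    """Yield overlapping cubic chunks that together cover a 3D volume."""
    starts = [range(0, dim, side_len) for dim in shape]
    for corner in product(*starts):
        # Extend each chunk by `margin` voxels, clipped to the volume.
        yield tuple(
            slice(max(0, start - margin), min(dim, start + side_len + margin))
            for start, dim in zip(corner, shape)
        )


def chunk_mebibytes(side_len: int, margin: int, channels: int = 1,
                    bytes_per_value: int = 4) -> float:
    """Upper bound on the size of one chunk (float32 by default), in MiB."""
    edge = side_len + 2 * margin
    return channels * edge ** 3 * bytes_per_value / 2 ** 20


if __name__ == "__main__":
    volume = (300, 300, 300)
    for side_len in (144, 72):
        n_chunks = sum(1 for _ in iter_chunks(volume, side_len, margin=32))
        size = chunk_mebibytes(side_len, margin=32)
        print(f"side_len={side_len}: {n_chunks} chunks, up to {size:.1f} MiB each")
```

Smaller chunks mean more forward passes but a lower peak per pass; intermediate feature maps multiply the per-chunk figure, which the `channels` parameter stands in for. The `volatile` flag mentioned above belongs to the PyTorch API of that time; later releases replaced it with the `torch.no_grad()` context manager.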

Serhiy-Shekhovtsov · 2017-09-19T18:17:52Z

Do you get the same error when running the base repo?

Yes.

I have the full model adapted for Python3 in the pull request at #122

Great!

reubano · 2017-09-20T11:39:29Z

@Serhiy-Shekhovtsov did @dchansen's recommendations help you out?

Serhiy-Shekhovtsov · 2017-09-20T15:27:29Z

I haven't checked yet.

Serhiy-Shekhovtsov · 2017-09-22T06:55:39Z

@dchansen I am getting this error when trying to check out your repo:

$ git-lfs.exe smudge -- prediction/src/algorithms/classify/assets/gtr123_model.ckpt
Error downloading object: prediction/src/algorithms/classify/assets/gtr123_model.ckpt (eb2c037dd55e2f49da95657f7a7851cbcfe2f2b516848ed03f8c5c820f3e16b4)

Smudge error: Error downloading prediction/src/algorithms/classify/assets/gtr123_model.ckpt (eb2c037dd55e2f49da95657f7a7851cbcfe2f2b516848ed03f8c5c820f3e16b4): [eb2c037dd55e2f49da95657f7a7851cbcfe2f2b516848ed03f8c5c820f3e16b4] Object does not exist on the server: [404] Object does not exist on the server

Any idea what this is and how to solve it?

dchansen · 2017-09-22T08:13:49Z

It was an error on my side. It's fixed in the repo now.

Serhiy-Shekhovtsov · 2017-09-22T08:18:36Z

Thanks, now I can pull it.

reubano added this to the 1-mvp milestone Aug 1, 2017

reubano added feature medium official prediction labels Aug 1, 2017

reubano changed the title ~~Feature: Integrate the algos-grt123 branch~~ Feature: Adapt the grt123 model Aug 4, 2017

Serhiy-Shekhovtsov mentioned this issue Sep 17, 2017: Feature: Implement identification algorithm #1 (Closed, 2 tasks)

dchansen mentioned this issue Sep 18, 2017: Converted Team 'grt123' identification and classification algorithms #122 (Merged, 1 task)

reubano closed this as completed Sep 29, 2017

vessemer mentioned this issue Oct 17, 2017: 169 | 4 MetaData and PreprocessCT refactored, grt123 processing adopted #170 (Merged, 1 task)

reubano mentioned this issue Oct 20, 2017: Document the Daniel Hammack algorithm #19 (Closed, 1 task)

vessemer mentioned this issue Nov 20, 2017: 'Import' UI element should be able to create backend ImageSeries model instance #145 (Closed, 5 tasks)

reubano mentioned this issue Jan 18, 2018: Continuous improvement of nodule classification models (see #2) #131 (Open, 2 tasks)
