Benchmark with full data: Figure 1 notebook #328

EYezerets · 2020-10-19T18:41:37Z

Reference issue

Addresses issue 321
: Run UF figure 1 tutorial with real parameters and makes progress toward issue 62: re-write all the UF notebooks called ProgL repo functions.

Type of change

Documentation, Bug fix

What does this implement/fix?

This was run on AWS with the full data, and the figure is updated. Now, the tutorial for Figure 1 created by @bstraus1 shows the figure run with the test parameters, while this notebook in benchmarks/uf_posterior_visualization shows Figure 1 run using UncertaintyForest() with the real parameters. This also fixes a bug in forest.py (self.max_depth is now used), and allows UncertaintyForest() to take tree_construction_proportion as an argument (set to 0.4 in the original uncertainty forest paper)

Additional information

Update Staging

levinwil · 2020-10-19T18:56:02Z

Please don't save fig1.pdf - it's already in the notebook itself
You don't need the functions folder - these functions are already defined in docs/tutorials/functions so please just import from there so we don't have duplicate code in the same repo and remove ProgLearn/benchmarks/uf_posterior_visualization/functions/ entirely. You don't need another functions folder defining the same functions.

levinwil

@EYezerets please see above comments

bstraus1 · 2020-10-19T18:58:17Z

Will you also reference #62 in this issue description? It is the umbrella issue which included #321.

levinwil · 2020-10-19T19:05:16Z

Also, the hyperparams defining UF in this notebook do not match the original uf hyperparams.

This is the original notebook used to produce figure 1 in the UF paper: https://github.com/neurodata/uncertainty-forest/blob/master/figs/fig1/figure-1.ipynb

Please note that the fraction of the data used to generate tree structure was .4. In ProgLearn, we use .67.

Please add a parameter to the UncertaintyForest initializer (https://github.com/neurodata/ProgLearn/blob/main/proglearn/forest.py#L218) and call it "tree_construction_proportion." Please add that parameter to the initialization docstring for UF. Then feed that parameter into the LifelongClassificationForest as the default_tree_construction_proportion on this line (https://github.com/neurodata/ProgLearn/blob/main/proglearn/forest.py#L237). Then PR that feature addition into staging.

Then rerun the notebook and set tree_construction_proportion = .4 in the notebook when initializing UF. After you make the updates listed a few comments above, and have done the instructions described here, make a PR (separate from the feature addition PR) for the rerun notebook into staging.

EYezerets · 2020-10-20T00:39:12Z

@levinwil Thanks for all the guidance! If we're changing tree_construction_proportion, do we want to go ahead and also set the finite sample correction to kappa=3?
I believe that Fig1 saves as a pdf from the plotting function that's in the functions folder that @bstraus1 's tutorial also calls (please correct me if I'm wrong). Do you want me to just delete the pdf before I commit, or change the function's code so it doesn't output a pdf?
Looks like these are some significant changes. Are they relevant to @bstraus1 's tutorial notebook as well, or do we not care since it just needs to run?

levinwil · 2020-10-20T00:44:55Z

We don’t currently have the capability to set kappa.
Please both delete the PDF and change the function so that it does NOT save further PDF’s
Don’t make these changes to the Ben’s tutorial notebook

Staging updates

levinwil

@EYezerets The Travis checks did not pass. Please black format your files. There are instructions on how to do so in the contribution guidelines.

EYezerets · 2020-10-21T12:06:43Z

Hi @levinwil, sorry I didn't see your comment about black formatting. I have not used this, since I was working with the uncertainty-forest repository before. Does it mean I need to pip install these two things in my virtual environment and then rerun the code? (I found this: https://travis-ci.org/github/neurodata/ProgLearn/builds/737697602/config and this: https://bitsandbrains.io/2020/10/05/pr-checklist)

pip install -U pytest pytest-cov codecov
pip install black

levinwil · 2020-10-21T12:20:07Z

There are instructions on how to black format in the contribution guidelines: http://proglearn.neurodata.io/contributing.html

You simply run:
pip install black
black <file/folder name>

In your case, you should insert the folder ‘proglearn’, so the second command will be ‘black proglearn’ (the source code directory, NOT the overall repository)

Please note that you should ONLY run black formatting on source code in the proglearn directory. You should NOT black format any Jupyter notebooks.

EYezerets · 2020-10-21T14:06:45Z

Ah! I see OK thank you! That makes more sense. Will do

Merge pull request neurodata#328 from EYezerets/staging

EYezerets added 4 commits October 18, 2020 23:18

Merge pull request #1 from neurodata/staging

47f1e07

Update Staging

add uf_posterior_visualization and functions in benchmarks

6bedb32

figure 1 full run

e63226d

Update text in fig1 in benchmarks

9f1579f

levinwil requested changes Oct 19, 2020

View reviewed changes

levinwil linked an issue Oct 19, 2020 that may be closed by this pull request

re-write all the UF notebooks called ProgL repo functions #62

Closed

EYezerets added 6 commits October 20, 2020 11:23

add tree construction parameter to UncertaintyForest

d14a83c

Merge pull request #2 from neurodata/staging

18acf01

Staging updates

add doc string in UF, updated from neurodata staging branch

ec495e5

navigate to docs/tutorials/functions - attempt 1

5d21b36

navigate to docs/tutorials/functions - attempt 2

30fcd37

add path to functions - attempt 1

a31566d

levinwil requested changes Oct 20, 2020

View reviewed changes

Figure 1 full run with tree_construction_proportion = 0.4

52409a9

black formatted proglearn

c262c6b

levinwil approved these changes Oct 21, 2020

View reviewed changes

levinwil merged commit f844065 into neurodata:staging Oct 21, 2020

EYezerets added a commit to EYezerets/ProgLearn that referenced this pull request Oct 21, 2020

Merge pull request #3 from neurodata/staging

220d45c

Merge pull request neurodata#328 from EYezerets/staging

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark with full data: Figure 1 notebook #328

Benchmark with full data: Figure 1 notebook #328

EYezerets commented Oct 19, 2020 •

edited

Loading

levinwil commented Oct 19, 2020

levinwil left a comment

bstraus1 commented Oct 19, 2020

levinwil commented Oct 19, 2020

EYezerets commented Oct 20, 2020

levinwil commented Oct 20, 2020

levinwil left a comment

EYezerets commented Oct 21, 2020

levinwil commented Oct 21, 2020 •

edited

Loading

EYezerets commented Oct 21, 2020

Benchmark with full data: Figure 1 notebook #328

Benchmark with full data: Figure 1 notebook #328

Conversation

EYezerets commented Oct 19, 2020 • edited Loading

Reference issue

Type of change

What does this implement/fix?

Additional information

levinwil commented Oct 19, 2020

levinwil left a comment

Choose a reason for hiding this comment

bstraus1 commented Oct 19, 2020

levinwil commented Oct 19, 2020

EYezerets commented Oct 20, 2020

levinwil commented Oct 20, 2020

levinwil left a comment

Choose a reason for hiding this comment

EYezerets commented Oct 21, 2020

levinwil commented Oct 21, 2020 • edited Loading

EYezerets commented Oct 21, 2020

EYezerets commented Oct 19, 2020 •

edited

Loading

levinwil commented Oct 21, 2020 •

edited

Loading