-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Benchmark with full data: Figure 1 notebook #328
Conversation
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@EYezerets please see above comments
Also, the hyperparams defining UF in this notebook do not match the original uf hyperparams. This is the original notebook used to produce figure 1 in the UF paper: https://github.com/neurodata/uncertainty-forest/blob/master/figs/fig1/figure-1.ipynb Please note that the fraction of the data used to generate tree structure was .4. In ProgLearn, we use .67. Please add a parameter to the UncertaintyForest initializer (https://github.com/neurodata/ProgLearn/blob/main/proglearn/forest.py#L218) and call it "tree_construction_proportion." Please add that parameter to the initialization docstring for UF. Then feed that parameter into the LifelongClassificationForest as the Then rerun the notebook and set tree_construction_proportion = .4 in the notebook when initializing UF. After you make the updates listed a few comments above, and have done the instructions described here, make a PR (separate from the feature addition PR) for the rerun notebook into staging. |
@levinwil Thanks for all the guidance! If we're changing tree_construction_proportion, do we want to go ahead and also set the finite sample correction to kappa=3? |
|
Staging updates
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@EYezerets The Travis checks did not pass. Please black format your files. There are instructions on how to do so in the contribution guidelines.
Hi @levinwil, sorry I didn't see your comment about black formatting. I have not used this, since I was working with the uncertainty-forest repository before. Does it mean I need to pip install these two things in my virtual environment and then rerun the code? (I found this: https://travis-ci.org/github/neurodata/ProgLearn/builds/737697602/config and this: https://bitsandbrains.io/2020/10/05/pr-checklist) pip install -U pytest pytest-cov codecov |
There are instructions on how to black format in the contribution guidelines: http://proglearn.neurodata.io/contributing.html You simply run: In your case, you should insert the folder ‘proglearn’, so the second command will be ‘black proglearn’ (the source code directory, NOT the overall repository) Please note that you should ONLY run black formatting on source code in the proglearn directory. You should NOT black format any Jupyter notebooks. |
Ah! I see OK thank you! That makes more sense. Will do |
Merge pull request neurodata#328 from EYezerets/staging
Reference issue
Addresses issue 321
: Run UF figure 1 tutorial with real parameters and makes progress toward issue 62: re-write all the UF notebooks called ProgL repo functions.
Type of change
Documentation, Bug fix
What does this implement/fix?
This was run on AWS with the full data, and the figure is updated. Now, the tutorial for Figure 1 created by @bstraus1 shows the figure run with the test parameters, while this notebook in benchmarks/uf_posterior_visualization shows Figure 1 run using UncertaintyForest() with the real parameters. This also fixes a bug in forest.py (self.max_depth is now used), and allows UncertaintyForest() to take tree_construction_proportion as an argument (set to 0.4 in the original uncertainty forest paper)
Additional information