-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Save interpretations #12
Conversation
* finish download module changes * download notebook * rerun split data module * rerun download module * rerun train_model * rerun evaluation module * rerun interpretation module * combine datasets * combine datasets * split changes * update format * format update * format * finish split data * combine datasets, remove holdout * formatting * rerun pipelines * remove folded class * rerun pipeline * Update utils/download_utils.py Co-authored-by: Dave Bunten <[email protected]> * PR fixes * module docstrings Co-authored-by: Dave Bunten <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
A couple discussion items that you may decide to change:
- I think adding the term
coefficients
to the notebook (currentlyinterpret_model.ipynb
) might be more explicit about what going on - Currently, the features are being saved as their index? Is this true? Could you instead use their feature name? This will prevent the scenario of incorrect coefficient index assignment.
- Are these features DP or CP? It's probably worth specifying in the file name.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice work! I only had minor comments and suggestions with this review for your consideration. Overall LGTM!
In response to @gwaybio, Suggestions 1 and 2 have been implemented in bc1bf11. |
This PR is ready for review!
In this PR, model coefficients are saved for the final and shuffled baseline models. These coefficients are saved in the tidy long standardized data format.