-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve feature design and selection #309
Comments
Some experiments with features shared by Monique
Now with residual models:
|
Periods: pre-planting (or before emergence), planting (emergence), vegetative, flowering, grain-filling, harvest
See examples in Table 1 and 2 here: https://doi.org/10.1016/j.agsy.2020.103016 |
Example code to test improved features:
|
Thanks Dilli |
I revisited my script to add the static features and to add the target and I realized we did not compare the same things 😞 There were some mistakes in my code. I (hopefully) fixed everything and I was able to run some tests. I used as thresholds:
These are not yet carefully thought (they don't even are related to the periods in your paper). So this choice is still pending. I just wanted to show I was able to run and point some issues in the process.
Original - SklearnRidge {'normalized_rmse': 42.79182885351677, 'mape': 0.41091685689652985, 'r2': -14.011533645734382} Original - RidgeRes {'normalized_rmse': 8.891690336417946, 'mape': 0.0752928093435701, 'r2': 0.3518552208859771}
Original - SklearnRidge {'normalized_rmse': 38.022887420217536, 'mape': 0.47719593050118486, 'r2': 0.16427242385022855} Original - RidgeRes {'normalized_rmse': 48.781203476171804, 'mape': 0.3993343352764365, 'r2': -0.3755600884502228}
Original - SklearnRidge {'normalized_rmse': 20.690912736393592, 'mape': 0.1827639729644808, 'r2': -0.20693118758279616} Original - RidgeRes {'normalized_rmse': 28.566423643255995, 'mape': 0.24972368856241334, 'r2': -1.3005673752745484}
Original - SklearnRidge {'normalized_rmse': 80.53221377905166, 'mape': 0.9290330288460491, 'r2': -3.4319357073350094} Original - RidgeRes {'normalized_rmse': 23.337056497920795, 'mape': 0.2135844654568528, 'r2': 0.6278257418567065} To note:
|
Changing the thresholds (300, 750, 1200, 1600) for wheat NL led to quite better results than my previous ones. But these should really be more adequately determined than only by my testing of values. Original - SklearnRidge {'normalized_rmse': 42.79182885351677, 'mape': 0.41091685689652985, 'r2': -14.011533645734382} |
The basic expectation here is that we should beat the naive or average yield model. Try different ideas and improve feature design or feature selection to make this happen.
The text was updated successfully, but these errors were encountered: