Skip to content

Latest commit

 

History

History
65 lines (43 loc) · 2.43 KB

README.md

File metadata and controls

65 lines (43 loc) · 2.43 KB

sktime-fable-hierarchical forecasting experiement

Here the plan is to use both a real world dataset (labour) and a synthetic data set to compare the hierachical forecasting results from sktime (python) and fable (R).

The goal is to ensure the sktime hierarchical implementation is consistent with the fable one. Therefore we will use a simple and consistent base forecaster across the two libraries.

Environment

To reproduce the results, the environment can be recreated as follows.

Please run the python notebooks before the R ones for each experiment :)

If you don't want to re-run it, the output is saved as a .html file which can be loaded up in a browser.

conda create -n sktime_hier python=3.7 -y
conda activate sktime_hier
pip install sktime
pip install datasetsforecast
pip install requests
pip install statsforecast
conda install ipykernel -y

Overall Results

Sktime is at least on par with fable in terms of the reconciliation methods which was our goal at the start. There are some minor differences between the methods that use the residual covariance matrix, due to how they are calculated under the hood. Note, results with * indicate there is probably a problem with the empirical residual covariance matrix. There is a bug in the top down reconciliation method in fable which has kindly already been reported here by the folks from Nixtla. The forecasts are evaluated using RMSE.

Tourism Data
Model Sktime Fable
base 17.523 17.523
bu 17.819 17.819
mint_cov 17.519 92.324*
mint_shrink 17.580 17.596
ols 17.519 17.519
td_fcst 17.466 -
wls_str 17.505 17.614
wls_var 17.573 17.614
Synthetic Data

The forecasts are evaluated using RMSE.

Model Sktime Fable
base 2601.781 2601.781
bu 2601.857 2601.857
mint_cov 496458.562* -
mint_shrink 2601.781 2601.829
ols 2601.781 2601.781
td_fcst 2601.771 -
wls_str 2601.776 2601.800
wls_var 2601.776 2601.800