[ML] Improvements to trend modelling and periodicity testing for forecasting #7

tveasey · 2018-02-23T18:32:18Z

Description
This is a merge of a feature branch for issue #5. This work has already been reviewed as individual changes; however, I had some merge collisions with other recent changes.

Effects
This affects results, both model bounds and anomaly detection, for all data sets for which we use trend and/or seasonality modelling. In practice this is most data sets for non-rare functions. We've investigated changes across our full benchmark set. In all but one of the cases where results have changed appreciably, our anomaly detection accuracy has improved. Of course the primary driver is forecasting, which is now more reliable, has greatly improved confidence intervals and doesn't display obvious pathologies, particularly for longer time ranges.

This introduces a mean 30% memory increase across our benchmark set. The increase is unavoidable and breaks down as follows:

The trend model itself is significantly larger (rather than 1 we now have 8 at different time scales).
We now have the trend model all the time. Previously it was created only if we determined it was worthwhile to do so, but we need this model for forecasting and can't create it retrospectively.
The new style periodicity testing uses slightly more memory.
We have some extra state in the seasonal components for forecasting.
We sometimes find additional periodic components in the data (with associated accuracy benefits).

These costs can be offset against some recent memory wins going into 6.3.

droberts195

I noticed one thing that probably slipped between the cracks of the previous reviews of incremental changes.

droberts195 · 2018-02-27T19:02:44Z

lib/model/CAnomalyDetector.cc

@@ -91,7 +91,7 @@ CAnomalyDetector::TModelPtr makeModel(const CAnomalyDetector::TModelFactoryCPtr

 // Increment this every time a change to the state is made that requires
 // existing state to be discarded
-const std::string CAnomalyDetector::STATE_VERSION("34");
+const std::string CAnomalyDetector::STATE_VERSION("35");


I don't think we should be doing this now the changed portion of the state is being upgraded.

Hopefully reverting this line won't mean regenerating any of the upgrade test files, but please can you check?
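
To illustrate the concern, here is a minimal sketch of how a state version check of this kind typically behaves. It is not the actual ml-cpp restore code: `PersistedState` and `restore` are hypothetical names, and the only detail taken from the diff is that a change to `STATE_VERSION` causes existing state to be discarded.

```cpp
#include <optional>
#include <string>

// Hypothetical stand-in for persisted detector state; not the real ml-cpp type.
struct PersistedState {
    std::string version; // value of STATE_VERSION when the state was written
    std::string payload; // serialized model state
};

// Assumed current version, mirroring the unbumped value in the diff above.
const std::string STATE_VERSION("34");

// Returns the payload when the persisted version matches the current one.
// On a mismatch it returns nothing, i.e. the existing state is discarded.
std::optional<std::string> restore(const PersistedState& state) {
    if (state.version != STATE_VERSION) {
        return std::nullopt;
    }
    return state.payload;
}
```

Under this assumption, bumping the version would throw away every existing model on restore, which is unnecessary if the changed portion of the state can be upgraded in place.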

tveasey · 2018-02-27T19:38:51Z

Thanks @droberts195 it had indeed: corrected.

droberts195

LGTM

Improvements to trend modelling and periodicity testing for forecasting.

32633b6

tveasey added >enhancement v7.0.0 v6.3.0 labels Feb 23, 2018

tveasey requested a review from droberts195 February 23, 2018 18:32

droberts195 reviewed Feb 27, 2018

Revert model version bump

b638ac7

droberts195 approved these changes Feb 27, 2018

tveasey added 2 commits February 27, 2018 12:00

Fix time of day/month dependent test failure

307c0dc

Be a bit more careful with forecast variance calculation

582cb1c

tveasey merged commit 64fb093 into elastic:master Feb 27, 2018

tveasey mentioned this pull request Feb 27, 2018

[ML] Improvements to forecasting robustness (part 1) #5

Closed

9 tasks

tveasey added a commit that referenced this pull request Mar 8, 2018

Improvements to trend modelling and periodicity testing for forecasti…

408a7a7

…ng (#7) This is a merge of a feature branch for issue #5.

droberts195 added a commit that referenced this pull request Mar 9, 2018

Remove duplicate class name prefix

af320b1

This was inadvertently added in #7

droberts195 added a commit that referenced this pull request Mar 9, 2018

Remove duplicate class name prefix

bc46b68

This was inadvertently added in #7

sophiec20 changed the title ~~Improvements to trend modelling and periodicity testing for forecasting~~ [ML] Improvements to trend modelling and periodicity testing for forecasting Mar 28, 2018

sophiec20 added the :ml label Apr 4, 2018

droberts195 pushed a commit that referenced this pull request Apr 23, 2018

Improvements to trend modelling and periodicity testing for forecasti…

ec7bcf4

…ng (#7) This is a merge of a feature branch for issue #5.

droberts195 added a commit that referenced this pull request Apr 23, 2018

Remove duplicate class name prefix

616f571

This was inadvertently added in #7

droberts195 pushed a commit that referenced this pull request Apr 23, 2018

Improvements to trend modelling and periodicity testing for forecasti…

d8a6687

…ng (#7) This is a merge of a feature branch for issue #5.

droberts195 added a commit that referenced this pull request Apr 23, 2018

Remove duplicate class name prefix

c1b8d66

This was inadvertently added in #7

lcawl mentioned this pull request Jun 8, 2018

[DOCS] Add machine learning release highlights elastic/elasticsearch#31214

Merged

davidkyle mentioned this pull request Jun 20, 2023

[NLP] Catch exceptions thrown during inference and report as errors #2542

Merged
