
[ML] Extend unit testing and make some improvements to deal with changes in seasonal components #91

Merged

Conversation

tveasey (Contributor) commented May 11, 2018

In the course of other testing related to #6, I found evidence of potential instability in the decomposition when the seasonal components in the time series abruptly change. In addition, our lifecycle management of seasonal components was not reliably removing components which were no longer improving predictions. This was related to the way we were trying to preserve components which carry information about periodic variance in the data.

This adds a unit test to exercise these problems and makes some related improvements. As part of these I've implemented a failsafe to remove all seasonal components and start again if instability occurs. The current behaviour on this unit test (red actuals, blue predictions) is as follows:
[Screenshots (2018-05-10): unit test results with red actuals and blue predictions]

For reference, this is what was happening before:
[Screenshot (2018-05-15): previous behaviour on the same unit test]

This can affect results on metric and count analysis with seasonal signals where seasonality changes significantly from time to time.

Release note: Improves behavior when there are abrupt changes in the seasonal components present in a time series

@@ -255,6 +258,7 @@ const TSizeVecVec SC_TRANSITION_FUNCTION{
TSizeVec{SC_NORMAL, SC_NORMAL, SC_NORMAL, SC_NORMAL}};

const std::string VERSION_6_3_TAG("6.3");
const std::string VERSION_6_4_TAG("6.4");
Contributor

We probably want to prevent a job that has written 6.4 state from running on a 6.3 node in a mixed 6.3/6.4 cluster. I guess if this isn't done then a couple of fields won't get initialised. The way to do that is to change line 78 of CAnomalyJob.cc.

@@ -29,6 +29,7 @@

Improve and use periodic boundary condition for seasonal component modeling ({pull}84[#84])
Improve robustness w.r.t. outliers of detection and initialisation of seasonal components ({pull}90[#90])
Extend unit testing and make some improvements to deal with changes in seasonal components ({pull}91[#91])
Contributor

I don't think the release notes need to mention unit test changes - someone trying to find out what's new in ML isn't going to care.

Contributor Author

Fair enough, I'll update.

scale = CTools::truncate(1.002 * scale - 0.001, 0.0, 1.0);

return -scale * min[0] * CTools::sign(shortPeriodValue);
return bias.signMargin() != 0.0 ? bias.signMargin() : cancellation.signMargin();
Contributor

Would you want to treat very small numbers as 0? For example, if bias.signMargin() was 1e-308 would you still want to use it? (This is only a question - if the answer is yes then fine.)

Contributor Author

One could, for example, not bother if it is << amplitude. However, the chance of this being very small but not identically zero is actually low: if there is no bias, then with high probability the differences at the different times in the period will not all have the same sign, because noise in the underlying values should cause them to land randomly on different sides of the mean. Also, a small but non-zero delta doesn't have a significant impact. So on balance I don't think this warrants the extra complication.

Contributor

@droberts195 droberts195 left a comment

LGTM

tveasey (Contributor, Author) commented May 14, 2018

This turned out to be more involved: testing with a variable decay rate, I again hit the instability. I've therefore significantly extended the original PR:

  1. I did some more work on the delta calculation. In particular, it now minimises the amplitude of the summed components.
  2. I added active monitoring for signs of instability, by inspecting the amplitude of the summed components, and reduce the gain if the modelling displays any symptoms of instability.
  3. I made sure we remove all components if the prediction accuracy drops below the reference.

I split the extra changes into two commits. The first is a pure refactor to better encapsulate seasonal and calendar components. The second makes the functional changes. Can you take another look, @droberts195?

Contributor

@droberts195 droberts195 left a comment

LGTM

@tveasey tveasey force-pushed the enhancement/improve-trend-component-lifecycle branch from d716093 to d6b87d3 on May 15, 2018 08:48
@tveasey tveasey merged commit f3d4e71 into elastic:master May 15, 2018
tveasey added a commit that referenced this pull request May 22, 2018
@tveasey tveasey deleted the enhancement/improve-trend-component-lifecycle branch December 13, 2018 08:40