Expose agg usage in Feature Usage API #53746

polyfractal · 2020-03-18T16:28:57Z

After the ValuesSource refactor lands, all aggregation usage will flow through the new registry. This gives us an opportunity to easily count usage of the aggs and expose on the Nodes feature usage API.

The API is helpful for single-purpose endpoints, but endpoints like Search have a huge number of disparate actions which are all lumped together. Searches, aggregations, highlighting, suggestions, scrolling, composite, etc etc all show up under the Search endpoint.

There are many times an administrator would like to see more granular data about what's flowing through their Search endpoint, without enabling slow logs and picking through the firehose. Since queries and aggregations are among the more expensive operations, we should provide more granular feedback on these activities. With a relatively small change to the VS registry we can record how often agg builders are parsed and give an administrator a high-level overview of usage.

It would also make opt-in telemetry simpler and less invasive to collect for XPack features.

elasticmachine · 2020-03-18T16:28:58Z

Pinging @elastic/es-analytics-geo (:Analytics/Aggregations)

imotov · 2020-04-22T18:42:25Z

We have two ways of implementing this feature: incrementing the usage counter on the parser invocation and by incrementing the counter on ValuesSourceRegistry access.

In case of parser, we can count high level invocations. Basically for each search that contains one aggregation of a certain type we will count it as one use. Unfortunately, on this level we will not have information about value type the aggregation is executed one. So for example if a percentiles was executed we will not know if it was executed on histogram data type or not.

In case of ValuesSourceRegistry access counters, we will now the type, but we will count each shard access as a separate invocation. So, if we execute a single search with a single aggregation on 2 indices with 5 shards each, we will count his as 10 invocation. We can do some tricks like counting it only on shard 0. In this case it will be counted as 2 invocations.

polyfractal · 2020-04-22T19:03:13Z

I'm tentatively leaning towards the second option. I think it'd be more useful to know the complete list of aggs being used, as well as which field types they are being used against... even if it means they are "over-counted" by doing it per-shard. My feeling is that the absolute numbers don't matter as much as relative numbers?

E.g. as an administrator I think I'd care that an expensive scripted-metric might be running two orders of magnitude more than all the other aggs, but not necessarily care about the exact numbers. So it's all relative and doesn't really rely on precise once-per-query counting.

giladgal · 2020-04-22T19:15:26Z

I agree between these two the second option seems better.

imotov · 2020-04-22T19:45:07Z

Thanks! I am going with the second option then.

imotov · 2020-04-24T15:12:09Z

At the end of the day @not-napoleon found a good place to make a single call per parsing while having values source type in hand.

Counts usage of the aggs and exposes them on the _nodes/usage/. Closes elastic#53746

* Expose agg usage in Feature Usage API Counts usage of the aggs and exposes them on the _nodes/usage/. Closes #53746 * Refactor to include non value sources aggregations * Fix reported values source type for parent and children aggs * Refactor SearchModule constructor * Fix subtype in TTest and IPRanges * Fix more subtypes in aggs that don't register themselves * Fix doc tests * Fix docs * Fix ScriptedMetricAggregatorTests * Fix compilation issues after merge * Fix merge fallout * This gets stale quickly... * Address review comments * Fix tests that were missing proper agg registration in the search module * Fix ScriptedMetricAggregatorTests * Address review comments Co-authored-by: Elastic Machine <[email protected]>

Counts usage of the aggs and exposes them on the _nodes/usage/. Closes elastic#53746

Counts usage of the aggs and exposes them on the _nodes/usage/. Closes #53746

$@polyfractal$ polyfractal added the :Analytics/Aggregations Aggregations label Mar 18, 2020

$@polyfractal$ polyfractal assigned not-napoleon Mar 30, 2020

imotov assigned imotov and unassigned not-napoleon Apr 22, 2020

imotov added a commit to imotov/elasticsearch that referenced this issue Apr 24, 2020

Expose agg usage in Feature Usage API

c0a7dd4

Counts usage of the aggs and exposes them on the _nodes/usage/. Closes elastic#53746

imotov mentioned this issue Apr 24, 2020

Expose agg usage in Feature Usage API #55732

Merged

imotov closed this as completed in #55732 Apr 30, 2020

$@polyfractal$ polyfractal added v7.8.0 v8.0.0 labels Apr 30, 2020

imotov added a commit to imotov/elasticsearch that referenced this issue Apr 30, 2020

Expose agg usage in Feature Usage API (elastic#55732)

d0fdc5c

Counts usage of the aggs and exposes them on the _nodes/usage/. Closes elastic#53746

imotov mentioned this issue Apr 30, 2020

[7.x] Expose agg usage in Feature Usage API (#55732) #56048

Merged

imotov added a commit that referenced this issue Apr 30, 2020

Expose agg usage in Feature Usage API (#55732) (#56048)

d8f9df7

Counts usage of the aggs and exposes them on the _nodes/usage/. Closes #53746

russcam mentioned this issue May 29, 2020

7.8.0 Meta ticket elastic/elasticsearch-net#4718

Closed

17 tasks

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose agg usage in Feature Usage API #53746

Expose agg usage in Feature Usage API #53746

polyfractal commented Mar 18, 2020

elasticmachine commented Mar 18, 2020

imotov commented Apr 22, 2020

polyfractal commented Apr 22, 2020

giladgal commented Apr 22, 2020

imotov commented Apr 22, 2020

imotov commented Apr 24, 2020

Expose agg usage in Feature Usage API #53746

Expose agg usage in Feature Usage API #53746

Comments

polyfractal commented Mar 18, 2020

elasticmachine commented Mar 18, 2020

imotov commented Apr 22, 2020

polyfractal commented Apr 22, 2020

giladgal commented Apr 22, 2020

imotov commented Apr 22, 2020

imotov commented Apr 24, 2020