-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor rollups meta (AKA Rollup V2) #42720
Comments
Pinging @elastic/es-analytics-geo |
ey @polyfractal I see this concept of "Grouping Tuple" really interesting for the metrics use case. I think doesn't only apply to rollups, but can be used in some queries too! It actually sounds familiar to what I did in Beats here: elastic/beats#10293, where we want to have enough information to do downsampling on query time to make sure results actually make sense. I wonder if we could reuse this grouping tuple field here? |
Great outline! I think the additional thing that would be good to think about and add to the outline, is what the migration path looks like for users currently using roll ups (I’m assuming that the new field type would happen during a rollup ILM action and therefore wouldn’t require a full re-index or source index?) and also what breaking changes we are proposing (at least @csoulios @polyfractal tagging y'all in case this got lost in the shuffle :) |
Following the implementation of the aggregate_metric_double field mapper(#49830) we are implementing the Min, Max, ValueCount, Sum and Average aggregations on aggregate metrics. The code builds on the excellent work done for #42949 and uses the extensible ValuesSources infrastructure to wire up common metric aggregation on the aggregate_metric_double field type. This PR is part of the rollups v2 refactoring as described in meta issue #42720
Regarding the relation with the Rollup functionality that is already released: Rollup is currently an experimental feature in Elasticsearch. We plan to make Rollup GA when we release the refactored rollup which will make rolled up indices queryable through _search and _async_search and which will make rollup an action in ILM. We are not making Rollup GA before that because we know this change will bring a change in the API. We will make an effort to provide an upgrade path for Rollup indices created with the current Rollup processes to the new refactored Rollup system, which will make these indices queryable through the _search and _async_search API endpoints. |
Will rollups handle new fields arriving in the data? For example winlogbeat adds new fields when one appears in the windows event log. Would this new field get rolled up automatically or would we have to manually modify the rollup job? |
This commit moves the ownership of tracking the rollup_index from the RollupActionConfig to the RollupAction.Request. This is cleaner since the config should not be concerned with the source and rollup indices. relates elastic#42720.
This commit moves the ownership of tracking the rollup_index from the RollupActionConfig to the RollupAction.Request. This is cleaner since the config should not be concerned with the source and rollup indices. relates #42720. Co-authored-by: James Rodewig <[email protected]>
This commit moves the ownership of tracking the rollup_index from the RollupActionConfig to the RollupAction.Request. This is cleaner since the config should not be concerned with the source and rollup indices. relates #42720. Co-authored-by: James Rodewig <[email protected]>
this commit introduces a new Rollup ILM Action that allows indices to be rolled up according to a specific rollup config. The action also allows for the new rolled up index to be associated with a different policy than the original/source index. Relates elastic#42720. Closes elastic#48003.
The ability for rollup API to generate time-based indices that can be managed/retired over time is a common ask from the field. +1 on ILM integration. |
Will the new rollup feature also support |
The team has decided that there is a great fit between rollups and the new metrics database we are implementing (#74660). The fact that dimensions and metrics are first class citizens allows us to simplify the rollup configuration. Also, leveraging the index structure in TSDB results in greatly improved performance. The team views rollups as a fundamental operation of a metrics database and we focus all our efforts to develop such a feature. Hence, I am closing this issue and work on rollups/downsampling will be tracked inside #74660. |
This is a meta issue discussing a complete rewrite of the Elasticsearch rollup codebase with the aim to improve the following points:
Below we outline a high level plan of changes that will help us achieve the above goals:
_rollup_search
endpoint in favour of implementing search on rollup indexes within the_search
endpoint ([RollupV2] Implement search resolution #67783) @csouliosaggregate_metric
field type that will store pre-aggregated metrics (min
,max
,sum
etc) for a specific rollup group. (Implement aggregate_metric field mapper #49830)aggregate_metric
fields. Theaggregate_metric
field type will provide the correct metric to the requesting aggregator, while using the same field name. This field type will not be indexed but stored as a binary doc values. (Implement aggregations on aggregate metrics #53986)rollup_meta
cluster metadata to store information such as date_histogram interval and timezone etc. (Add RollupV2 cluster metadata behind feature-flag #64680)_doc_count
field andaggregate_metric_double
fieldtypes (Add RollupAction support for AggregateDoubleMetric fields #70534)delay
option, need to make a new Action-specific config without a delayThe text was updated successfully, but these errors were encountered: