feat: add aggregated rocksdb metrics #6354
Merged
This patch adds a pattern for computing and reporting metrics that aggregate the
per-store rocksdb metrics added by KIP-607. In addition, this particular PR adds the
following metrics:
- `num-running-compactions-total`: the total number of running compactions
- `estimate-num-keys-total`: an estimate of the total number of rocksdb keys
- `block-cache-usage-total`: the total memory usage of all block caches
- `block-cache-pinned-usage-total`: the total memory used by pinned blocks
- `estimate-table-readers-mem-total`: an estimate of the total memory used by table readers
ksqlDB registers for notification about new rocksdb metrics by creating a
MetricsReporter implementation called RocksDBMetricCollector. The metrics
system calls into MetricsReporter.metricChange whenever a new metric is added.
RocksDBMetricCollector watches for the rocksdb property metrics it cares about
and tracks them under the relevant aggregates. Each aggregate is registered
with the ksql metrics context the first time RocksDBMetricCollector is
instantiated, as sketched below.
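To make the mechanism concrete, here is a minimal sketch (not the actual ksqlDB
class) of a MetricsReporter that intercepts per-store metrics and tracks them
under shared aggregates. The property names are my reading of KIP-607, and the
`TRACKED` map bookkeeping is illustrative:

```java
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

import org.apache.kafka.common.MetricName;
import org.apache.kafka.common.metrics.KafkaMetric;
import org.apache.kafka.common.metrics.MetricsReporter;

public class RocksDBMetricCollectorSketch implements MetricsReporter {

  // KIP-607 property names whose per-store values we aggregate (assumed set)
  private static final List<String> PROPERTIES = List.of(
      "num-running-compactions",
      "estimate-num-keys",
      "block-cache-usage",
      "block-cache-pinned-usage",
      "estimate-table-readers-mem"
  );

  // shared across all instances: property name -> per-store metric handles
  private static final Map<String, Map<MetricName, KafkaMetric>> TRACKED =
      new ConcurrentHashMap<>();

  @Override
  public void init(final List<KafkaMetric> metrics) {
    // pick up any metrics that already exist when the reporter is created
    metrics.forEach(this::metricChange);
  }

  @Override
  public void metricChange(final KafkaMetric metric) {
    final MetricName name = metric.metricName();
    if (PROPERTIES.contains(name.name())) {
      TRACKED.computeIfAbsent(name.name(), k -> new ConcurrentHashMap<>())
          .put(name, metric);
    }
  }

  @Override
  public void metricRemoval(final KafkaMetric metric) {
    final Map<MetricName, KafkaMetric> handles =
        TRACKED.get(metric.metricName().name());
    if (handles != null) {
      handles.remove(metric.metricName());
    }
  }

  @Override
  public void close() {
  }

  @Override
  public void configure(final Map<String, ?> configs) {
  }
}
```

In practice the reporter is wired in via the `metric.reporters` config, so the
Streams metrics system invokes metricChange/metricRemoval as state-store
metrics come and go.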
Metrics are computed lazily when read, and recomputation is rate-limited to a
configurable interval. The interval is set using the property
`ksql.rocksdb.metrics.update.interval.seconds`.
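As an illustration of the lazy, rate-limited read path, here is a minimal
sketch of an aggregate gauge; the `IntervalGauge` name and the caller-supplied
`compute` function (which would sum the tracked per-store values) are mine, not
the PR's:

```java
import java.math.BigInteger;
import java.util.concurrent.atomic.AtomicLong;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Supplier;

import org.apache.kafka.common.metrics.Gauge;
import org.apache.kafka.common.metrics.MetricConfig;

// Serves a cached aggregate and recomputes it at most once per update
// interval, so reads stay cheap no matter how often the metric is scraped.
final class IntervalGauge implements Gauge<BigInteger> {

  private final long intervalMs;
  private final Supplier<BigInteger> compute; // e.g. sums the tracked per-store values
  private final AtomicLong lastUpdateMs = new AtomicLong(0); // 0 forces a compute on first read
  private final AtomicReference<BigInteger> cached =
      new AtomicReference<>(BigInteger.ZERO);

  IntervalGauge(final long intervalMs, final Supplier<BigInteger> compute) {
    this.intervalMs = intervalMs;
    this.compute = compute;
  }

  @Override
  public BigInteger value(final MetricConfig config, final long now) {
    final long last = lastUpdateMs.get();
    // lazily recompute on read, rate-limited to the configured interval;
    // compareAndSet ensures only one concurrent reader pays the recompute cost
    if (now - last >= intervalMs && lastUpdateMs.compareAndSet(last, now)) {
      cached.set(compute.get());
    }
    return cached.get();
  }
}
```

Each aggregate would then be registered once, e.g. via
`Metrics.addMetric(metricName, gauge)`, with the interval presumably derived
from `ksql.rocksdb.metrics.update.interval.seconds`.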
One alternative I considered was to dynamically add the metrics as they are sent
to RocksDBMetricCollector.metricChange (rather than hard-coding a static list).
I opted not to do this in case we add metrics in the future that use different
types, or want to compute different aggregates (e.g. for some metrics a max or
average may make more sense).
Testing done
Ran our aggregation benchmark with these metrics collected and 1000 partitions,
and saw no performance regression (processing rate: 39098 records/second).