
Performance Metrics Tracking #8931

Closed
wants to merge 36 commits

Conversation

@arindam1993 (Contributor) commented Nov 1, 2019

What?

Add tooling and a benchmark suite that can be used to automatically gather performance metrics across multiple browsers and devices.

Why?

Make it easier to detect and catch performance regressions automatically, and to measure the overall health of our SDK.

Which metrics?

Load Time metrics:

Measures "How soon till I see something on screen?"

  • loadTime (ms): Time elapsed from new Map() to the map.on('load') event; this is the point at which all the vector geometry has filled the screen.
  • fullLoadTime (ms): Time elapsed from new Map() to map.on('content.load') (this event is new, so the name is subject to change); this is the point at which all content defined by the style has been loaded, including text label placement finishing, raster tiles finishing loading for satellite and/or hillshade layers, etc. (A minimal measurement sketch follows this list.)
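
A minimal sketch of how a test page could capture these two timestamps. Here styleUrl is a placeholder, and 'content.load' is the proposed event described above, not a final gl-js API:

```js
// Sketch: capture loadTime and fullLoadTime from a test page.
const start = performance.now();
const map = new mapboxgl.Map({container: 'map', style: styleUrl});

map.once('load', () => {
    // All vector geometry has filled the screen.
    const loadTime = performance.now() - start;
    console.log(`loadTime: ${loadTime.toFixed(1)} ms`);
});

map.once('content.load', () => {
    // All content defined by the style has finished loading
    // (label placement, raster tiles, etc.).
    const fullLoadTime = performance.now() - start;
    console.log(`fullLoadTime: ${fullLoadTime.toFixed(1)} ms`);
});
```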

Rendering Performance Metrics:

Measures "How fluid does it feel to interact with the map?"

This is slightly trickier to quantify. Average frame-rate is a good measure, but it doesn't completely capture all sources of jank/hitch in an interactive application. For that we'd have to track frame-time, which is the amount of time required to render each frame. This is not a single number; we need to track it for every frame we render so we can detect spikes.
Here's a rather interesting article going into the details of why: https://cgvr.cs.ut.ee/wp/index.php/frame-rate-vs-frame-time/
To quantify jank from our frame-time data, we can take the slowest 1% of our frames and measure their average frame-rate. The closer our 1% Low FPS is to our Average FPS, the less janky our rendering engine is. (A sketch of how these numbers could be computed follows the list below.)

  • frameTimes (Array): an array of millisecond values that track how long it took to render each frame; this can be used to plot a frame-time chart.
  • fps (frames per second): the average frames per second for the entire run.
  • onePercentLowFps (frames per second): the frame-rate of our 1% slowest frames.
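
A minimal sketch of how fps and onePercentLowFps could be derived from the collected frameTimes array (illustrative only, not the exact instrumentation code):

```js
// Summarize a run from per-frame render durations (in milliseconds).
function summarizeFrameTimes(frameTimes) {
    const totalMs = frameTimes.reduce((sum, t) => sum + t, 0);
    const fps = 1000 / (totalMs / frameTimes.length); // average FPS over the run

    // Average frame-rate of the slowest 1% of frames.
    const sorted = frameTimes.slice().sort((a, b) => b - a);
    const count = Math.max(1, Math.floor(sorted.length * 0.01));
    const slowestAvgMs = sorted.slice(0, count).reduce((sum, t) => sum + t, 0) / count;
    const onePercentLowFps = 1000 / slowestAvgMs;

    return {frameTimes, fps, onePercentLowFps};
}
```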

How?

  • Add instrumentation to gl-js that tracks these metrics and can output them (a rough sketch of what this could look like follows this list).
  • Add build tooling that can strip the instrumentation out of release builds.
  • Add a benchmark suite that defines certain styles and Map operations on those styles.
  • Similar to the query test runner, build a test page that starts running the benchmark suite when loaded in a browser; this will make it easy to run the suite on multiple browsers and devices.

After some discussion with @ryanhamley today:

  • Check consistency of metrics on CI runs.
  • Integrate with memory profiling (Add some memory stats to gl-stats #8949).
  • Potentially integrate with gl-stats?
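
As a rough illustration of the first two bullets, the instrumentation could live behind a build-time flag so a bundler can drop it from release builds via dead-code elimination. All names here are hypothetical, not the final gl-js API:

```js
// Hypothetical instrumentation module; ENABLE_PERF_METRICS would be replaced
// with `false` at build time so release bundles drop these code paths.
const ENABLE_PERF_METRICS = true;

export const metrics = {
    loadStart: 0,
    loadTime: 0,
    fullLoadTime: 0,
    frameTimes: []
};

export function markLoadStart() {
    if (!ENABLE_PERF_METRICS) return;
    metrics.loadStart = performance.now();
}

export function markLoad() {
    if (!ENABLE_PERF_METRICS) return;
    metrics.loadTime = performance.now() - metrics.loadStart;
}

export function markFullLoad() {
    if (!ENABLE_PERF_METRICS) return;
    metrics.fullLoadTime = performance.now() - metrics.loadStart;
}

export function recordFrame(frameTimeMs) {
    if (!ENABLE_PERF_METRICS) return;
    metrics.frameTimes.push(frameTimeMs);
}
```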

How to run?

@ryanhamley changed the title from Performance Mertics Tracking to Performance Metrics Tracking on Nov 1, 2019
@ryanhamley (Contributor)

Add a benchmark suite that defines certain styles, and Map operations on those styles.

We do have style benchmarking already which runs the benchmark suite against various styles. You could probably leverage that to add operations to run against the styles.

Similar to the query test runner, build a test-page that when loaded in a browser starts running the benchmark suite, this will make it easy to run the suite on multiple browsers on various devices.

The benchmark suite already runs in the browser. Or are you talking about a totally new benchmark suite? If so, we might want to come up with some different terminology to disambiguate them

@arindam1993 (Contributor, Author)

Add a benchmark suite that defines certain styles, and Map operations on those styles.

We do have style benchmarking already which runs the benchmark suite against various styles. You could probably leverage that to add operations to run against the styles.

The benchmark suite is built for gathering granular performance data for very specific sections of the pipeline. The idea with this is to gather higher-level performance metrics for the entire system.
I think the benchmark suite acts kind of like unit tests for performance, whereas this acts more like an integration test for performance. The data gathered will not be as granular, but it should be a better summary that helps us catch performance regressions. So I'm trying to go with something that has a similar, if not inter-compatible, format with our render/query test fixtures.

Similar to the query test runner, build a test-page that when loaded in a browser starts running the benchmark suite, this will make it easy to run the suite on multiple browsers on various devices.

The benchmark suite already runs in the browser. Or are you talking about a totally new benchmark suite? If so, we might want to come up with some different terminology to disambiguate them

I agree, I think calling it the performance-metrics suite makes more sense?

@arindam1993 requested a review from mourner November 7, 2019 02:49
@arindam1993 (Contributor, Author)

After speaking with @ClareTrainor, we realized it would be helpful to allow the style to be dynamically overridden, so the map design team can generate these metrics while keeping the SDK constant and varying the style, whereas we can keep the style constant while varying the SDK. (A rough sketch of one way to do this is below.)
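
For example (just a sketch of one possible approach, not a settled design), the test page could accept a style override via a query parameter:

```js
// Sketch: let the benchmark page swap the style without rebuilding the SDK.
const params = new URLSearchParams(window.location.search);
const styleUrl = params.get('style') || 'mapbox://styles/mapbox/streets-v11';
const map = new mapboxgl.Map({container: 'map', style: styleUrl});
```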

ansis and others added 6 commits November 11, 2019 15:22
move featureMap from ProgramConfiguration to ProgramConfigurationSet.
All configurations within a set have the same vertex layout (because
they go together with the same vertex buffers).

This doesn't matter too much because the only time a set has more than
one ProgramConfiguration is when multiple layers have identical layout
properties. The main goal here is to make the relationship a tiny bit
clearer.
- Add circle job to run metrics
- Implement messaging system between browser<->puppeteer
- Calculate mean and std-deviation of metrics for final output
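
A rough sketch of what the browser<->puppeteer driver and the mean/std-deviation step could look like. This is an assumed shape only, not the exact code in these commits; the benchmark page is assumed to call a window.reportMetrics callback when a run finishes:

```js
const puppeteer = require('puppeteer');

// Aggregate a list of numbers into mean and standard deviation.
function meanAndStdDev(values) {
    const mean = values.reduce((s, v) => s + v, 0) / values.length;
    const variance = values.reduce((s, v) => s + (v - mean) ** 2, 0) / values.length;
    return {mean, stdDev: Math.sqrt(variance)};
}

// Load the benchmark page once and wait for it to report its metrics.
async function collectRun(browser, url) {
    const page = await browser.newPage();
    let resolveMetrics;
    const metricsPromise = new Promise(resolve => { resolveMetrics = resolve; });
    await page.exposeFunction('reportMetrics', metrics => resolveMetrics(metrics));
    await page.goto(url);
    const metrics = await metricsPromise;
    await page.close();
    return metrics;
}

async function main(url, runs = 5) {
    const browser = await puppeteer.launch();
    const results = [];
    for (let i = 0; i < runs; i++) {
        results.push(await collectRun(browser, url));
    }
    console.log('loadTime', meanAndStdDev(results.map(r => r.loadTime)));
    console.log('fps', meanAndStdDev(results.map(r => r.fps)));
    await browser.close();
}
```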
@arindam1993 (Contributor, Author)

Closing this, and moving the discussion to a separate issue. Most of the driver code should be split out into a separate repo.
