
Percentile Support #93

Open · wants to merge 2 commits into master
Conversation

@jdhardy (Author) commented Nov 8, 2017

Fix #92 by adding support for calculating and displaying arbitrary percentiles via --benchmark-columns=p99.9. Only the requested percentile columns are calculated and displayed. If requested, they are also preserved in the CSV and JSON output formats.

Had some issues running tox tests, but the core variants all passed.

Calculate interpolated percentiles using the NIST method. This ensures
that p0 == min, p50 == median, and p100 == max.
To show percentile columns, use --benchmark-columns=p99.9.

Requested columns are added to the flattened output dict for display.
Percentile columns are only calculated and shown if requested. Cache
percentile results to avoid recalculation. Add usage note to
documentation.
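For reference, here is a minimal sketch of the NIST interpolation the commit message describes: the rank is p/100 * (n + 1), clamped to the extremes, which is what makes p0 the minimum, p50 the median, and p100 the maximum. This is only an illustration of the method, not the code in this PR:

```python
def percentile(data, p):
    """NIST interpolated percentile of `data` for p in [0, 100].

    rank = p/100 * (n + 1); ranks below 1 clamp to the minimum and
    ranks above n clamp to the maximum, so p0 == min, p50 == median,
    and p100 == max.
    """
    xs = sorted(data)
    n = len(xs)
    rank = p / 100.0 * (n + 1)
    if rank <= 1:
        return xs[0]
    if rank >= n:
        return xs[-1]
    k = int(rank)          # integer part of the rank (1-based)
    frac = rank - k        # fractional part used for interpolation
    return xs[k - 1] + frac * (xs[k] - xs[k - 1])
```

For example, `percentile([1, 2, 3, 4, 5], 50)` gives 3 (the median) and `percentile([1, 2, 3, 4], 50)` interpolates to 2.5.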
@jdhardy (Author) commented Nov 8, 2017

Here's what the output looks like with --benchmark-columns=Min,Median,P99.9,Max,Ops:

================================================= test session starts ==================================================
platform darwin -- Python 3.6.3, pytest-3.2.3, py-1.4.34, pluggy-0.4.0
benchmark: 3.1.1 (defaults: timer=time.perf_counter disable_gc=False min_rounds=5 min_time=0.000005 max_time=1.0 calibration_precision=10 warmup=False warmup_iterations=100000)
rootdir: /.../tests/perf, inifile: pytest-perf.ini
plugins: cov-2.5.1, benchmark-3.1.1
collected 3 items

tests/perf/test_Perf.py ...


-------------------------------------------- benchmark 'test_benchmark': 3 tests ---------------------------------------------
Name (time in ms)                  Min             Median               P99.9                 Max                OPS
------------------------------------------------------------------------------------------------------------------------------
benchmark[2000-xxxxxxxx]        9.0091 (1.0)      11.9048 (1.0)       43.1680 (1.0)       43.1680 (1.0)      78.7381 (1.0)
benchmark[200-xxxxxxxx]        11.8196 (1.31)     12.3910 (1.04)     401.4592 (9.30)     401.4592 (9.30)     47.8764 (0.61)
benchmark[200000-xxxxxxxx]     33.7439 (3.75)     37.7569 (3.17)      74.0722 (1.72)      74.0722 (1.72)     25.5273 (0.32)
------------------------------------------------------------------------------------------------------------------------------

Legend:
  Outliers: 1 Standard Deviation from Mean; 1.5 IQR (InterQuartile Range) from 1st Quartile and 3rd Quartile.
  OPS: Operations Per Second, computed as 1 / Mean
============================================== 3 passed in 14.32 seconds ===============================================

@ionelmc (Owner) commented Nov 10, 2017

This looks pretty good. I still need to look over the code a bit, but there is one thing that makes me a bit wary: the pXX.X column only gets into the saved data if it's active in the display (via --benchmark-columns).

This would be a problem if the user doesn't save the data (via either --benchmark-save-data or --benchmark-columns=pXX.X) and then tries to look at previous run data with a new or different pXX.X. This is basically my fault: I shouldn't have implemented --benchmark-save-data (the include_data in the internals), so I wonder if it's time to remove it, always include the full data, and just let users live with the few MBs of JSON ...

@ionelmc (Owner) commented Nov 10, 2017

I would love to hear some opinions on this problem.

@jdhardy (Author) commented Nov 30, 2017

Storing the percentiles was one part I wasn't sure about. Perhaps a new --benchmark-save-columns option for those who don't want to, or can't, store all the data?

@ionelmc (Owner) commented Feb 8, 2018

So I've been thinking about this again, and I want this feature. But I want these stats to always be saved.

@jdhardy Would you think people would ever want anything else besides p90, p95, p99, p99.9 and p99.99?

@jdhardy (Author) commented Feb 8, 2018 via email

@ionelmc (Owner) commented Aug 17, 2018

Oh damn it's almost a year.

So now I'm thinking about a compromise ... e.g., always compute p90, p95, p99, p99.9 and p99.99, and compute and store anything else on demand. Though it doesn't seem like anyone wants anything other than p90, p95, p99, p99.9 and p99.99, so maybe just remove the on-demand part? What do you think @jdhardy? (just asking for an opinion, not a PR rework)
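The compromise described above could be sketched as follows: always compute a fixed percentile set and include it in the saved stats, so older runs can be compared without re-running the benchmark. The names here (`ALWAYS_SAVED`, `stats_with_percentiles`) are hypothetical, not the plugin's actual internals:

```python
# Hypothetical sketch of the "always save a fixed set" compromise.
ALWAYS_SAVED = (90, 95, 99, 99.9, 99.99)

def percentile(data, p):
    """Interpolated percentile with rank p/100 * (n + 1), clamped at the ends."""
    xs = sorted(data)
    n = len(xs)
    rank = p / 100.0 * (n + 1)
    if rank <= 1:
        return xs[0]
    if rank >= n:
        return xs[-1]
    k = int(rank)
    return xs[k - 1] + (rank - k) * (xs[k] - xs[k - 1])

def stats_with_percentiles(timings):
    """Return a stats dict that always includes the fixed percentile set,
    keyed as 'p90', 'p95', ..., 'p99.99'."""
    return {f"p{p:g}": percentile(timings, p) for p in ALWAYS_SAVED}
```

With this layout, a later comparison run can read `stats["p99.9"]` straight out of the saved JSON regardless of which columns were displayed at save time.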

@jdhardy (Author) commented Aug 28, 2018

I'm actually (finally) circling back on this in the next week or two. I'm fine with always doing p90/95/99/99.9/99.99 to start with and then seeing if anyone wants anything else. That should cover the vast majority of use cases.

@ionelmc ionelmc added this to the v3.2.0 milestone Jan 3, 2019
@ionelmc (Owner) commented Jan 3, 2019

@jdhardy hey, you still wanna work on this?

@jdhardy (Author) commented Jan 4, 2019 via email

@ionelmc ionelmc modified the milestones: v3.2.0, v3.3.0 Jan 7, 2019
@jdhardy (Author) commented Feb 4, 2020

Yikes. I haven't had a chance to look at this for a while, and probably won't for a while longer (I'm no longer working on that project). Feel free to close it, and apologies for not being able to follow through.

Linked issue: Percentiles. 2 participants.