
Create benchmarks for queries #13605

Closed
facundomedica opened this issue Oct 20, 2022 · 11 comments
@facundomedica
Member

Summary

Add benchmarks that run queries against a pre-defined data directory in order to know whether a PR results in an improvement or a deterioration in performance.

Problem Definition

Currently, in many cases we don't have any metrics to understand the performance impact of merging a PR.

Proposal

Create a series of benchmarks like this:

func BenchmarkQueryStakingDelegations(b *testing.B) {
  // Create the app and load state from a pre-defined db directory.

  b.ReportAllocs()
  b.ResetTimer() // exclude the setup above from the measurement
  for i := 0; i < b.N; i++ {
    // Run the staking delegations query against the loaded state.
  }
}

Proposed by @alexanderbez

@alexanderbez
Contributor

Love this! This will give us a good way to measure the baseline and any changes or improvements. I propose we have benchmarks for the following queries:

  • Balance (x/bank)
  • AllBalances (x/bank)
  • Proposals (x/gov)
  • Votes (x/gov)
  • SigningInfos (x/slashing)
  • ValidatorDelegations (x/staking) <- this will be a good one

We can use https://quicksync.io/networks/cosmos.html to get data directories to feed into the app.
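
As a sketch of what one of these could look like, here is a hedged take on the Balance benchmark. openAppFromSnapshot is a hypothetical helper (the real app/context wiring is SDK-version-specific), and the small interface simply mirrors the x/bank keeper method being exercised:

package simapp_test

import (
  "testing"

  sdk "github.com/cosmos/cosmos-sdk/types"
)

// bankBalanceQuerier mirrors the x/bank keeper method exercised below.
type bankBalanceQuerier interface {
  GetBalance(ctx sdk.Context, addr sdk.AccAddress, denom string) sdk.Coin
}

// openAppFromSnapshot is a hypothetical helper that would restore an app
// and query context from a quicksync-style data directory. Stubbed here
// so the sketch stays self-contained.
func openAppFromSnapshot(b *testing.B, dir string) (bankBalanceQuerier, sdk.Context) {
  b.Helper()
  b.Skip("stub: point this at a real data directory")
  return nil, sdk.Context{}
}

func BenchmarkQueryBalance(b *testing.B) {
  keeper, ctx := openAppFromSnapshot(b, "path/to/datadir")
  addr := sdk.AccAddress("known-funded-address") // placeholder address

  b.ReportAllocs()
  b.ResetTimer()
  for i := 0; i < b.N; i++ {
    _ = keeper.GetBalance(ctx, addr, "uatom")
  }
}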

@peterbourgon

peterbourgon commented Oct 23, 2022

Just want to note that, in order for benchmarks like these to produce results that are reliably reproducible and comparable, it's important that the state they operate on is committed to version control. A good litmus test: git checkout <rev> followed by go test -bench=whatever should always do exactly the same thing for everyone, and shouldn't rely on some script to populate test data or whatever. Snapshots from quicksync.io are (of course) enormous, so if you want to use them as a starting point that's cool, just be sure to extract and commit a (tiny) representative sample of the data.
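
As a minimal illustration of that litmus test (the file name and format are assumptions; the point is only that the fixture lives in git under testdata/):

package simapp_test

import (
  "os"
  "testing"
)

// loadSample reads a small, representative state sample committed to
// version control, so `git checkout <rev>` + `go test -bench=...` behaves
// identically for everyone. The file name is illustrative.
func loadSample(b *testing.B) []byte {
  b.Helper()
  data, err := os.ReadFile("testdata/delegations_sample.json")
  if err != nil {
    b.Fatal(err)
  }
  return data
}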

@alexanderbez
Contributor

Just want to note that, in order for benchmarks like these to produce results that are reliably reproducible and comparable, it's important that the state they operate on is committed to version control.

We're not going to commit the state to version control. The state we're benchmarking on can range from GBs to TBs. What we will do instead is post the results along with the profiles and reproduction steps (including hashes of state files).

However, we can include a script that automates all of this.

@peterbourgon

peterbourgon commented Oct 23, 2022

Exactly my point — benchmarks are meaningful when they exercise a representative, well-defined, and ultimately small sample of a codepath, so that go bench can run it over and over for the stats. Loading up GBs of data adds a ton of variance, and that makes the benchmark unreliable.

Should definitely do a test like this! But it'd be a performance soak test or something. Different class of thing. Happy to talk more about it if there's interest.

@alexanderbez
Contributor

The benchmarks on these varying sample sizes will give us insights into the major bottlenecks. As we discover those, we'll continue to refine existing benchmarks and introduce new ones along those bottleneck paths to get a better understanding. So having these benchmarks on large samples will give us valuable insights.

@peterbourgon

If you mean a benchmark in a broader sense, and not a go bench benchmark, then all good.

@alexanderbez
Contributor

I do mean go bench as a jumping off point.

@peterbourgon

So go bench is built on the assumption that the code you run b.N times in a BenchmarkXxx function will be quick (typically sub-millisecond) and consume basically the same amount of CPU and memory with each iteration. If the code under observation, like the specific CPU instructions that are called, can be substantially different from run to run, or if each run takes more than a second or two to finish (at worst!), or especially if the code does any kind of IO (disk, network, etc.), then go bench won't produce valid results. So if you can load those GBs of data up and benchmark sufficiently isolated parts of the state infrastructure then SGTM, but e.g. an iterator across all of Terra's pools (or whatever) isn't something you can run through go bench 😉
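
Concretely, the pattern described above looks like this (a plain map stands in for state loaded from disk, so the sketch stays self-contained and runnable):

package bench_test

import (
  "strconv"
  "testing"
)

// loadState does the expensive, IO-like setup once; a map stands in for
// application state loaded from a data directory.
func loadState(b *testing.B) map[string]int64 {
  b.Helper()
  s := make(map[string]int64, 100_000)
  for i := 0; i < 100_000; i++ {
    s["balance/"+strconv.Itoa(i)] = int64(i)
  }
  return s
}

func BenchmarkBalanceLookup(b *testing.B) {
  s := loadState(b)

  b.ResetTimer() // the setup above is excluded from the measurement
  for i := 0; i < b.N; i++ {
    // Each iteration is fast and does the same amount of work,
    // which is what makes b.N-style timing statistically valid.
    _ = s["balance/"+strconv.Itoa(i%100_000)]
  }
}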

@alexanderbez
Contributor

We will find out 😉

Thanks for your tremendously valuable input as always 🙏

@facundomedica
Member Author

I had posted this in the wrong place. Final results:

go test -benchmem -timeout=120h -benchtime=120s -run=^$ -bench "^(BenchmarkQueryBalance|BenchmarkQueryAllBalances|BenchmarkQueryVotes|BenchmarkQuerySigningInfos|BenchmarkQueryValidatorDelegations)$" -memprofile memprofile.out -cpuprofile profile.out cosmossdk.io/simapp

goos: linux
goarch: amd64
pkg: cosmossdk.io/simapp
cpu: AMD Ryzen 9 5950X 16-Core Processor            
BenchmarkQueryBalance-28                 	66626565	      1993 ns/op	    1248 B/op	      23 allocs/op
BenchmarkQueryAllBalances-28             	 5418576	     26516 ns/op	   13880 B/op	     194 allocs/op
BenchmarkQueryVotes-28                   	   46556	   3026516 ns/op	 2882343 B/op	   58778 allocs/op
BenchmarkQuerySigningInfos-28            	  162439	    835128 ns/op	  830759 B/op	   14910 allocs/op
BenchmarkQueryValidatorDelegations-28    	      60	2359649179 ns/op	1735087374 B/op	28170831 allocs/op
PASS
ok  	cosmossdk.io/simapp	31174.6

The zip file contains both memory and CPU profiles.
profiles.zip
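
To turn raw runs like the above into the per-PR comparison this issue asks for, one common workflow (assumed here, not prescribed in the thread) is to run the suite on both branches and diff the outputs with benchstat:

# On main: collect a baseline (-count=10 gives benchstat enough samples).
go test -run='^$' -bench='^BenchmarkQuery' -count=10 cosmossdk.io/simapp > old.txt
# Check out the PR branch and repeat.
go test -run='^$' -bench='^BenchmarkQuery' -count=10 cosmossdk.io/simapp > new.txt
benchstat old.txt new.txt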

@facundomedica
Member Author

And here we have the results of the same queries but through the HTTP interface: https://drive.google.com/file/d/1O1AB-rM095zj9to3F05M2qlhqbJ3I7PJ/view?usp=share_link

The .bin file is the output of vegeta.
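
For reference, a vegeta invocation that yields such a .bin file looks roughly like this (endpoint, rate, and duration are assumptions, not values taken from the linked results):

# Hypothetical load-test sketch; replace <address> with a funded account.
echo "GET http://localhost:1317/cosmos/bank/v1beta1/balances/<address>" \
  | vegeta attack -rate=100 -duration=30s > results.bin
vegeta report results.bin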

@tac0turtle closed this as not planned on Jan 16, 2023.