This repository has been archived by the owner on Feb 27, 2023. It is now read-only.

Improve treeHasher performance #42

Merged
merged 1 commit into celestiaorg:master from cuonglm/improve-treehasher
Jul 14, 2021

Conversation

cuonglm
Contributor

@cuonglm cuonglm commented Jul 13, 2021

By doing two things:

  • Pre-allocate enough capacity for the slice up front, instead of
    initializing a small one and continuously appending to it.
  • Make the placeholder slice reusable, extending it only when the
    current hasher size is greater than the cached one.

That improves speed and reduces allocations for Update/Delete
operations:

name                       old time/op    new time/op    delta
SparseMerkleTree_Update-8    11.9µs ± 3%    11.2µs ± 3%   -5.89%  (p=0.008 n=5+5)
SparseMerkleTree_Delete-8    9.66µs ± 2%    5.40µs ± 2%  -44.12%  (p=0.008 n=5+5)

name                       old alloc/op   new alloc/op   delta
SparseMerkleTree_Update-8    17.7kB ± 0%    16.1kB ± 1%   -9.42%  (p=0.016 n=4+5)
SparseMerkleTree_Delete-8    16.3kB ± 0%    13.9kB ± 0%  -14.82%  (p=0.008 n=5+5)

name                       old allocs/op  new allocs/op  delta
SparseMerkleTree_Update-8       117 ± 0%        63 ± 0%  -46.15%  (p=0.029 n=4+4)
SparseMerkleTree_Delete-8      94.4 ± 1%      28.6 ± 2%  -69.70%  (p=0.008 n=5+5)

@cuonglm
Contributor Author

cuonglm commented Jul 13, 2021

cc @odeke-em

@codecov-commenter

codecov-commenter commented Jul 13, 2021

Codecov Report

Merging #42 (2066960) into master (072abf0) will not change coverage.
The diff coverage is 100.00%.

Impacted file tree graph

@@           Coverage Diff           @@
##           master      #42   +/-   ##
=======================================
  Coverage   86.05%   86.05%           
=======================================
  Files           6        6           
  Lines         466      466           
=======================================
  Hits          401      401           
  Misses         37       37           
  Partials       28       28           
Impacted Files Coverage Δ
treehasher.go 100.00% <100.00%> (ø)

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 072abf0...2066960. Read the comment docs.

Contributor

@odeke-em odeke-em left a comment


LGTM, thank you @cuonglm! Let's also see if we can shave off by reusing the hashers

@cuonglm
Contributor Author

cuonglm commented Jul 13, 2021

> LGTM, thank you @cuonglm! Let's also see if we can shave off by reusing the hashers

Yeah, I think it can, but want to keep the PR as small as possible.

@odeke-em
Contributor

> LGTM, thank you @cuonglm! Let's also see if we can shave off by reusing the hashers

> Yeah, I think it can, but want to keep the PR as small as possible.

Absolutely, I meant to write/add "later on"

Member

@liamsi liamsi left a comment


Besides a minor nit, looks good to me 👍 thanks for this optimisation!

@cuonglm cuonglm requested a review from adlerjohn July 14, 2021 03:12
@cuonglm cuonglm force-pushed the cuonglm/improve-treehasher branch from da97295 to 6098408 on July 14, 2021 03:13
By doing two things:

 - Pre-allocate enough capacity for the slice up front, instead of
   initializing a small one and continuously appending to it.
 - Initialize the placeholder slice once instead of creating a new one
   every time treeHasher.placeholder is called.

That improves speed and reduces allocations for Update/Delete
operations:

name                       old time/op    new time/op    delta
SparseMerkleTree_Update-8    11.9µs ± 3%    11.2µs ± 3%   -5.89%  (p=0.008 n=5+5)
SparseMerkleTree_Delete-8    9.66µs ± 2%    5.40µs ± 2%  -44.12%  (p=0.008 n=5+5)

name                       old alloc/op   new alloc/op   delta
SparseMerkleTree_Update-8    17.7kB ± 0%    16.1kB ± 1%   -9.42%  (p=0.016 n=4+5)
SparseMerkleTree_Delete-8    16.3kB ± 0%    13.9kB ± 0%  -14.82%  (p=0.008 n=5+5)

name                       old allocs/op  new allocs/op  delta
SparseMerkleTree_Update-8       117 ± 0%        63 ± 0%  -46.15%  (p=0.029 n=4+4)
SparseMerkleTree_Delete-8      94.4 ± 1%      28.6 ± 2%  -69.70%  (p=0.008 n=5+5)
@cuonglm cuonglm force-pushed the cuonglm/improve-treehasher branch from 6098408 to 2066960 on July 14, 2021 06:49
Contributor

@tac0turtle tac0turtle left a comment


nice!!

@liamsi liamsi merged commit a431f73 into celestiaorg:master Jul 14, 2021
@odeke-em odeke-em deleted the cuonglm/improve-treehasher branch July 14, 2021 08:42
treehasher.go (diff hunk under review):

-	copy(value, leafPrefix)
+	value := make([]byte, 0, len(leafPrefix)+len(path)+len(leafData))
+	value = append(value, leafPrefix...)


copy should be faster than append

Contributor Author


@robert-zaremba It should be the same if we pre-allocate the slice, which we did here.


As far as I remember, copy is faster. Maybe something changed in the recent Go releases.

Member


As far as I remember it will literally call the same code under the hood if the slice is pre-allocated. Append is just slightly more idiomatic. Especially, if it is chained / composed of multiple appends. I'm happy to verify this with a benchmark.

Contributor Author

@cuonglm cuonglm Aug 18, 2021


@robert-zaremba Here's a quick proof:

package main

import "testing"

var data = [512]int{}

func BenchmarkAppend(b *testing.B) {
	for i := 0; i < b.N; i++ {
		s := make([]int, 0, 512)
		s = append(s, data[:]...)
	}
}

func BenchmarkCopy(b *testing.B) {
	for i := 0; i < b.N; i++ {
		s := make([]int, 512)
		copy(s, data[:])
	}
}

The result:

goos: darwin
goarch: arm64
BenchmarkAppend-8   	 8270780	       136.6 ns/op	       0 B/op	       0 allocs/op
BenchmarkCopy-8     	 8787560	       136.6 ns/op	       0 B/op	       0 allocs/op
PASS
ok  	command-line-arguments	2.793s

The results vary between runs, but only by a few nanoseconds.


very good. Thanks @cuonglm
