
optimize Generated{Abstract,Product}Algebra with benchmarking #591

Merged
merged 3 commits into from
Dec 2, 2016

Conversation

sritchie
Collaborator

@sritchie sritchie commented Dec 1, 2016

This PR

  • Adds a benchmark for Tuple4 (seemed like a nice one to pick :)
  • Corrects some bad performance in the generated sumOption implementations that this benchmark uncovered.

Run the benchmark with:

sbt algebird-benchmark/jmh:run -t1 -f 1 -wi 5 -i 15 .*Tuple4Benchmark.*

Benchmark Results

Before the changes:

[info] Benchmark                             (numElements)   Mode  Cnt     Score     Error  Units
[info] Tuple4Benchmark.timeProductPlus               10000  thrpt   15  1870.581 ± 149.159  ops/s
[info] Tuple4Benchmark.timeProductSumOption          10000  thrpt   15   492.128 ±  29.016  ops/s
[info] Tuple4Benchmark.timeTuplePlus                 10000  thrpt   15  3911.323 ± 106.611  ops/s
[info] Tuple4Benchmark.timeTupleSumOption            10000  thrpt   15  2682.713 ±  39.312  ops/s

After the changes:

[info] Benchmark                             (numElements)   Mode  Cnt     Score     Error  Units
[info] Tuple4Benchmark.timeProductPlus               10000  thrpt   15  1934.965 ±  50.165  ops/s
[info] Tuple4Benchmark.timeProductSumOption          10000  thrpt   15  2896.808 ±  71.128  ops/s
[info] Tuple4Benchmark.timeTuplePlus                 10000  thrpt   15  3893.420 ± 139.511  ops/s
[info] Tuple4Benchmark.timeTupleSumOption            10000  thrpt   15  4073.248 ±  60.992  ops/s
[success] Total time: 88 s, completed Dec 1, 2016 12:48:43 PM

So, ~6x improvement for the ProductN semigroups and ~1.5x improvement for the TupleN semigroups. And, more importantly, they're all more efficient than just using plus.

Note that the benchmark only adds Longs, so it measures the improvement of sumOption on tuples whose elements don't have an efficient sumOption implementation of their own. When they do, these sumOption implementations will be that much faster.
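To make the optimization concrete, here is a minimal, self-contained sketch of the idea (not Algebird's actual generated code — `Semigroup`, `LongSemigroup`, and `Tuple2Semigroup` below are simplified stand-ins, and the real implementation flushes through a fixed-size buffer rather than collecting everything): instead of calling plus pairwise per element, split each tuple component into its own buffer and call the component semigroup's sumOption once per slot.

```scala
import scala.collection.mutable.ArrayBuffer

// Simplified stand-in for Algebird's Semigroup.
trait Semigroup[T] {
  def plus(l: T, r: T): T
  // Default: fold with plus. Component semigroups may override this
  // with something faster, which the tuple version then inherits.
  def sumOption(ts: Iterable[T]): Option[T] = ts.reduceOption(plus)
}

object LongSemigroup extends Semigroup[Long] {
  def plus(l: Long, r: Long): Long = l + r
}

class Tuple2Semigroup[A, B](sa: Semigroup[A], sb: Semigroup[B])
    extends Semigroup[(A, B)] {
  def plus(l: (A, B), r: (A, B)): (A, B) =
    (sa.plus(l._1, r._1), sb.plus(l._2, r._2))

  // One pass to split the components, then one sumOption call per slot,
  // instead of allocating a fresh tuple for every pairwise plus.
  override def sumOption(ts: Iterable[(A, B)]): Option[(A, B)] = {
    val as = ArrayBuffer.empty[A]
    val bs = ArrayBuffer.empty[B]
    ts.foreach { case (a, b) => as += a; bs += b }
    for (a <- sa.sumOption(as); b <- sb.sumOption(bs)) yield (a, b)
  }
}
```

For example, `new Tuple2Semigroup(LongSemigroup, LongSemigroup).sumOption(List((1L, 2L), (3L, 4L)))` returns `Some((4L, 6L))`.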

@@ -166,6 +166,7 @@ lazy val algebird = Project(
base = file("."),
settings = sharedSettings)
.settings(noPublishSettings)
.settings(coverageExcludedPackages := "<empty>;.*\\.benchmark\\..*")
Collaborator Author

gotta increase the coverage!

@sritchie sritchie force-pushed the sritchie/tuple_performance branch from 1122d43 to 00a457d on December 1, 2016, 20:02
@codecov-io
Copy link

codecov-io commented Dec 1, 2016

Current coverage is 81.03% (diff: 100%)

Merging #591 into develop will increase coverage by 6.85%

@@            develop       #591   diff @@
==========================================
  Files           122        113     -9   
  Lines          4694       4903   +209   
  Methods        4268       4517   +249   
  Messages          0          0          
  Branches        387        385     -2   
==========================================
+ Hits           3482       3973   +491   
+ Misses         1212        930   -282   
  Partials          0          0          

Powered by Codecov. Last update 87cbf25...83cb518

@@ -216,6 +216,7 @@ lazy val algebirdTest = module("test").settings(
).dependsOn(algebirdCore)

lazy val algebirdBenchmark = module("benchmark").settings(JmhPlugin.projectSettings:_*).settings(
coverageExcludedPackages := "com\\.twitter\\.algebird\\.benchmark.*",
Contributor
this is hurting our coverage!

* the `sumOption` implementation of the supplied Semigroup[T]
*/
def fromSumOption[T](size: Int)(implicit sg: Semigroup[T]) =
new ArrayBufferedOperation[T, T](1000) with BufferedReduce[T] {

aren't you ignoring size?

* Returns an ArrayBufferedOperation instance that internally uses
* the `sumOption` implementation of the supplied Semigroup[T]
*/
def fromSumOption[T](size: Int)(implicit sg: Semigroup[T]) =

can we put a narrow return type on this? Maybe BufferedReduce[T]?
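For context, here is a sketch of what the helper looks like with both review points addressed: `size` passed through instead of the hardcoded 1000, and an explicit (narrower) return type. `ArrayBufferedOperation` and `BufferedReduce` below are simplified stand-ins for the Algebird classes under review, not their real definitions.

```scala
import scala.collection.mutable.ArrayBuffer

trait Semigroup[T] {
  def plus(l: T, r: T): T
  def sumOption(ts: Iterable[T]): Option[T] = ts.reduceOption(plus)
}

// Simplified stand-ins for Algebird's buffered-operation machinery.
abstract class ArrayBufferedOperation[I, O](size: Int) {
  protected val buffer = ArrayBuffer.empty[I]
  def operate(items: Seq[I]): O
  // Buffer until `size` items accumulate, then flush in one shot.
  def put(i: I): Option[O] = {
    buffer += i
    if (buffer.size >= size) Some(flush()) else None
  }
  def flush(): O = {
    val out = operate(buffer.toList)
    buffer.clear()
    out
  }
}
trait BufferedReduce[V] // a marker trait in this sketch

object BufferedOps {
  // Both fixes applied: `size` is respected (no hardcoded 1000), and
  // the return type is stated explicitly rather than inferred from
  // the anonymous class.
  def fromSumOption[T](size: Int)(implicit sg: Semigroup[T]): ArrayBufferedOperation[T, T] with BufferedReduce[T] =
    new ArrayBufferedOperation[T, T](size) with BufferedReduce[T] {
      def operate(items: Seq[T]): T = sg.sumOption(items).get
    }
}
```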

@sritchie
Collaborator Author

sritchie commented Dec 1, 2016

@johnynek ah, nice catches. Was letting the coffee get to me. All fixed up.

@isnotinvain
Contributor

LGTM

Any idea why ignoring size didn't trigger a test failure? Is that just a size hint for performance?

Does this now always use sumOption, even when plus would normally be called? I wonder if any plus(x, y) implementations are more efficient than sumOption(Seq(x, y)) when only adding 2 items together (e.g., the map monoid's sumOption building a whole mutable hash map).

@sritchie-stripe
Contributor

@isnotinvain, my mistake was that I left a value of 1000 hardcoded, instead of respecting the size parameter. The Tuple and Record sumOption implementations also pass a value of 1000, so there was no difference.

All this PR does is add a more efficient implementation for anything that explicitly calls sumOption - the plus logic still works exactly like it did before. In fact, the benchmark calls out the difference between using plus explicitly vs sumOption for larger collections.

If you call sumOption on a collection of two things, it's true that it might be more efficient to just add them together with plus - sumOption operates on TraversableOnce, though, so it can't know that information. It's up to the caller to know the difference.
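A tiny illustration of that trade-off (`Semigroup` is the same simplified stand-in as above, and `sumTwo`/`sumAll` are hypothetical helpers, not Algebird APIs): a caller that statically knows it has exactly two values can call plus directly and skip any buffer setup, while a caller holding a collection of unknown length should prefer sumOption.

```scala
trait Semigroup[T] {
  def plus(l: T, r: T): T
  def sumOption(ts: Iterable[T]): Option[T] = ts.reduceOption(plus)
}

object LongSemigroup extends Semigroup[Long] {
  def plus(l: Long, r: Long): Long = l + r
}

object Sums {
  // Hypothetical helper: with exactly two values, plus avoids any
  // intermediate collection or buffer that sumOption might allocate.
  def sumTwo[T](x: T, y: T)(implicit sg: Semigroup[T]): T = sg.plus(x, y)

  // With an arbitrary-length collection, sumOption is the right call.
  def sumAll[T](ts: Iterable[T])(implicit sg: Semigroup[T]): Option[T] =
    sg.sumOption(ts)
}
```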

Thanks for the LGTM! Tests timed out, so waiting for the build to complete again and then I'll merge and continue with the averaged value implementation.

Contributor

@isnotinvain isnotinvain left a comment

thanks for explaining!

@isnotinvain isnotinvain merged commit 0ff8bfe into develop Dec 2, 2016
@johnynek
Collaborator

johnynek commented Dec 2, 2016

so, I'm worried about this.

On a deeply nested tuple, are we exponentially growing buffers? Previously we were not (I don't think); you had a linear number of buffers. Can we stop and document why this is not exponential here?

@isnotinvain
Contributor

Sorry, looks like I pulled the trigger too soon. Did this go into the release?
Do we need to back it out :( ?

@sritchie-stripe
Contributor

hey, sorry @isnotinvain for not updating the ticket! @johnynek and I talked offline when releasing and decided that this is NOT an issue. Buffers are going to grow linearly with the depth if you have nested tuples, which is fine.

If I have a Tuple2 of Tuple2s, the outer Tuple2 will create 2 buffers, one for each inner tuple; each inner tuple will then create 2 buffers as well. I think this is totally fine, and how it has to work. Does that sound right?

Glad we thought this through!
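One way to see the linear bound (a hypothetical illustration, not code from this PR): each non-tuple leaf field gets exactly one buffer, so the total buffer count equals the flattened field count regardless of nesting depth.

```scala
object Buffers {
  // Count the leaf buffers a nested-tuple semigroup would allocate:
  // one per non-tuple field. For ((a, b), (c, d)) that's 4, equal to
  // the flattened field count — linear in the size, not exponential.
  def leafBufferCount(shape: Any): Int = shape match {
    case (a, b)    => leafBufferCount(a) + leafBufferCount(b)
    case (a, b, c) => leafBufferCount(a) + leafBufferCount(b) + leafBufferCount(c)
    case _         => 1
  }
}
```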

@isnotinvain
Contributor

Yeah SGTM!
