Generalize algorithm links per language in README #14

scotts · 2020-07-06T20:02:49Z

In the current README, when we mention a particular algorithm, we link to its C++ implementation. This does not scale as we add implementations in new languages. We should come up with a format that makes it easy to point readers to implementations in the language they are interested in.

segeljakt · 2020-07-07T08:39:44Z

Maybe a capability matrix like https://beam.apache.org/documentation/runners/capability-matrix/ could be useful. I'm not sure how easy it would be to write it in markdown though.
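On the question of writing such a matrix in markdown: a minimal sketch of generating the table from data rather than maintaining it by hand. The algorithm names and language sets here are placeholders for illustration, and `capability_matrix` is a hypothetical helper, not something that exists in the repository.

```python
# Hypothetical data: placeholder algorithm names and languages,
# not an authoritative list of what the repository provides.
IMPLEMENTATIONS = {
    "DABA": {"C++", "Rust"},
    "FiBA": {"C++", "Rust", "Java"},
}
LANGUAGES = ["C++", "Rust", "Java"]


def capability_matrix(implementations, languages):
    """Render a GitHub-flavored Markdown table with one row per algorithm."""
    header = "| Algorithm | " + " | ".join(languages) + " |"
    divider = "|---|" + "|".join(":---:" for _ in languages) + "|"
    rows = []
    for name in sorted(implementations):
        # Mark each language column according to whether an implementation exists.
        cells = ["yes" if lang in implementations[name] else "no" for lang in languages]
        rows.append("| " + name + " | " + " | ".join(cells) + " |")
    return "\n".join([header, divider] + rows)


if __name__ == "__main__":
    print(capability_matrix(IMPLEMENTATIONS, LANGUAGES))
```

A generated table stays consistent as languages are added, at the cost of one more script to maintain.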

I think it would also be very interesting to have a benchmark between the implementations.

scotts · 2020-07-10T21:47:13Z

I think a capability matrix is a useful way to categorize the algorithms, as we have several dimensions we care about (operations supported; in-order required; algorithmic analysis; space required; implementations). I'm worried, though, that it is a lot to parse up-front. It might be easier to maintain and read in a more prose-like style rather than in one large table. Maybe something like (these are all bogus URLs):

DABA

full name: De-Amortized Banker's Aggregator
ordering: FIFO
operator requirements: associativity
time complexity: worst-case O(1)
space requirements: 2n
first appeared: Low-Latency Sliding-Window Aggregation in Worst-Case Constant Time
implementations: C++, Rust

FiBA

full name: Finger B-Tree Aggregator
ordering: out-of-order allowed, requires timestamps
operator requirements: associativity
time complexity: worst-case O(log n d) where d is distance from being in-order; reduces to worst-case O(log n) when d=0; and average-case O(log d) and reduces to average-case O(1) when d=0
space requirements: O(n)
first appeared: Optimal and General Out-of-Order Sliding-Window Aggregation
implementations: C++, Rust, Java
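The prose-style entries above could be kept as structured records and rendered into the README, so that each language name becomes a link to its implementation. This is only a sketch under assumptions: the `Algorithm` record, the `render` helper, and the `example.com` URLs are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Algorithm:
    name: str
    full_name: str
    ordering: str
    time_complexity: str
    space: str
    # Maps language name -> URL of that implementation (placeholder URLs below).
    implementations: dict = field(default_factory=dict)


def render(algo):
    """Render one algorithm as a prose-style README entry with per-language links."""
    links = ", ".join(f"[{lang}]({url})" for lang, url in algo.implementations.items())
    return "\n".join([
        f"### {algo.name}",
        f"- full name: {algo.full_name}",
        f"- ordering: {algo.ordering}",
        f"- time complexity: {algo.time_complexity}",
        f"- space requirements: {algo.space}",
        f"- implementations: {links}",
    ])


if __name__ == "__main__":
    daba = Algorithm(
        name="DABA",
        full_name="De-Amortized Banker's Aggregator",
        ordering="FIFO",
        time_complexity="worst-case O(1)",
        space="2n",
        implementations={
            "C++": "https://example.com/cpp/daba",    # placeholder URL
            "Rust": "https://example.com/rust/daba",  # placeholder URL
        },
    )
    print(render(daba))
```

Adding a new language then means adding one dictionary entry per algorithm rather than rewriting the prose.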

scotts · 2020-07-10T21:47:48Z

Also, I think what to do about benchmarks warrants a new issue.


hirzel · 2020-08-11T20:59:30Z

This looks great so far! A few tweaks:

Instead of citing Jon Skeet on Stack Overflow we should cite adamax on Stack Overflow. The former only presents a LIFO stack, whereas the latter presents a FIFO queue.
Instead of saying FiBA requires space n, I would say it requires space O(n). The exact constant factor depends on the arity.
For SOE, we can say the time is worst-case O(1). This is of course predicated on the combine function being O(1), but we also assume that elsewhere. Furthermore, it is predicated on using a chunked-array deque as the underlying backing store.
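To make the SOE point concrete, here is a minimal sketch, assuming SOE refers to Subtract-on-Evict and using integer sum as the invertible operator. The class and method names are hypothetical and do not mirror the repository's API. CPython's `collections.deque` is implemented as linked fixed-size blocks with O(1) appends and pops at both ends, so it stands in here for the chunked-array deque mentioned above.

```python
from collections import deque


class SubtractOnEvict:
    """FIFO sliding-window aggregation for an invertible operator (sum here)."""

    def __init__(self):
        self._window = deque()  # backing store: O(1) append and popleft
        self._total = 0         # running aggregate of everything in the window

    def insert(self, value):
        # Combine the new value into the running aggregate: O(1).
        self._window.append(value)
        self._total += value

    def evict(self):
        # Remove the oldest value and apply the inverse operation: O(1).
        oldest = self._window.popleft()
        self._total -= oldest

    def query(self):
        return self._total


if __name__ == "__main__":
    w = SubtractOnEvict()
    for v in [3, 1, 4]:
        w.insert(v)
    w.evict()          # drops the 3
    print(w.query())   # aggregate of the remaining values 1 and 4
```

Every operation does a constant amount of work, provided both the combine and its inverse are O(1).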

hirzel · 2020-08-11T21:07:13Z

One more thing I just noticed. We are saying "out-of-order allowed" for Recalc and for SOE. However, I think we currently only provide FIFO implementations for those two algorithms.

scotts · 2020-08-11T21:49:39Z

> However, I think we currently only provide FIFO implementations for those two algorithms.

I thought about that, too. I think maybe we should make the qualification when we link to the implementation and keep the SWAG itself general.

ktangwon · 2020-08-13T16:13:55Z

This is looking good. Right now, the README file uses the terms average-case and worst-case to describe algorithms' running time. However, the term "amortized" (e.g., amortized O(log n) and amortized O(1)) would be a more precise characterization of the notion of averaging we use.
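The distinction shows up clearly in a Two-Stacks style aggregator, sketched below under assumptions: the class and method names are hypothetical and this is not the repository's implementation. A single `evict` can cost O(n) when the back stack is flipped onto the front stack, but each element is moved at most once, so the cost is amortized O(1) per operation. No assumption about the input distribution is involved, which is why "amortized" fits better than "average-case".

```python
class TwoStacks:
    """FIFO sliding-window aggregation for any associative operator."""

    def __init__(self, combine, identity):
        self._combine = combine
        self._identity = identity
        # Each entry is (value, agg). In _front, agg covers the entry and
        # everything beneath it (newer values); the top is the oldest value.
        # In _back, agg covers everything beneath it (older values) and the entry.
        self._front = []
        self._back = []

    def _agg(self, stack):
        return stack[-1][1] if stack else self._identity

    def insert(self, value):
        # Always O(1): extend the running aggregate of the back stack.
        self._back.append((value, self._combine(self._agg(self._back), value)))

    def evict(self):
        if not self._front:
            # Flip: O(n) for this one call, but each value is moved only once.
            while self._back:
                value, _ = self._back.pop()
                self._front.append((value, self._combine(value, self._agg(self._front))))
        self._front.pop()  # raises IndexError if the window is empty

    def query(self):
        # Older values live in _front, newer ones in _back.
        return self._combine(self._agg(self._front), self._agg(self._back))


if __name__ == "__main__":
    # String concatenation is associative but not commutative,
    # so it also checks that window order is preserved.
    w = TwoStacks(lambda x, y: x + y, "")
    for ch in "abc":
        w.insert(ch)
    w.evict()
    w.insert("d")
    print(w.query())
```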

scotts · 2020-08-13T21:36:04Z

@ktangwon, what did we say the space requirements are for IOA?

ktangwon · 2020-08-14T02:29:12Z

> @ktangwon, what did we say the space requirements are for IOA?

IOA keeps 2n partial aggregates (same as DABA) and additionally keeps a few extra pointers (between nodes and for suspended computations). It is hard to pin down the exact constant, but we can safely say IOA requires O(n) space.


scotts added the documentation label Jul 6, 2020

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 10, 2020

Expand and create systematic listing of algorithms IBM#14

c26d6f1

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 10, 2020

Expand and create systematic listing of algorithms IBM#14

c15d842

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 10, 2020

Expand and create systematic listing of algorithms IBM#14

d51a99e

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 10, 2020

Expand and create systematic listing of algorithms IBM#14

e6f1740

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 10, 2020

Expand and create systematic listing of algorithms IBM#14

254b411

scotts added a commit that referenced this issue Aug 10, 2020

Merge pull request #28 from scotts/master

e725669

Expand and create systematic listing of algorithms #14

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 13, 2020

Change "average-case" to "amortized" IBM#14

ac2ed81

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 13, 2020

Fix attribution of Two-Stacks IBM#14

097fee5

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 13, 2020

Fix FiBA space requirements IBM#14

b730961

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 13, 2020

Fix SOE description IBM#14

571ab11

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 13, 2020

Fix Recalc and SOE space requirements to be plain n IBM#14

1188ca3

scotts pushed a commit to scotts/sliding-window-aggregators that referenced this issue Aug 14, 2020

Add IOA's space requirements IBM#14

7c06bfc

scotts added a commit that referenced this issue Aug 14, 2020

Merge pull request #30 from scotts/master

01b8404

Add IOA's space requirements #14
