During deeply nested group-by execution, the merge buffer for the inner group-by result should be held during the outer group-by execution #3938

jihoonson · 2017-02-15T06:35:41Z

For deeply nested group-by execution, at most two merge buffers should be acquired on the broker side for the currently accumulating sequence and the underlying input sequence which is already accumulated. With the group-by strategy v2, this intermediate result is stored on a merge buffer.

Obviously, the merge buffer needs to be held until the current sequence accumulation is completed. However, in the current implementation (see CombiningSequence), the input sequence (CombiningSequence.baseSequence) is first accumulated, and then the current sequence is accumulated. The merge buffer for the input sequence is released immediately once it's accumulation is completed (BaseSequence.IteratorMaker.cleanup()), so it is not guaranteed to hold the intermediate result stored on that merge buffer until the current sequence is completely accumulated.

The text was updated successfully, but these errors were encountered:

jon-wei · 2017-02-16T00:23:38Z

looking into this

jihoonson · 2017-02-16T03:47:28Z

I found that this rarely happens only when nested queries have a simple count aggregation like select count(*) from (select d1, sum(m1) from foo group by d1) t. And in this case, it seems normal because the simple count aggregation needs just merging inputs instead of accumulating them.
I'll close this issue.

gianm · 2017-02-16T04:56:41Z

@jihoonson are you saying the behavior you saw is not actually a bug?

jihoonson · 2017-02-16T05:15:21Z

Yes, it seems normal.
More precisely speaking, in CombiningSequence.accumulation(), baseSequence.accumulate(null, combiningAccumulator) consumes only a single row because the simple count aggregation always produces a single row. And in this case, CombiningAccumulator.accumulate() always executes the below code which doesn't have to acquire a merge buffer.

if (prevValue == null) {
  return mergeFn.apply(t, null);
}

Thus, even though the merge buffer of the input sequence is released before the current sequence is accumulated, the intermediate result of the input sequence accumulation is kept as lastValue which is the result of baseSequence.accumulate(null, combiningAccumulator) in CombiningSequence.accumulation().

jihoonson · 2017-02-16T05:22:16Z

You can reproduce this by checking out https://github.com/jihoonson/druid/tree/merge-buffer-debug and running GroupByQueryMergeBufferTest. The tests in this class of this branch emit some logs which show what happens in sequence.

gianm added this to the 0.10.0 milestone Feb 15, 2017

gianm added the Bug label Feb 15, 2017

jihoonson closed this as completed Feb 16, 2017

This was referenced Sep 5, 2021

[Snyk] Security upgrade axios from 0.18.0 to 0.21.3 ajesse11x/incubator-druid#229

Open

[Snyk] Security upgrade axios from 0.21.1 to 0.21.3 terrorizer1980/druid#11

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

During deeply nested group-by execution, the merge buffer for the inner group-by result should be held during the outer group-by execution #3938

During deeply nested group-by execution, the merge buffer for the inner group-by result should be held during the outer group-by execution #3938

jihoonson commented Feb 15, 2017 •

edited

Loading

jon-wei commented Feb 16, 2017

jihoonson commented Feb 16, 2017

gianm commented Feb 16, 2017

jihoonson commented Feb 16, 2017

jihoonson commented Feb 16, 2017

During deeply nested group-by execution, the merge buffer for the inner group-by result should be held during the outer group-by execution #3938

During deeply nested group-by execution, the merge buffer for the inner group-by result should be held during the outer group-by execution #3938

Comments

jihoonson commented Feb 15, 2017 • edited Loading

jon-wei commented Feb 16, 2017

jihoonson commented Feb 16, 2017

gianm commented Feb 16, 2017

jihoonson commented Feb 16, 2017

jihoonson commented Feb 16, 2017

jihoonson commented Feb 15, 2017 •

edited

Loading