
Schedule dynamic filtering collecting task immediately #10868

Merged
merged 2 commits into from
Feb 3, 2022

Conversation

sopel39
Member

@sopel39 sopel39 commented Jan 31, 2022

In case of plan

        J1
       /  \
     J2    S3
    /  \
   S1  S2

It might happen that the dynamic filtering evaluation order is:
S3 => S2 => S1

With the phased scheduler, the source stage consisting of
(J1, J2, S1) won't be scheduled until the stages running S3 and S2
have finished split enumeration. However, it might happen
that S2 is waiting for dynamic filters produced for S3.
In that case, S2 will never complete because the DFs for S3
are collected in stage (J1, J2, S1), which won't be scheduled
until all S2 splits are enumerated.

This commit schedules the DF collecting task immediately,
which prevents queries from deadlocking.
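The circular wait described above can be modeled as a small wait-for graph. The sketch below is illustrative only (the node names and the graph representation are assumptions, not Trino's scheduler code): an edge u -> v means "u cannot make progress until v completes", and scheduling the DF collecting task immediately removes the edge that closes the cycle.

```java
import java.util.*;

// Illustrative sketch, not Trino code: models the wait-for graph between
// stages. An edge u -> v means "u cannot proceed until v completes".
public class DeadlockSketch
{
    static boolean hasCycle(Map<String, List<String>> waitsFor)
    {
        Set<String> done = new HashSet<>();
        Set<String> inProgress = new HashSet<>();
        for (String node : waitsFor.keySet()) {
            if (dfs(node, waitsFor, inProgress, done)) {
                return true;
            }
        }
        return false;
    }

    static boolean dfs(String node, Map<String, List<String>> waitsFor, Set<String> inProgress, Set<String> done)
    {
        if (done.contains(node)) {
            return false;
        }
        if (!inProgress.add(node)) {
            return true; // back edge: circular wait
        }
        for (String dep : waitsFor.getOrDefault(node, List.of())) {
            if (dfs(dep, waitsFor, inProgress, done)) {
                return true;
            }
        }
        inProgress.remove(node);
        done.add(node);
        return false;
    }

    public static void main(String[] args)
    {
        // Before the fix: stage (J1,J2,S1) waits for S2's split enumeration,
        // while S2 waits for dynamic filters collected inside (J1,J2,S1).
        Map<String, List<String>> before = Map.of(
                "stage(J1,J2,S1)", List.of("S2 enumeration", "S3 enumeration"),
                "S2 enumeration", List.of("stage(J1,J2,S1)"),
                "S3 enumeration", List.of());
        System.out.println("before fix, deadlock: " + hasCycle(before)); // true

        // After the fix: the DF collecting task is scheduled immediately,
        // so S2 depends on it instead of on the whole deferred stage.
        Map<String, List<String>> after = Map.of(
                "stage(J1,J2,S1)", List.of("S2 enumeration", "S3 enumeration"),
                "S2 enumeration", List.of("DF collecting task"),
                "S3 enumeration", List.of(),
                "DF collecting task", List.of());
        System.out.println("after fix, deadlock: " + hasCycle(after)); // false
    }
}
```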

@sopel39 sopel39 requested review from dain and raunaqmorarka January 31, 2022 20:24
@cla-bot cla-bot bot added the cla-signed label Jan 31, 2022
@sopel39 sopel39 force-pushed the ks/df_schedule_imm branch 11 times, most recently from 9da1326 to 697c6a1 Compare February 1, 2022 10:03
Member

@dain dain left a comment


The scheduler stuff and the tests look good to me. I don't understand the implications of the change to DynamicFilterService.

@sopel39
Member Author

sopel39 commented Feb 1, 2022

Addressed comments @raunaqmorarka @dain. I've narrowed down the cases where the phased scheduler won't start a join stage immediately to non-fixed source stages only. I've also improved the tests with regard to source stage partitioning.

@sopel39 sopel39 force-pushed the ks/df_schedule_imm branch 4 times, most recently from d4b6e8a to 6dc46de Compare February 1, 2022 21:47
Member

@dain dain left a comment


looks good

Contributor

@arhimondr arhimondr left a comment


It doesn't feel like I really understand the problem. I've added some questions.

It would be great if you could extract the unrelated refactoring and improvements, so that one commit does only what's needed to address the problem.

Additionally, it would be great if you could elaborate more on the problem in the commit message. Maybe you can provide an example of a query that triggers a deadlock, with a description of what the join distributions are, where the stage boundaries are, and how the dynamic filters interact with each other.

@sopel39 sopel39 force-pushed the ks/df_schedule_imm branch 3 times, most recently from 333e82e to d50a4c8 Compare February 2, 2022 11:18
@sopel39
Member Author

sopel39 commented Feb 2, 2022

@arhimondr I've simplified this PR (removed the refactoring), keeping just the needed parts. I cannot answer some of the comments because they are now outdated.

This changes the logic from "if there are any dynamic filters produced and consumed within a stage" to "if there are any dynamic filters produced by a stage". Am I correct? Could you please explain why it is necessary? Is it needed to cover a mix of broadcast and partitioned joins?

I've changed the logic to create a collecting task if there is any lazy DF produced by the stage. This is needed because there might be consumers of that lazy DF outside of the stage.
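The predicate change described above can be sketched as follows (the method and parameter names are hypothetical, not Trino's actual DynamicFilterService API):

```java
import java.util.Set;

// Hypothetical sketch of the predicate change; names are illustrative.
public class CollectingTaskRule
{
    // Old rule: a collecting task was created only when some lazy dynamic
    // filter was both produced and consumed within the same stage.
    static boolean needsCollectingTaskOld(Set<String> producedLazyDfs, Set<String> consumedDfs)
    {
        return producedLazyDfs.stream().anyMatch(consumedDfs::contains);
    }

    // New rule: any lazy DF produced by the stage triggers a collecting task,
    // because its consumers may live in other stages.
    static boolean needsCollectingTaskNew(Set<String> producedLazyDfs)
    {
        return !producedLazyDfs.isEmpty();
    }

    public static void main(String[] args)
    {
        // df1 is produced here but consumed only by a scan in another stage
        Set<String> produced = Set.of("df1");
        Set<String> consumedLocally = Set.of();
        System.out.println("old rule: " + needsCollectingTaskOld(produced, consumedLocally)); // false
        System.out.println("new rule: " + needsCollectingTaskNew(produced)); // true
    }
}
```

Under the old rule, a stage whose lazy DF is consumed only elsewhere would never get a collecting task, starving the remote consumers.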

From what I understand, it is to prevent a non-source-distributed stage running a replicated join from being executed lazily. Is that correct? I wonder why it is necessary.

There won't be an extra collecting task for partitioned stages. Hence, if there are any lazy DFs produced by such a stage, the stage needs to be scheduled without delay.
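That scheduling consequence can be summarized with a small decision helper (a hypothetical sketch under the assumptions above, not the actual scheduler code):

```java
import java.util.Set;

// Illustrative decision helper; not Trino's scheduler API.
public class ScheduleDecision
{
    // Partitioned (fixed-distribution) stages get no separate collecting
    // task, so a partitioned stage that produces any lazy DF must be
    // scheduled without delay for its filters to ever be collected.
    static boolean scheduleWithoutDelay(boolean partitionedStage, Set<String> producedLazyDfs)
    {
        return partitionedStage && !producedLazyDfs.isEmpty();
    }

    public static void main(String[] args)
    {
        System.out.println(scheduleWithoutDelay(true, Set.of("df1"))); // true
        System.out.println(scheduleWithoutDelay(true, Set.of()));      // false
    }
}
```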

@sopel39 sopel39 force-pushed the ks/df_schedule_imm branch from d50a4c8 to 7ca7808 Compare February 2, 2022 11:27
@sopel39 sopel39 force-pushed the ks/df_schedule_imm branch 2 times, most recently from 3655c3b to 30c9834 Compare February 2, 2022 23:28
@sopel39 sopel39 requested a review from arhimondr February 2, 2022 23:30
@sopel39 sopel39 force-pushed the ks/df_schedule_imm branch from 30c9834 to 0901a25 Compare February 2, 2022 23:34
In case of plan
        J1
       /  \
     J2    S3
    /  \
   S1  S2

It might happen that the dynamic filtering evaluation order is:
S3 => S2 => S1

With the phased scheduler, the source stage consisting of
(J1, J2, S1) won't be scheduled until the stages running S3 and S2
have finished split enumeration. However, it might happen
that S2 is waiting for dynamic filters produced for S3.
In that case, S2 will never complete because the DFs for S3
are collected in stage (J1, J2, S1), which won't be scheduled
until all S2 splits are enumerated.

This commit schedules the DF collecting task immediately,
which prevents queries from deadlocking.
@sopel39 sopel39 force-pushed the ks/df_schedule_imm branch from 0901a25 to bce823e Compare February 2, 2022 23:56
@sopel39 sopel39 merged commit 8367998 into trinodb:master Feb 3, 2022
@sopel39 sopel39 deleted the ks/df_schedule_imm branch February 3, 2022 06:35
@sopel39 sopel39 mentioned this pull request Feb 3, 2022
@github-actions github-actions bot added this to the 370 milestone Feb 3, 2022
@sopel39
Member Author

sopel39 commented Feb 3, 2022

Thanks for the review!

5 participants