Improving optimizer performance by eliminating unnecessary sort and distribution passes, add more SymmetricHashJoin improvements #5754

metesynnada · 2023-03-27T11:53:44Z

Which issue does this PR close?

Closes #5715.

Rationale for this change

The current implementation of SymmetricHashJoin requires order information before the PipelineFixer. This limitation results in unnecessary sort and distribution enforcement requirements that impact the optimizer's performance. To overcome this issue, we have revamped SymmetricHashJoin to eliminate the need for order information before PipelineFixer.

What changes are included in this PR?

We have modified SymmetricHashJoin to function without requiring order information. The new implementation does not raise errors without order or filter information, though pruning is not supported without order information. Furthermore, we have set the required_input_ordering API to None, enabling the source to provide order information for piped executions. With this approach, we can genuinely remove the dependency on EnforceSorting and EnforceDistribution from PipelineFixer.

Are these changes tested?

Yes.

I encountered an erratic deadlock in the unbounded_file_with_symmetric_join test due to the current nature of the list_files_for_scan API in ListingTable. This API currently opens the file for metadata verification using the head() API in object_store, which can be problematic while handling FIFO files. However, the upcoming release of object_store will eliminate the need to open a file for metadata retrieval, resolving this issue. For now, I will temporarily bypass this test to focus on other aspects of the PR.

Are there any user-facing changes?

A new configuration option (allow_symmetric_joins_without_pruning) allows the user to constrain SymmetricHashJoins to run only on ordered inputs—no breaking changes in existing use cases.

…HJ code simplifications

metesynnada · 2023-04-02T19:36:31Z

Hi @alamb, I think it is ready for review. I hope it will contribute to the query planning performance.

alamb · 2023-04-02T20:09:53Z

Thanks @metesynnada -- I will put this on my review list for tomorrow

alamb

Thank you @metesynnada -- I looked at this PR and the struture looks good to me.

I am not familiar with the pipeline fixer code so I did not review that portion. @mustafasrepo can you please review that code (if you have not already done so). If it is good with you I think we'll be good to merge this PR

I had some small comments but nothing that is required in my opinion

alamb · 2023-04-03T13:09:39Z

datafusion/common/src/config.rs

@@ -280,6 +280,10 @@ config_namespace! {
        /// using the provided `target_partitions` level
        pub repartition_joins: bool, default = true

+        /// Should DataFusion allow symmetric hash joins for unbounded data sources even when
+        /// its inputs do not have any ordering or filtering
+        pub allow_symmetric_joins_without_pruning: bool, default = true


I don't understand how a symmetric hash join could generate correct results when the inputs don't have any ordering 🤔 Maybe we can add some additional comments about under what circumstances one would enable
/ disable this option.

SHJ will always produce correct results, but it will use twice as much memory (assuming inputs are of the same size) for no gain except pipelining.

Some more explanation about this option: It is not always possible to detect 100% accurately whether pruning may occur or not -- the system may think pruning is not possible where it is actually possible. Therefore, one would enable this option if they have a-priori knowledge that data would indeed lend itself to pruning.

Thank you -- this explanation and the updated comments help to clarify

alamb · 2023-04-03T13:12:00Z

datafusion/core/src/execution/context.rs

@@ -1293,9 +1293,6 @@ impl SessionState {
            // repartitioning and local sorting steps to meet distribution and ordering requirements.
            // Therefore, it should run before EnforceDistribution and EnforceSorting.
            Arc::new(JoinSelection::new()),
-            // Enforce sort before PipelineFixer


mingmwang · 2023-04-03T13:26:08Z

Can we have different physical optimizers list for the plans with/without unbounded sources?
And I think the bound/unbounded source should an attribute or method for Source Operators only.

mingmwang · 2023-04-03T13:43:20Z

https://github.com/apache/flink/blob/master/flink-core/src/main/java/org/apache/flink/api/connector/source/Boundedness.java
https://github.com/apache/flink/blob/master/flink-core/src/main/java/org/apache/flink/api/connector/source/Source.java
https://github.com/apache/flink/blob/master/flink-core/src/main/java/org/apache/flink/api/common/RuntimeExecutionMode.java

metesynnada · 2023-04-03T15:45:05Z

Can we have a different physical optimizers list for the plans with/without unbounded sources?

I think this would cause problems while we are using bounded and unbounded sources together in the same query.

And I think the bound/unbounded source should be an attribute or method for Source Operators only.

Assigning the responsibility to each ExecutionPlan to determine whether its input is unbounded or not, similar to order/distribution information, seems to be the optimal strategy for unifying unbounded and bounded execution. This approach maintains a separation of concerns and empowers us to make atomic decisions with our best effort.

Attempting to solve this problem globally may lead to inflexible design patterns and technical debt. Nonetheless, we currently have the capability to optimize and handle complex queries with a combination of both unbounded and bounded sources, which is a robust solution.

alamb · 2023-04-03T15:52:08Z

Can we have different physical optimizers list for the plans with/without unbounded sources?

This would make sense to me if there are different query plans / decisions that would be made in the two modes. It seems like the decision can be made locally at the moment by inspecting the plans / plan nodes, as suggested by @metesynnada

@mingmwang do you have some ideas about when a global mode would be more beneficial?

ozankabak · 2023-04-03T17:34:06Z

One can not achieve unified processing (where you can freely use streams and tables together) with the global mode route. With all the streaming improvements coming in, the fact that Datafusion can do this transparently with just standard SQL is an attractive differentiator and innovation IMO.

mustafasrepo · 2023-04-03T19:00:14Z

@alamb I have reviewed this PR, in our internal repo. This PR is LGTM!.

alamb

Thank you @metesynnada and @ozankabak and @mustafasrepo -- I will plan to merge this PR tomorrow unless there are any other comments

metesynnada and others added 7 commits March 22, 2023 15:58

Increase optimizer performance

b25bc46

Config added.

b5eb32b

Simplifications and comment improvements

575a6d7

More simplifications

05f768c

Revamping tests for unbounded-unbounded cases.

90d82df

Review code

8886457

Move SHJ suitability from PipelineFixer to PipelineChecker, further S…

36d450e

…HJ code simplifications

github-actions bot added core Core DataFusion crate physical-expr Physical Expressions sqllogictest SQL Logic Tests (.slt) labels Mar 27, 2023

metesynnada changed the title ~~Performance/remove enforcesorting~~ Improving optimizer performance by eliminating unnecessary sort and distribution requirements Mar 27, 2023

ozankabak and others added 2 commits March 27, 2023 16:39

Merge branch 'main' into performance/remove-enforcesorting

5df5e05

Added logging on tests and ensure timeout

119a870

metesynnada marked this pull request as draft March 29, 2023 06:55

metesynnada added 12 commits March 29, 2023 10:31

Robust fifo writing in case of slow executions

a83a284

Update fifo.rs

8799000

Update fifo.rs

02bd036

Update fifo.rs

05497ef

Update fifo.rs

311a891

Get rid of locks

b051efb

Try exact one batch size

c0ecbe4

Update fifo.rs

e6ab621

Update fifo.rs

7cd6b1e

Update fifo.rs

9fa0c71

Merge branch 'main' into performance/remove-enforcesorting

0a2d35f

Ignore FIFO test

c9bece5

metesynnada marked this pull request as ready for review April 2, 2023 19:35

Merge branch 'main' into performance/remove-enforcesorting

417e2f1

alamb reviewed Apr 3, 2023

View reviewed changes

alamb changed the title ~~Improving optimizer performance by eliminating unnecessary sort and distribution requirements~~ Improving optimizer performance by eliminating unnecessary sort and distribution passes, add more SymmetricHashJoin improvements Apr 3, 2023

Update config.rs

331851b

metesynnada added 5 commits April 3, 2023 20:35

Config update

f962643

Update config.rs

2c15d4e

Update configs.md

363960f

Update config

61c3434

Update symmetric_hash_join.rs

7d32c12

alamb approved these changes Apr 3, 2023

View reviewed changes

alamb merged commit d6c2233 into apache:main Apr 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improving optimizer performance by eliminating unnecessary sort and distribution passes, add more SymmetricHashJoin improvements #5754

Improving optimizer performance by eliminating unnecessary sort and distribution passes, add more SymmetricHashJoin improvements #5754

metesynnada commented Mar 27, 2023 •

edited

Loading

metesynnada commented Apr 2, 2023

alamb commented Apr 2, 2023

alamb left a comment

alamb Apr 3, 2023

ozankabak Apr 3, 2023

alamb Apr 3, 2023

alamb Apr 3, 2023

mingmwang commented Apr 3, 2023

mingmwang commented Apr 3, 2023

metesynnada commented Apr 3, 2023

alamb commented Apr 3, 2023

ozankabak commented Apr 3, 2023 •

edited

Loading

mustafasrepo commented Apr 3, 2023

alamb left a comment

Improving optimizer performance by eliminating unnecessary sort and distribution passes, add more SymmetricHashJoin improvements #5754

Improving optimizer performance by eliminating unnecessary sort and distribution passes, add more SymmetricHashJoin improvements #5754

Conversation

metesynnada commented Mar 27, 2023 • edited Loading

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

metesynnada commented Apr 2, 2023

alamb commented Apr 2, 2023

alamb left a comment

Choose a reason for hiding this comment

alamb Apr 3, 2023

Choose a reason for hiding this comment

ozankabak Apr 3, 2023

Choose a reason for hiding this comment

alamb Apr 3, 2023

Choose a reason for hiding this comment

alamb Apr 3, 2023

Choose a reason for hiding this comment

mingmwang commented Apr 3, 2023

mingmwang commented Apr 3, 2023

metesynnada commented Apr 3, 2023

alamb commented Apr 3, 2023

ozankabak commented Apr 3, 2023 • edited Loading

mustafasrepo commented Apr 3, 2023

alamb left a comment

Choose a reason for hiding this comment

metesynnada commented Mar 27, 2023 •

edited

Loading

ozankabak commented Apr 3, 2023 •

edited

Loading