Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize the num of stats load items for large SQL and wide table #54883

Open
winoros opened this issue Jul 24, 2024 · 0 comments
Open

Optimize the num of stats load items for large SQL and wide table #54883

winoros opened this issue Jul 24, 2024 · 0 comments
Labels
component/statistics sig/planner SIG: Planner type/enhancement The issue or PR belongs to an enhancement.

Comments

@winoros
Copy link
Member

winoros commented Jul 24, 2024

Enhancement

After the collecting of predicate columns becomes the default, we can change the way to decide which columns' statistics need to be loaded.

The columns that come from the WHERE clause need to read their full stats(NDV, histogram, topn). The columns that come from JOIN keys/group-by clause/order-by/partition-by only need their NDV information.

So we can reduce the size of the stats objects that we need to load to memory.

@winoros winoros added type/enhancement The issue or PR belongs to an enhancement. sig/planner SIG: Planner labels Jul 24, 2024
ti-chi-bot bot pushed a commit that referenced this issue Jul 25, 2024
@Rustin170506 Rustin170506 moved this to 📋 Backlog in 😆Stats Dec 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/statistics sig/planner SIG: Planner type/enhancement The issue or PR belongs to an enhancement.
Projects
None yet
Development

No branches or pull requests

2 participants