Re-arrange joining on genetic_alteration table to optimize memory con… #11374

Merged 1 commit on Feb 6, 2025

Conversation

alisman

@alisman alisman commented Feb 5, 2025

Creation of ClickHouse derived tables based on the genetic_alteration table (unpacking delimited strings into ClickHouse rows) has been plagued by memory issues, which required batching the process. This PR reduces memory consumption by following the "largest table on the left" rule: genetic_alteration is by far the largest table, so it should be on the left side of the joins. Intuitively, this lets the query optimizer batch its consumption of that table automatically.
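
To illustrate the reasoning, here is a minimal Python sketch, not the SQL changed in this PR. It assumes a hash join that builds its in-memory table from the right-hand input and streams the left-hand input, which is how ClickHouse's default hash join behaves. The function `hash_join`, the generator `alteration_rows`, and the `profiles` data are hypothetical stand-ins for the real tables.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

Row = Tuple[int, str]  # (join key, payload); a hypothetical simplified row shape


def hash_join(left_rows: Iterable[Row], right_rows: Iterable[Row]) -> Iterator[Tuple[int, str, str]]:
    """Inner join on the key. Only the RIGHT input is held in memory."""
    # Build phase: the whole right-hand input is materialized as a hash table.
    build: Dict[int, List[str]] = defaultdict(list)
    for key, payload in right_rows:
        build[key].append(payload)

    # Probe phase: the left-hand input is consumed one row at a time,
    # so its total size does not affect peak memory.
    for key, payload in left_rows:
        for match in build.get(key, ()):
            yield key, payload, match


def alteration_rows(n: int) -> Iterator[Row]:
    """Stand-in for the large table: rows are generated lazily, never stored."""
    for i in range(n):
        yield (i % 2, f"gene_{i}")


# Stand-in for a small lookup table that is cheap to keep in memory.
profiles: List[Row] = [(0, "mrna"), (1, "cna")]

# Large table on the left (streamed), small table on the right (hashed).
joined = list(hash_join(alteration_rows(4), profiles))
```

With the inputs swapped, the build phase would have to hold every row of the large table at once, which is the memory pattern the join re-arrangement is meant to avoid.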

@alisman alisman requested a review from sheridancbio February 5, 2025 17:44

sonarqubecloud bot commented Feb 5, 2025

@sheridancbio sheridancbio merged commit bc0e7b3 into cBioPortal:master Feb 6, 2025
23 of 26 checks passed

2 participants