Make llm benchmarks query faster #6258

clee2000 · 2025-02-04T23:58:23Z

Make faster by adding a projection to the table

PROJECTION benchmark_name_projection
    (
        SELECT *
        ORDER BY 
            repo,
            tupleElement(benchmark, 'name'),
            tupleElement(model, 'name'),
            tupleElement(metric, 'name'),
            timestamp,
            head_branch,
            head_sha,
            workflow_id,
            job_id,
            servicelab_experiment_id,
            servicelab_trial_id
    ),

For some reason, ClickHouse won't use the projection unless t uses has instead of in and all the other tuple elements are in quotation marks

We should consider changing the order by key to be this instead since I can't imagine anyone not filtering by benchmark name

Perf results:

+------+----------+-----------+-------------+---------------+----------+-----------+------------+--------------+
| Test | Avg Time | Base Time | Time Change | % Time Change | Avg Mem  |  Base Mem | Mem Change | % Mem Change |
+------+----------+-----------+-------------+---------------+----------+-----------+------------+--------------+
|  0   |   131    |   33300   |    -33169   |      -100     | 57215999 | 335780375 | -278564376 |     -83      |
+------+----------+-----------+-------------+---------------+----------+-----------+------------+--------------+

Also added a test

Checked results were the same after sorting

Also a change to the script to compare queries that makes it easier to compare results that are different by piping them to files that can be diffed later

vercel · 2025-02-04T23:58:28Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated (UTC)
torchci	✅ Ready (Inspect)	Visit Preview	Feb 5, 2025 0:30am

huydhn · 2025-02-05T00:20:42Z

torchci/clickhouse_queries/oss_ci_benchmark_llms/query.sql

-        floor(arrayAvg(o.metric.benchmark_values), 2) AS actual,
-        floor(toFloat64(o.metric.target_value), 2) AS target,
-        o.benchmark.dtype AS dtype,
+        o.model.'name' AS model,


One of of confusing aspect of ClickHouse is that sometimes model.name works, sometimes it needs to be model.'name', and the other times it is tupleElement(model, 'name'). The last one is probably the clearest because the field is a tuple, but it's also the lengthiest

huydhn

LGTM! Thank you for the fix!

huydhn · 2025-02-05T00:21:33Z

I'm not sure why the preview doesn't show up because Vercel cancels it

huydhn · 2025-02-05T00:27:12Z

I think we should also add some more key to the projection if possible. They are the keys that I should have added when creating the table.

PROJECTION benchmark_name_projection
    (
        SELECT *
        ORDER BY 
            repo,
            tupleElement(benchmark, 'name'),
            tupleElement(model, 'name'),
            tupleElement(metric, 'name'),
            timestamp,
            head_branch,
            head_sha,
            workflow_id,
            job_id,
            servicelab_experiment_id,
            servicelab_trial_id
    ),

huydhn · 2025-02-05T00:33:46Z

~~Also, could you try update the two queries?~~

~~I'm currently using a materialize view for them, which I think is slower than this projection approach (when I try to load the page)~~

It's better done in a separate PR then, as I think #6167 needs to be reverted to try it out

clee2000 added 3 commits February 4, 2025 15:34

tc

6e570a8

tc

ef11db8

tc

6d62015

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 4, 2025

huydhn reviewed Feb 5, 2025

View reviewed changes

huydhn approved these changes Feb 5, 2025

View reviewed changes

vercel bot deployed to Preview February 5, 2025 00:30 View deployment

clee2000 marked this pull request as ready for review February 5, 2025 20:56

clee2000 merged commit 8f2607d into main Feb 5, 2025
8 checks passed

clee2000 deleted the csl/benchmark_llm_query_fast branch February 5, 2025 20:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make llm benchmarks query faster #6258

Make llm benchmarks query faster #6258

clee2000 commented Feb 4, 2025 •

edited

Loading

vercel bot commented Feb 4, 2025 •

edited

Loading

huydhn Feb 5, 2025

huydhn left a comment

huydhn commented Feb 5, 2025

huydhn commented Feb 5, 2025

huydhn commented Feb 5, 2025 •

edited

Loading

Make llm benchmarks query faster #6258

Make llm benchmarks query faster #6258

Conversation

clee2000 commented Feb 4, 2025 • edited Loading

vercel bot commented Feb 4, 2025 • edited Loading

huydhn Feb 5, 2025

Choose a reason for hiding this comment

huydhn left a comment

Choose a reason for hiding this comment

huydhn commented Feb 5, 2025

huydhn commented Feb 5, 2025

huydhn commented Feb 5, 2025 • edited Loading

clee2000 commented Feb 4, 2025 •

edited

Loading

vercel bot commented Feb 4, 2025 •

edited

Loading

huydhn commented Feb 5, 2025 •

edited

Loading