
Compress each file in a ThreadPool #484

Merged · 3 commits into evansd:main from multiprocess · Oct 28, 2024
Conversation

@rik (Contributor) commented Apr 2, 2023

fix #148


I've tested this on a 2020 M1 MacBook Air (4 performance + 4 efficiency cores). All timings are in milliseconds.

| Executable    | Files | Brotli | Sequential | Parallel | Win  |
| ------------- | ----- | ------ | ---------- | -------- | ---- |
| CLI           | 4656  | No     | 3190       | 1050     | 3.0x |
| CLI           | 4656  | Yes    | 78800      | 17630    | 4.2x |
| CLI           | 783   | No     | 674        | 299      | 2.2x |
| CLI           | 783   | Yes    | 10740      | 3630     | 2.9x |
| collectstatic | 4656  | No     | 8860       | 6790     | 1.3x |
| collectstatic | 4656  | Yes    | 82410      | 25110    | 3.3x |
| collectstatic | 783   | No     | 1510       | 1370     | 1.1x |
| collectstatic | 783   | Yes    | 11370      | 5120     | 2.2x |
Commands to test yourself:

- CLI without Brotli: `time python -m whitenoise.compress --quiet --no-brotli staticfiles`
- CLI with Brotli: `time python -m whitenoise.compress --quiet staticfiles`
- collectstatic without Brotli: `rm -rf staticfiles/; pip uninstall brotli --yes; time ./manage.py collectstatic --no-input`
- collectstatic with Brotli: `rm -rf staticfiles/; pip install brotli; time ./manage.py collectstatic --no-input`

@rik (Contributor, Author) commented Apr 2, 2023

A previous version of this patch used ProcessPoolExecutor but that resulted in a slight slowdown for some non-Brotli cases.
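The thread-pool approach can be sketched in miniature (illustrative only, with made-up file data; not the actual whitenoise code). It helps despite the GIL because zlib, which backs `gzip`, releases the GIL while compressing, so threads genuinely run in parallel:

```python
import gzip
from concurrent.futures import ThreadPoolExecutor

def compress_one(item: tuple[str, bytes]) -> tuple[str, bytes]:
    # zlib releases the GIL during compression, so several files
    # can be compressed in parallel by plain threads
    name, data = item
    return name, gzip.compress(data, compresslevel=9)

def compress_all(files: dict[str, bytes]) -> dict[str, bytes]:
    # One task per file; executor.map preserves input order
    with ThreadPoolExecutor() as executor:
        return dict(executor.map(compress_one, files.items()))

files = {
    "app.js": b"console.log('hello');\n" * 200,
    "style.css": b"body { margin: 0; }\n" * 200,
}
compressed = compress_all(files)
for name, blob in compressed.items():
    assert gzip.decompress(blob) == files[name]
```

A `ProcessPoolExecutor` variant would additionally pay serialization and process-startup costs per task, which is consistent with the slowdown observed for the cheap non-Brotli cases.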

@rik changed the title from "Compress each file in a ProcessPool" to "Compress each file in a ThreadPool" on Apr 2, 2023
@rik (Contributor, Author) commented Jun 10, 2023

@evansd @adamchainz Apologies for pinging you. May I get some feedback on this approach?

@stumpylog commented

It would be great to have this merged. I've patched it into a Docker build, where it made the build much faster, but it would be best to have it built in instead.

@petrprikryl (Contributor) commented

Some benchmarks from my project, tested on an i9-11900 (8 cores, 2 threads per core).

Without ThreadPool:

1037 static files copied to '/app/staticfiles', 4137 post-processed.

real    1m37.004s
user    1m35.530s
sys     0m1.356s

With ThreadPool:

1037 static files copied to '/app/staticfiles', 4137 post-processed.

real    0m28.889s
user    3m4.132s
sys     0m2.164s

In CI it goes from 213s to 38s 🚀

@adamchainz (Collaborator) commented

Thank you very much @rik. I eventually got round to checking this.

I did some refactoring, particularly to avoid adding extra methods that are only needed for this compression step. Using inner functions avoids bloating the classes :)

It would be nice to avoid the duplication between the different sites, but I don't have the energy to figure that out right now.

I checked that current tests would fail if any of the compression didn't work. I don't think there's an easy way to check the executor is used or works correctly, but at least the basics seem to pass.

Merging and releasing now.

@adamchainz adamchainz merged commit d5caf8d into evansd:main Oct 28, 2024
13 checks passed
@Archmonger (Contributor) commented Oct 28, 2024

@adamchainz A lot of cleanup for this PR occurred on ServeStatic. It could be worth a follow-up PR.

EDIT: Looks like similar solutions were taken in your clean up, so can be skipped.

github-actions bot added a commit to xshapira/fastapi-django-template that referenced this pull request Oct 29, 2024
Bumps [whitenoise](https://github.com/evansd/whitenoise) from 6.7.0 to
6.8.1.
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a
href="https://github.com/evansd/whitenoise/blob/main/docs/changelog.rst">whitenoise's
changelog</a>.</em></p>
<blockquote>
<h2>6.8.1 (2024-10-28)</h2>
<ul>
<li>
<p>Raise any errors from threads in the <code>whitenoise.compress</code>
command.</p>
<p>Regression in 6.8.0.
Thanks to Tom Grainger for spotting this with a <code>comment on PR
[#484](evansd/whitenoise#484)
&lt;https://github.com/evansd/whitenoise/pull/484#discussion_r1818989096&gt;</code>__.</p>
</li>
</ul>
<h2>6.8.0 (2024-10-28)</h2>
<ul>
<li>
<p>Drop Django 3.2 to 4.1 support.</p>
</li>
<li>
<p>Drop Python 3.8 support.</p>
</li>
<li>
<p>Support Python 3.13.</p>
</li>
<li>
<p>Fix a bug introduced in version 6.0.0 where <code>Range</code>
requests could lead to database connection errors in other requests.</p>
<p>Thanks to Per Myren for the detailed investigation and fix in
<code>PR [#612](evansd/whitenoise#612)
&lt;https://github.com/evansd/whitenoise/pull/612&gt;</code>__.</p>
</li>
<li>
<p>Use Django’s |FORCE_SCRIPT_NAME|__ setting correctly.
This reverts a change from version 5.3.0 that added a call to Django’s
|get_script_prefix() method|__ outside of the request-response
cycle.</p>
<p>.. |FORCE_SCRIPT_NAME| replace:: <code>FORCE_SCRIPT_NAME</code>
__ <a
href="https://docs.djangoproject.com/en/stable/ref/settings/#std:setting-FORCE_SCRIPT_NAME">https://docs.djangoproject.com/en/stable/ref/settings/#std:setting-FORCE_SCRIPT_NAME</a></p>
<p>.. |get_script_prefix() method| replace::
<code>get_script_prefix()</code> method
__ <a
href="https://docs.djangoproject.com/en/stable/ref/urlresolvers/#django.urls.get_script_prefix">https://docs.djangoproject.com/en/stable/ref/urlresolvers/#django.urls.get_script_prefix</a></p>
<p>Thanks to Sarah Boyce in <code>PR
[#486](evansd/whitenoise#486)
&lt;https://github.com/evansd/whitenoise/pull/486&gt;</code>__.</p>
</li>
<li>
<p>Compress files using a thread pool.
This speeds up the compression step up to four times in benchmarks.</p>
<p>Thanks to Anthony Ricaud in <code>PR
[#484](evansd/whitenoise#484)
&lt;https://github.com/evansd/whitenoise/pull/484&gt;</code>__.</p>
</li>
</ul>
</blockquote>
</details>
<details>
<summary>Commits</summary>
<ul>
<li><a
href="https://github.com/evansd/whitenoise/commit/bfc5dae69aad3abac5ff763421a4b4e2ae8ab378"><code>bfc5dae</code></a>
Version 6.8.1</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/6bbec0fe2be3814a38f3c7585414174053b414ea"><code>6bbec0f</code></a>
Raise errors from threads in whitenoise.compress (<a
href="https://redirect.github.com/evansd/whitenoise/issues/615">#615</a>)</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/0b054e5b9706c2c02b561e3f9ab78e8700c09b69"><code>0b054e5</code></a>
Version 6.8.0</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/54c464a48fec7e0a153f5243d71a25be5be98e12"><code>54c464a</code></a>
Upgrade and clarify Django quickstart docs (<a
href="https://redirect.github.com/evansd/whitenoise/issues/548">#548</a>)</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/d5caf8daed1a488999d92d33d3b91778003d1e6a"><code>d5caf8d</code></a>
Compress each file in a ThreadPool (<a
href="https://redirect.github.com/evansd/whitenoise/issues/484">#484</a>)</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/9494ff362264b03fb5882dcd059766eaea896ae5"><code>9494ff3</code></a>
Use settings.FORCE_SCRIPT_NAME correctly (<a
href="https://redirect.github.com/evansd/whitenoise/issues/486">#486</a>)</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/c42e93cd5805b0f057eefa72d99e4e10e0ae4f12"><code>c42e93c</code></a>
Make sure SlicedFile is closed properly (<a
href="https://redirect.github.com/evansd/whitenoise/issues/612">#612</a>)</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/f8dff5026750df7e793a2bc7bd3aaf18f3065a85"><code>f8dff50</code></a>
Drop Django 3.2 to 4.1 support (<a
href="https://redirect.github.com/evansd/whitenoise/issues/614">#614</a>)</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/6450820afc42a59e15924d5fc6efcbaa0ab626f6"><code>6450820</code></a>
Drop Python 3.8 support (<a
href="https://redirect.github.com/evansd/whitenoise/issues/613">#613</a>)</li>
<li><a
href="https://github.com/evansd/whitenoise/commit/946a95eadb27edd475d64bda93c69cf70a887025"><code>946a95e</code></a>
[pre-commit.ci] pre-commit autoupdate (<a
href="https://redirect.github.com/evansd/whitenoise/issues/611">#611</a>)</li>
<li>Additional commits viewable in <a
href="https://github.com/evansd/whitenoise/compare/6.7.0...6.8.1">compare
view</a></li>
</ul>
</details>
<br />


mkjpryor pushed a commit to azimuth-cloud/azimuth that referenced this pull request Oct 29, 2024
@petrprikryl (Contributor) commented

Strange, after yesterday's:

> force-pushed the multiprocess branch from 32e2611 to 511b57e

it is 6x slower on my project: 1211 static files copied to '/app/staticfiles', 4971 post-processed.

I've also tested the newest release (6.8.1) and the older 6.7.0 (before the merge), with the same results. No gains.

@Archmonger (Contributor) commented

@petrprikryl I'm curious if it's related to the way the executor is written.

It's written a bit differently on ServeStatic. Would you mind testing ServeStatic==2.1.1?

If so, then I can PR the fixes upstream.

Additionally, what operating system are you executing on?

@adamchainz (Collaborator) commented

I was also wondering if the default of up to 32 threads could lead to detrimental performance. Can you try with 4 or 8 threads on your project? Add `max_workers=` to the `ThreadPoolExecutor` creation.
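Capping the pool for such an experiment looks like this (a generic sketch with a toy task, not the whitenoise call site):

```python
from concurrent.futures import ThreadPoolExecutor

def square(n: int) -> int:
    return n * n

# Cap the pool at 4 workers instead of the stdlib default of
# min(32, (os.cpu_count() or 1) + 4)
with ThreadPoolExecutor(max_workers=4) as executor:
    results = list(executor.map(square, range(8)))

print(results)  # -> [0, 1, 4, 9, 16, 25, 36, 49]
```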

@Archmonger (Contributor) commented

Good thought. It probably makes more sense for it to be `CPU_CORES * 2`.

@Archmonger (Contributor) commented

Just double-checked the standard lib; the default isn't 32.

The default is `min(32, (os.cpu_count() or 1) + 4)`.

@graingert (Contributor) commented

I wonder if the problem is materializing 4971 futures all in one go and waiting on them all.

@Archmonger (Contributor) commented

Creating the future itself should be an O(1) operation since it just enqueues it within the executor.

It would be a bit strange if doing the equivalent of appending objects to a list had a significant performance impact, but I've been proven wrong before 😅
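That enqueue-only behavior of `submit` is easy to confirm with a toy sketch (not whitenoise code): creating many futures returns almost instantly, and the time is spent waiting on the results:

```python
import time
from concurrent.futures import ThreadPoolExecutor

def slow_task(i: int) -> int:
    time.sleep(0.05)
    return i

with ThreadPoolExecutor(max_workers=2) as executor:
    start = time.perf_counter()
    # submit() only enqueues work; none of the ~1s of sleeping happens here
    futures = [executor.submit(slow_task, i) for i in range(20)]
    submit_elapsed = time.perf_counter() - start
    # Waiting on the results is where the time is actually spent
    total = sum(f.result() for f in futures)

assert submit_elapsed < 0.5     # enqueueing 20 futures is cheap
assert total == sum(range(20))  # all tasks still ran to completion
```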

@petrprikryl (Contributor) commented

After my testing, the number of threads isn't related to this. It looks like the problem is the generators:

This is fast:

```python
    def _compress_one(self, name: str) -> list[tuple[str, str]]:
        compressed: list[tuple[str, str]] = []
        path = self.path(name)
        prefix_len = len(path) - len(name)
        for compressed_path in self.compressor.compress(path):
            compressed_name = compressed_path[prefix_len:]
            compressed.append((name, compressed_name))
        return compressed
```

This is 6x slower:

```python
        def _compress_path(path: str) -> Generator[tuple[str, str]]:
            full_path = self.path(path)
            prefix_len = len(full_path) - len(path)
            for compressed_path in self.compressor.compress(full_path):
                compressed_name = compressed_path[prefix_len:]
                yield (path, compressed_name)
```

Tested on i9-11900 (8 cores, 2 threads per core).
Ubuntu 24.04.1 LTS
python:3.13-slim-bookworm Docker image
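One plausible reading of this (my inference, not confirmed in the thread): calling a generator function only builds the generator object, so the pool thread returns immediately, and the body, including the compression, runs later, serially, wherever the generator is consumed. A minimal demonstration:

```python
import threading
from concurrent.futures import ThreadPoolExecutor

ran_in: list[str] = []

def work_generator():
    # A generator's body does not execute when the function is called;
    # calling it (even in a pool thread) just builds the generator object.
    ran_in.append(threading.current_thread().name)
    yield "done"

def work_list() -> list[str]:
    ran_in.append(threading.current_thread().name)
    return ["done"]

with ThreadPoolExecutor() as executor:
    gen = executor.submit(work_generator).result()
    assert ran_in == []                   # no work has happened yet
    list(gen)                             # body runs here, in the main thread
    assert ran_in == [threading.main_thread().name]

    ran_in.clear()
    executor.submit(work_list).result()
    assert ran_in != [threading.main_thread().name]  # ran in a worker thread
```

If that reading is right, the pool in the generator version only parallelizes the creation of generator objects, not the compression itself.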

petrprikryl pushed a commit to petrprikryl/whitenoise that referenced this pull request Oct 29, 2024
@rik deleted the multiprocess branch on October 29, 2024
@adamchainz (Collaborator) commented

> Just double checked standard lib, the default isn't 32.
>
> Default is min(32, (os.cpu_count() or 1) + 4)

Yes, sorry, was taking a shortcut. That works out as 32 on most machines (for now!).

@akx (Contributor) commented Nov 1, 2024

> Yes, sorry, was taking a shortcut. That works out as 32 on most machines (for now!).

That's `min()`, not `max()` 😄 On my MacBook (M2 Max):

```python
>>> import os
>>> min(32, (os.cpu_count() or 1) + 4)
16
```

Development

Successfully merging this pull request may close these issues.

Speeding up generating compressed files
7 participants