max_parallel_tasks and Dask #2609
Comments
Indeed,
Note that this was already the case before #2522 was merged. However, because #2522 builds one task graph for an entire preprocessing task (potentially many output files) instead of one per output file, the task graphs submitted to the scheduler are larger, so the problem will appear a bit more often. I added this note to the tutorial with a small explanation about it.

A nice solution that makes better use of the available Dask workers would be to also build the task graphs on the Dask workers, as implemented in #2316. However, that approach is currently blocked: ESMF only runs in a single thread and therefore does not run on a Dask worker (or maybe it is possible, but I couldn't find a way to configure the workers such that it works). Another potential solution, less optimal but requiring fewer changes, would be to only build the task graphs in parallel using |
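
To illustrate the difference between one task graph per output file and one graph for a whole task, here is a minimal sketch using plain `dask.delayed`. It is not the ESMValCore implementation: `load_input`, `write_output`, and the `"tas"` argument are hypothetical stand-ins, and the synchronous scheduler is used only so that the call counter works in a single process.

```python
import dask
from dask import delayed

calls = []  # records every time the shared step actually runs


def load_input(name):
    """Hypothetical stand-in for an expensive shared preprocessing step."""
    calls.append(name)
    return list(range(10))


def write_output(data, scale):
    """Hypothetical stand-in for producing one output file."""
    return sum(data) * scale


# Building the graph is lazy and happens in the calling process,
# not on the Dask workers.
shared = delayed(load_input)("tas")
outputs = [delayed(write_output)(shared, scale) for scale in (1, 2, 3)]

# One graph per output: the shared step is recomputed for every output.
calls.clear()
separate = [out.compute(scheduler="synchronous") for out in outputs]
calls_separate = len(calls)

# One graph for all outputs: a larger graph, but the shared step runs once.
calls.clear()
combined = dask.compute(*outputs, scheduler="synchronous")
calls_combined = len(calls)
```

In this sketch the combined graph avoids repeating the shared step, at the cost of a larger graph that has to be built in the calling process before anything is submitted to the scheduler.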
Issue reported by @schlunma