
LocalCUDACluster's memory limit: None means no limit #943

Merged

Conversation

@madsbk (Member) commented Jul 4, 2022

Use a regular dict when creating a LocalCUDACluster with no host and no device memory limit.

Currently, setting device_memory_limit=None translates to the total available GPU memory. However, in some cases DeviceHostFile overestimates GPU memory usage, which can trigger spilling even though device_memory_limit=None.
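The logic of the fix can be sketched as follows. This is an illustrative, simplified sketch, not the actual dask-cuda code: `DeviceHostFileStub` is a hypothetical stand-in for dask_cuda's `DeviceHostFile`, which tracks device memory usage and spills to host/disk.

```python
class DeviceHostFileStub(dict):
    """Hypothetical placeholder for dask_cuda.device_host_file.DeviceHostFile,
    which tracks device memory usage and spills objects to host memory/disk."""

    def __init__(self, device_memory_limit=None, memory_limit=None):
        super().__init__()
        self.device_memory_limit = device_memory_limit
        self.memory_limit = memory_limit


def choose_worker_data(memory_limit=None, device_memory_limit=None):
    """Pick the mapping to use as the worker's data store.

    With neither a host nor a device memory limit set, return a plain
    dict so no spilling machinery (and none of its memory-usage
    estimates) is ever involved.
    """
    if memory_limit is None and device_memory_limit is None:
        # No limits requested: a regular dict means *no* spilling,
        # even if device memory usage would have been overestimated.
        return {}
    return DeviceHostFileStub(
        device_memory_limit=device_memory_limit,
        memory_limit=memory_limit,
    )
```

With this, `device_memory_limit=None` together with `memory_limit=None` genuinely means "no limit", because the spilling container is never constructed.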

@madsbk madsbk added bug Something isn't working 2 - In Progress Currently a work in progress non-breaking Non-breaking change labels Jul 4, 2022
@github-actions github-actions bot added the python python code needed label Jul 4, 2022
@codecov-commenter commented Jul 4, 2022

Codecov Report

❗ No coverage uploaded for pull request base (branch-22.08@b671e8d). Click here to learn what that means.
The diff coverage is n/a.

❗ Current head 45e246b differs from pull request most recent head c1a012d. Consider uploading reports for the commit c1a012d to get more accurate results

@@              Coverage Diff               @@
##             branch-22.08    #943   +/-   ##
==============================================
  Coverage                ?   0.00%           
==============================================
  Files                   ?      16           
  Lines                   ?    2105           
  Branches                ?       0           
==============================================
  Hits                    ?       0           
  Misses                  ?    2105           
  Partials                ?       0           

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b671e8d...c1a012d. Read the comment docs.

@madsbk changed the title from "LocalCUDACluster: no memory limits means *no* limit" to "LocalCUDACluster's memory limit: no means no" Jul 4, 2022
@madsbk madsbk marked this pull request as ready for review July 4, 2022 14:20
@madsbk madsbk requested a review from a team as a code owner July 4, 2022 14:20
@madsbk madsbk added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currently a work in progress labels Jul 4, 2022
@pentschev (Member) left a comment


LGTM, thanks @madsbk .

@pentschev changed the title from "LocalCUDACluster's memory limit: no means no" to "LocalCUDACluster's memory limit: None means no limit" Jul 4, 2022
@pentschev commented

@gpucibot merge

@rapids-bot rapids-bot bot merged commit 8dba7d1 into rapidsai:branch-22.08 Jul 4, 2022
@madsbk madsbk deleted the CUDAWorker_no_device_limit branch January 24, 2023 12:23
4 participants