
pin_memory should be true only if gpus are specified #245

Closed
igormq opened this issue Sep 22, 2020 · 5 comments · Fixed by #388
Labels: bug · good first issue · help wanted

Comments

@igormq

igormq commented Sep 22, 2020

Hey folks,

First of all, thank you for this amazing package. It has saved many hours of coding :)

I was using the pl data modules for the first time when I hit this line:
cifar10_datamodule.py#L132. The problem is that when training/evaluating/testing on CPU, the data module still tries to pin the data, which tries to reserve about a batch's worth of memory on the GPU. Nothing goes wrong if a GPU with enough free memory is available, but when I run in CPU mode while all my GPUs are in use, I get the following error:

RuntimeError: Caught RuntimeError in pin memory thread for device 0.
Original Traceback (most recent call last):
  File "/lib/python3.7/site-packages/torch/utils/data/_utils/pin_memory.py", line 31, in _pin_memory_loop
    data = pin_memory(data)
  File "/lib/python3.7/site-packages/torch/utils/data/_utils/pin_memory.py", line 55, in pin_memory
    return [pin_memory(sample) for sample in data]
  File "lib/python3.7/site-packages/torch/utils/data/_utils/pin_memory.py", line 55, in <listcomp>
    return [pin_memory(sample) for sample in data]
  File "lib/python3.7/site-packages/torch/utils/data/_utils/pin_memory.py", line 47, in pin_memory
    return data.pin_memory()
RuntimeError: cuda runtime error (2) : out of memory at /opt/conda/conda-bld/pytorch_1595629427478/work/aten/src/THC/THCCachingHostAllocator.cpp:278

The fix is simple: we can borrow the code from the pl core implementation, as in pytorch_lightning/accelerators/accelerator_connector.py#L90.
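
In spirit, the guard could look like the sketch below (a minimal illustration of the idea, not the actual bolts or Lightning code; the helper name is made up):

import torch
from torch.utils.data import DataLoader, TensorDataset

def default_pin_memory() -> bool:
    # Pin host memory only when a CUDA device is actually usable; pinning on a
    # CPU-only run costs page-locked (and some device) memory for no benefit,
    # and can raise a CUDA out-of-memory error when the GPUs are already busy.
    return torch.cuda.is_available()

dataset = TensorDataset(torch.randn(100, 3, 32, 32))
loader = DataLoader(dataset, batch_size=32, pin_memory=default_pin_memory())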

Best.

@github-actions

Hi! Thanks for your contribution, great first issue!

@igormq igormq changed the title pin_memory should be true only if it is a gpu training/test/eval pin_memory should be true only if gpus is specified Sep 22, 2020
@igormq igormq changed the title pin_memory should be true only if gpus is specified pin_memory should be true only if gpus are specified Sep 22, 2020
@nateraw
Contributor

nateraw commented Sep 22, 2020

@igormq mind sending a PR? If not, I can address this.

@nateraw
Contributor

nateraw commented Sep 22, 2020

I think the main thing here is just to include a pin_memory flag in the datamodules. I think it makes sense to default it to True either way? Please correct me if I'm wrong 😄
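
Something along these lines, say (a hypothetical sketch of the idea; the class name, defaults, and setup details are illustrative, not the actual bolts code):

from torch.utils.data import DataLoader

class CIFAR10DataModuleSketch:
    # Illustrative only: threads a user-facing pin_memory flag through to the
    # loaders instead of hard-coding pin_memory=True.
    def __init__(self, batch_size: int = 32, num_workers: int = 4,
                 pin_memory: bool = True):
        self.batch_size = batch_size
        self.num_workers = num_workers
        self.pin_memory = pin_memory
        self.dataset_train = None  # assigned in setup()

    def train_dataloader(self):
        return DataLoader(
            self.dataset_train,
            batch_size=self.batch_size,
            num_workers=self.num_workers,
            pin_memory=self.pin_memory,
        )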

@Borda Borda added fix, good first issue, and help wanted labels Oct 15, 2020
@briankosw
Contributor

@nateraw I think beyond the pin_memory flag, some other flags should also be included, such as shuffle and drop_last. What do you think? I can work on a pull request updating the flags in the datamodules.
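
For instance, the per-split loaders could expose those flags with split-appropriate defaults (a hypothetical sketch continuing the datamodule above; it assumes self.shuffle and self.drop_last were added as constructor arguments, and the defaults the eventual PR picks may differ):

    def train_dataloader(self):
        # Shuffling typically defaults to True for training; drop_last can be
        # exposed for users whose models need a fixed batch size.
        return DataLoader(self.dataset_train, batch_size=self.batch_size,
                          shuffle=self.shuffle, drop_last=self.drop_last,
                          num_workers=self.num_workers,
                          pin_memory=self.pin_memory)

    def val_dataloader(self):
        # Evaluation should usually see every sample exactly once, in a fixed
        # order, so shuffle=False and drop_last=False are safer defaults here.
        return DataLoader(self.dataset_val, batch_size=self.batch_size,
                          shuffle=False, drop_last=False,
                          num_workers=self.num_workers,
                          pin_memory=self.pin_memory)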

@nateraw
Contributor

nateraw commented Nov 18, 2020

@briankosw sure! For the shuffle and drop_last flags, let's just make sure we use sensible defaults. 😄 Thanks!!

@Borda Borda added this to the v0.3 milestone Jan 18, 2021
@Borda Borda added the bug label and removed the fix label Jun 20, 2023