SimCLR: using LARS with ddp_sharded causes TypeError: unexpected keyword argument 'lr' #562

Closed · hchau630 opened this issue Feb 17, 2021 · 1 comment · Fixed by #613
Labels: bug (Something isn't working), help wanted (Extra attention is needed)


hchau630 commented Feb 17, 2021

🐛 Bug

Using a trainer with accelerator='ddp' and plugins='ddp_sharded' to train a SimCLR model with lars_wrapper=True causes the following error:

Traceback (most recent call last):
  File "/share/ctn/users/hc3190/issa/disentangle/bug.py", line 15, in <module>
    trainer.fit(model, datamodule=dm)
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/pytorch_lightning/trainer/trainer.py", line 510, in fit
    results = self.accelerator_backend.train()
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/pytorch_lightning/accelerators/ddp_accelerator.py", line 158, in train
    results = self.ddp_train(process_idx=self.task_idx, model=model)
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/pytorch_lightning/accelerators/ddp_accelerator.py", line 301, in ddp_train
    model = self.configure_ddp(model, device_ids)
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/pytorch_lightning/accelerators/ddp_accelerator.py", line 318, in configure_ddp
    model = self.ddp_plugin.configure_ddp(model, device_ids)
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/pytorch_lightning/plugins/sharded_plugin.py", line 38, in configure_ddp
    self._wrap_optimizers(model)
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/pytorch_lightning/plugins/sharded_plugin.py", line 60, in _wrap_optimizers
    self._reinit_with_fairscale_oss(trainer)
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/pytorch_lightning/plugins/sharded_plugin.py", line 69, in _reinit_with_fairscale_oss
    zero_optimizer = OSS(
  File "/home/hc3190/.conda/envs/pytorch_env/lib/python3.8/site-packages/fairscale/optim/oss.py", line 89, in __init__
    self.optim = optim(self.partition_parameters()[self.rank], **default)
TypeError: __init__() got an unexpected keyword argument 'lr'

The error disappears when either lars_wrapper=False or plugins=None. I suspect this happens because LARSWrapper is not a subclass of torch.optim.Optimizer and, unlike the standard torch optimizers, does not accept an 'lr' keyword argument; fairscale nevertheless treats LARSWrapper as a regular optimizer class and passes the 'lr' keyword argument anyway when re-instantiating it.
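
A minimal sketch of the suspected mismatch (the LARSWrapper import path here assumes pl_bolts 0.3.0):

# fairscale's OSS rebuilds the optimizer via
#     self.optim = optim(self.partition_parameters()[self.rank], **default)
# (see the traceback above), where `default` comes from the wrapped SGD and
# therefore contains 'lr'. LARSWrapper wraps an already-constructed
# optimizer and takes no 'lr' keyword, so that re-instantiation fails.
import torch
from pl_bolts.optimizers.lars_scheduling import LARSWrapper

params = [torch.nn.Parameter(torch.zeros(1))]
base = torch.optim.SGD(params, lr=0.1)
wrapped = LARSWrapper(base)  # wraps an existing optimizer

print(isinstance(wrapped, torch.optim.Optimizer))  # False
type(wrapped)(params, lr=0.1)  # TypeError: __init__() got an unexpected keyword argument 'lr'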

To Reproduce

Run the following code sample.

Code sample

import pytorch_lightning as pl
from pl_bolts.datamodules import ImagenetDataModule
from pl_bolts.models.self_supervised import SimCLR

IMAGENET_DIR_PATH = "/path/to/imagenet"
gpus = 4
batch_size = 32

dm = ImagenetDataModule(data_dir=IMAGENET_DIR_PATH, batch_size=batch_size)
# The failing combination: sharded DDP plus the LARS wrapper.
trainer = pl.Trainer(gpus=gpus, accelerator='ddp', plugins='ddp_sharded', fast_dev_run=True)
model = SimCLR(gpus, dm.num_samples, batch_size, 'imagenet', lars_wrapper=True)
trainer.fit(model, datamodule=dm)

Expected behavior

Expect no errors.
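
Workaround

Per the observation above, the only workaround I have found is to drop one of the two pieces. A minimal sketch keeping sharded training and disabling the wrapper (the reverse, lars_wrapper=True with plugins=None, also works):

# Keep ddp_sharded but disable the LARS wrapper.
model = SimCLR(gpus, dm.num_samples, batch_size, 'imagenet', lars_wrapper=False)
trainer = pl.Trainer(gpus=gpus, accelerator='ddp', plugins='ddp_sharded', fast_dev_run=True)
trainer.fit(model, datamodule=dm)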

Environment

  • PyTorch: 1.7.0
  • Lightning version: 1.1.8
  • Lightning bolts version: 0.3.0
  • Fairscale version: 0.1.6
  • Python version: 3.8.5
  • CUDA version: 11.1
  • GPU models and configuration: GeForce RTX 2080 Ti (x4)
hchau630 added the fix and help wanted labels on Feb 17, 2021
github-actions commented

Hi! Thanks for your contribution, great first issue!

Borda added the bug label and removed the fix label on Jun 20, 2023