WarmupDecayLR.warmup_num_steps must not be 0 or 1 #771

stas00 · 2021-02-20T17:36:04Z

When using WarmupDecayLR, either

the config checker must ensure that warmup_num_steps is not 0 or 1, because it's a 1 / log(warmup_num_steps)
or the code needs to be smart enough to handle these 2 cases internally, .e.g:

warmup_num_steps = max(2, warmup_num_steps)

Thank you.

Fix: #772

Error log from the test:

    def __init__(self,
                 optimizer: Optimizer,
                 warmup_min_lr: float = 0.0,
                 warmup_max_lr: float = 0.001,
                 warmup_num_steps: int = 1000,
                 last_batch_iteration: int = -1):
    
        self.optimizer = get_torch_optimizer(optimizer)
    
        self.min_lrs = self._format_param(self.optimizer, warmup_min_lr, "min_lr")
        self.max_lrs = self._format_param(self.optimizer, warmup_max_lr, "max_lr")
        self.delta_lrs = [big - small for big, small in zip(self.max_lrs, self.min_lrs)]
        self.warmup_num_steps = warmup_num_steps
>       self.inverse_log_warm_up = 1.0 / math.log(warmup_num_steps)
E       ZeroDivisionError: float division by zero

DeepSpeed/deepspeed/runtime/lr_schedules.py:710: ZeroDivisionError

The text was updated successfully, but these errors were encountered:

stas00 mentioned this issue Feb 20, 2021

[WarmupDecayLR] fix log(0) & 1/log(1) bugs #772

Merged

cli99 closed this as completed in #772 Mar 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WarmupDecayLR.warmup_num_steps must not be 0 or 1 #771

WarmupDecayLR.warmup_num_steps must not be 0 or 1 #771

stas00 commented Feb 20, 2021 •

edited

Loading

WarmupDecayLR.warmup_num_steps must not be 0 or 1 #771

WarmupDecayLR.warmup_num_steps must not be 0 or 1 #771

Comments

stas00 commented Feb 20, 2021 • edited Loading

stas00 commented Feb 20, 2021 •

edited

Loading