I'm checking memory usage with nvidia-smi. With reversibility on (reverse_thres set to twice the input length), it uses 8.8 GB; with it off (reverse_thres set to half the input length), it uses 3.1 GB and is (naturally) faster. The speed difference is expected, but the memory numbers are backwards: reversible layers are supposed to reduce memory, not nearly triple it. What could the problem be here?
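One thing worth ruling out first: nvidia-smi reports everything PyTorch's caching allocator has reserved from the driver, not just live tensors, so it can overstate usage and blur comparisons between runs. Peak *allocated* memory is a more direct way to compare the two settings. A minimal sketch (the `nn.Linear` model and input here are just stand-ins for your Reformer and data):

```python
import torch
import torch.nn as nn

model = nn.Linear(1024, 1024).cuda()          # stand-in for the actual model
x = torch.randn(8, 1024, device="cuda")       # stand-in for the actual input

# Reset the peak counter, run one forward/backward, and read the peak.
torch.cuda.reset_peak_memory_stats()
model(x).sum().backward()                     # placeholder loss, just to trigger backward
print(f"peak allocated: {torch.cuda.max_memory_allocated() / 2**30:.2f} GiB")
```

If the peak allocated memory shows the same inversion as nvidia-smi, the difference is real and not an allocator-caching artifact.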
Same issue observed here. Could it be that PyTorch autograd is smart enough to figure out on its own that it does not need to store every activation to compute the gradients of the reversible layers, so the custom _ReversibleFunction provides no advantage?
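For reference, stock autograd does not do this on its own: it stores whatever the forward pass saves for backward, and recomputing inputs from outputs is exactly what a custom function like _ReversibleFunction exists for. Below is a minimal single-block sketch of the general RevNet-style pattern (not the library's actual _ReversibleFunction): the forward pass runs under no_grad and saves only the outputs, and the backward pass reconstructs the inputs and recomputes the local graphs.

```python
import torch
import torch.nn as nn
from torch.autograd import Function

class ReversibleBlockFn(Function):
    """RevNet-style coupling: y1 = x1 + f(x2), y2 = x2 + g(y1)."""

    @staticmethod
    def forward(ctx, x1, x2, f, g):
        ctx.f, ctx.g = f, g
        with torch.no_grad():              # no graph is built, so no activations are stored
            y1 = x1 + f(x2)
            y2 = x2 + g(y1)
        ctx.save_for_backward(y1, y2)      # outputs only; inputs are recovered in backward
        return y1, y2

    @staticmethod
    def backward(ctx, dy1, dy2):
        y1, y2 = ctx.saved_tensors
        f, g = ctx.f, ctx.g

        # Recompute g(y1) with grad enabled to get dg/dy1 and g's parameter grads.
        with torch.enable_grad():
            y1_ = y1.detach().requires_grad_(True)
            gy1 = g(y1_)
        torch.autograd.backward(gy1, dy2)

        with torch.no_grad():
            x2 = y2 - gy1                  # invert the second coupling
            dy1_total = dy1 + y1_.grad     # total gradient flowing into y1

        # Recompute f(x2) likewise.
        with torch.enable_grad():
            x2_ = x2.detach().requires_grad_(True)
            fx2 = f(x2_)
        torch.autograd.backward(fx2, dy1_total)

        with torch.no_grad():
            dx1 = dy1_total
            dx2 = dy2 + x2_.grad
        return dx1, dx2, None, None

# Usage sketch
f, g = nn.Linear(64, 64), nn.Linear(64, 64)
x1 = torch.randn(8, 64, requires_grad=True)
x2 = torch.randn(8, 64, requires_grad=True)
y1, y2 = ReversibleBlockFn.apply(x1, x2, f, g)
(y1.sum() + y2.sum()).backward()
```

So if memory goes *up* with reversibility enabled, the custom function is likely not the reason autograd "doesn't need" it; something else (how the runs are being measured, or extra buffers held during recomputation) seems more plausible.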