I've read your paper, but I don't understand the difference between the GBN and the BN used in the framework. In my understanding, GBN does BN with local data only, yet distributed frameworks also only do BN with each worker's local data. Could you explain the difference?
From what I understood in the paper, they are indeed the same thing. In GBN, you artificially "isolate" parts of the batch when computing the batch statistics, as if those parts were on separate distributed machines, even when you are training on a single system.
@Moxinilian you're right. If you're interested in a more efficient implementation, you could check TensorFlow's BatchNormalization layer with the `virtual_batch_size` parameter. It reshapes the input and normalizes each virtual mini-batch inside the BN layer, instead of making a separate forward pass for each mini-batch.
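To make the reshape trick concrete, here is a minimal NumPy sketch of ghost batch normalization (inference of the idea only, no learnable scale/shift and no running statistics): the full batch is reshaped into virtual mini-batches, and each one is normalized with its own mean and variance, mimicking what each worker would see in distributed training. The function name and signature are illustrative, not from any library.

```python
import numpy as np

def ghost_batch_norm(x, virtual_batch_size, eps=1e-5):
    """Normalize each virtual mini-batch of `x` with its own statistics.

    x: array of shape (batch, features); batch must be divisible by
    virtual_batch_size. Omits the learnable gamma/beta of a real BN layer.
    """
    n = x.shape[0]
    assert n % virtual_batch_size == 0, "batch must split evenly"
    # Reshape to (num_virtual_batches, virtual_batch_size, features).
    splits = x.reshape(n // virtual_batch_size, virtual_batch_size, -1)
    # Per-virtual-batch, per-feature statistics.
    mean = splits.mean(axis=1, keepdims=True)
    var = splits.var(axis=1, keepdims=True)
    # Normalize each virtual mini-batch independently, then restore shape.
    return ((splits - mean) / np.sqrt(var + eps)).reshape(x.shape)

x = np.random.randn(8, 4)
y = ghost_batch_norm(x, virtual_batch_size=4)
```

With `virtual_batch_size` equal to the full batch size this reduces to ordinary BN; smaller values reproduce the noisier statistics each distributed worker would compute locally, which is the point of GBN.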