Question about batch and subdivisions #1736

ydixon · 2018-10-08T18:11:23Z

In this cfg, batch=64 and subdivisions=16, so the real (mini-)batch size should be 64 / 16 = 4.

My question is whether the gradients of all 64 images are accumulated before the model is updated, or whether the weights are updated after every iteration of 4 images.


AlexeyAB · 2018-10-08T21:54:51Z

The weights are updated once per batch_cfg = 64 images.

mini_batch = net.batch = batch_cfg / subdivisions_cfg

darknet/src/network.c

Lines 315 to 326 in 24b6045

    
    int batch = net.batch;
    int n = d.X.rows / batch;
    float *X = calloc(batch*d.X.cols, sizeof(float));
    float *y = calloc(batch*d.y.cols, sizeof(float));
    int i;
    float sum = 0;
    for(i = 0; i < n; ++i){
        get_next_batch(d, batch, i*batch, X, y);
        float err = train_network_datum(net, X, y);
        sum += err;
    }

float train_network_datum(network net, float *x, float *y) {

...

darknet/src/network.c

Line 290 in 24b6045

if(((*net.seen)/net.batch)%net.subdivisions == 0) update_network(net);
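To make the accumulation behaviour concrete, here is a minimal toy sketch with a single weight. It is not darknet code: `toy_net` and `toy_train_mini_batch` are hypothetical names, the gradient is passed in as a plain number, and dividing the accumulated gradient by batch_cfg is an assumption made for illustration. Only the update condition is taken from the line quoted above.

```c
#include <assert.h>

/* Toy stand-in for a network with one weight (hypothetical, not darknet's struct). */
typedef struct {
    float weight;    /* the single trainable parameter */
    float grad_sum;  /* gradient accumulated since the last update */
    int seen;        /* images processed so far, playing the role of *net.seen */
    int updates;     /* number of weight updates applied so far */
} toy_net;

/* Process one mini-batch: accumulate its gradient, and apply the accumulated
   gradient only when the condition from network.c line 290 holds. */
void toy_train_mini_batch(toy_net *net, float mini_batch_grad,
                          int mini_batch, int subdivisions, float lr)
{
    assert(mini_batch > 0 && subdivisions > 0);
    net->seen += mini_batch;            /* assumed to happen before the check */
    net->grad_sum += mini_batch_grad;   /* accumulate, do not apply yet */
    if ((net->seen / mini_batch) % subdivisions == 0) {
        int batch_cfg = mini_batch * subdivisions;
        /* Assumed normalization: average over the whole batch_cfg. */
        net->weight -= lr * net->grad_sum / batch_cfg;
        net->grad_sum = 0.0f;
        net->updates += 1;
    }
}
```

With mini_batch = 4 and subdivisions = 16, the first 15 calls leave `weight` unchanged, and the 16th call (64 images seen) applies one update.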

ydixon · 2018-10-09T04:20:10Z

Thanks

ydixon · 2018-10-15T10:32:44Z

@AlexeyAB Sorry for resurrecting this. If the condition to decide whether to update the network is if(((*net.seen)/net.batch)%net.subdivisions == 0) update_network(net);, doesn't it mean the weights will update every batch_cfg instead of mini-batch?

For example:

batch_cfg=64, subdivisions_cfg=16
net.batch = batch_cfg / subdivisions_cfg = 4
net.subdivisions = 16

Case 1: net.seen = 60

(net.seen/net.batch)%net.subdivisions = (60/4) % 16 = 15
Do not update

Case 2: net.seen = 128

(net.seen/net.batch)%net.subdivisions = (128/4) % 16 = 0
Update
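The two cases can also be checked mechanically. The sketch below wraps the quoted condition in a hypothetical helper, `should_update` (not a darknet function), and takes the modulus by net.subdivisions = 16, as the condition itself does.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical helper mirroring the condition
   ((*net.seen)/net.batch) % net.subdivisions == 0 from network.c. */
bool should_update(int seen, int net_batch, int net_subdivisions)
{
    assert(net_batch > 0 && net_subdivisions > 0);
    return (seen / net_batch) % net_subdivisions == 0;
}
```

With net_batch = 4 and net_subdivisions = 16, `should_update(60, 4, 16)` is false because 60/4 = 15, while `should_update(64, 4, 16)` and `should_update(128, 4, 16)` are both true, so the condition fires once per 64 images.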

AlexeyAB · 2018-10-15T11:30:33Z

@ydixon Yes, weights will be updated for each batch_cfg.

ydixon · 2018-10-15T16:01:15Z

@AlexeyAB Thanks for the quick response!

kmsravindra · 2018-11-19T11:29:02Z

@AlexeyAB, can I train with batch=128 so that the trained model generalizes better than with batch=64? In that case, would I have to train for almost double the number of iterations compared with batch=64? So the batch size could be a hyperparameter that affects mAP. Is my understanding correct?

sctrueew · 2019-12-13T12:05:26Z

@AlexeyAB Hi,

I have 200k images, about 200 classes, and two RTX 2080 Ti GPUs. My model is Gaussian.cfg. What batch and subdivisions should I set?

Thanks in advance.

AlexeyAB · 2019-12-13T12:17:19Z

batch=64 subdivisions=16

The lower the subdivisions, the better.
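Since mini_batch = batch_cfg / subdivisions_cfg, a lower subdivisions value means more images per forward/backward pass, which in practice is limited by GPU memory. The sketch below is one possible way to pick a value, not a darknet feature: `pick_subdivisions` is a hypothetical helper, and `max_images_per_pass` is an assumed figure that has to be found by trial on the actual GPU and cfg.

```c
#include <assert.h>

/* Hypothetical helper: return the smallest subdivisions value that divides
   batch_cfg evenly and keeps the mini-batch (batch_cfg / subdivisions)
   at or below max_images_per_pass. Returns -1 if the limit is below 1. */
int pick_subdivisions(int batch_cfg, int max_images_per_pass)
{
    assert(batch_cfg > 0);
    if (max_images_per_pass < 1) return -1;
    for (int s = 1; s <= batch_cfg; ++s) {
        if (batch_cfg % s == 0 && batch_cfg / s <= max_images_per_pass) {
            return s;  /* first match is the lowest valid subdivisions */
        }
    }
    return -1;  /* unreachable when max_images_per_pass >= 1 */
}
```

For batch=64, a limit of 4 images per pass gives subdivisions=16, and a limit of 8 gives subdivisions=8.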

sctrueew · 2019-12-13T12:45:59Z

@AlexeyAB Hi,

Thanks. Can I stop the training, change the subdivisions, and then continue training?

AlexeyAB · 2019-12-13T12:46:27Z

yes

ydixon mentioned this issue Oct 15, 2018

different training results ultralytics/yolov3#22

Closed

This was referenced Apr 5, 2021

How to detect diffrent color for yolov4-mishx ? #7554

Open

Subdivisions vs network resolution #7556

Open

jwchoi384 mentioned this issue Jul 5, 2021

KITTI Training Performance on 3dop split jwchoi384/Gaussian_YOLOv3#69

Closed
