
Improved caffe converter #6822

Merged
3 commits merged into apache:master from arikpoz:improved_caffe_converter on Jun 28, 2017
Conversation

@arikpoz (Contributor) commented Jun 26, 2017

Extended the caffe-to-mxnet converter and improved the converter test

  • Added support for networks that use batch normalization without a scale layer following the batch norm, i.e. gamma is fixed to 1 (see the sketch after this list)
  • Extended the naming conventions recognized when batch normalization is implemented in caffe
  • Added support for old caffe versions where dilation didn't exist. This is needed to convert models that depend on old caffe
  • Added support for the deconvolution layer
  • Added support for older versions of caffe where the kernel_size, pad and stride parameters were not iterable
  • Fixed a crash that happened when a bottom layer doesn't exist in the internal top_to_layers dictionary; this can happen if the name of the input is not 'data'
  • Added ignore-by-design support for converting 'Crop' layers
  • Fixed the batch norm layer comparison to take the rescaling factor into account
  • Added a check in the tester to swap the (RGB,BGR) input channels only if there are 3 or 4 channels, which is the same check the conversion does
  • Allowed comparing layers of models with no mean file
  • Added support for comparing the parameters of deconvolution layers
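
For illustration, here is a minimal sketch of two of these cases (hypothetical helper names, not the converter's actual code): building batch norm parameters when no scale layer follows, while applying caffe's rescaling factor, and reading kernel_size/pad/stride values that are repeated fields in newer caffe protos but plain scalars in older ones.

```python
import numpy as np

def batchnorm_params(bn_blobs, scale_blobs=None):
    """Build gamma/beta/mean/var arrays from a Caffe BatchNorm layer.

    bn_blobs: the BatchNorm layer's blobs [mean, variance, rescale_factor].
    scale_blobs: blobs [gamma, beta] of a following Scale layer, or None when
    the network uses BatchNorm alone (gamma is then fixed to 1).
    (Hypothetical helper, illustrating the idea only.)
    """
    rescale = float(np.asarray(bn_blobs[2]).flatten()[0])
    rescale = 1.0 / rescale if rescale != 0 else 0.0
    mean = np.asarray(bn_blobs[0]).flatten() * rescale  # un-scale stored stats
    var = np.asarray(bn_blobs[1]).flatten() * rescale
    if scale_blobs is not None:
        gamma = np.asarray(scale_blobs[0]).flatten()
        beta = np.asarray(scale_blobs[1]).flatten()
    else:
        gamma = np.ones_like(mean)  # no Scale layer: gamma = 1
        beta = np.zeros_like(mean)  # and beta = 0
    return gamma, beta, mean, var

def scalar_param(value, default=0):
    """Return a single int from a conv/deconv parameter that may be a
    repeated field (newer Caffe protos) or a plain scalar (older protos)."""
    try:
        return int(value[0]) if len(value) > 0 else default
    except TypeError:
        return int(value)  # old Caffe: already a scalar
```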

@piiswrong (Contributor)

Please revert the submodule change.

@arikpoz (Contributor, Author) commented Jun 26, 2017

Done.

@arikpoz (Contributor, Author) commented Jun 26, 2017

The build still fails. @piiswrong, can you check the log to see if this is something I can fix?

@piiswrong (Contributor)

The error is due to the submodule change: https://github.com/dmlc/mxnet/pull/6822/files

@arikpoz (Contributor, Author) commented Jun 26, 2017

But I reverted that change, or so I thought.
I think a rebase from upstream introduced it in the first place.
Can you suggest how to fix it?

@piiswrong (Contributor) commented Jun 26, 2017

Try checking out upstream master and cherry-picking your changes.

@arikpoz force-pushed the improved_caffe_converter branch from bb5af5c to c671c9e on June 26, 2017 21:02
@arikpoz (Contributor, Author) commented Jun 27, 2017

The submodule issue seems resolved.

@piiswrong (Contributor)

Is the caffe converter tested? Maybe we should set up CI?

@mli Could you have a look?

@arikpoz (Contributor, Author) commented Jun 27, 2017

The only error is due to a test timeout. Can someone check this, and maybe update the timeout?

@piiswrong requested a review from mli on June 27, 2017 17:14
@mli (Contributor) commented Jun 27, 2017

The caffe converter is tested in the CI, but we only try to convert a few common CNNs, such as vgg/resnet. Each test is expensive.

@arikpoz, do you have a better idea of how to test the converter?

@arikpoz (Contributor, Author) commented Jun 27, 2017

The current test seems good to me: it checks both end-to-end performance and layer-by-layer outputs.

What takes time is the performance test, since we run inference on many images.
To save time, we can remove the performance test and trust that if the layer-by-layer outputs are close enough, the end performance should be the same.

The layer-by-layer test is done on a single image, comparing all the outputs (and network parameters) of the caffe network and the mxnet network. Since we run inference on only one image, it is very fast to execute (see the sketch below).

my 2c.
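
As a rough illustration of what such a layer-by-layer check can look like (a hypothetical helper, not the actual test code; collecting the per-layer activations from the caffe and mxnet networks is assumed to be done by the caller):

```python
import numpy as np

def compare_layer_outputs(caffe_outputs, mxnet_outputs, rtol=1e-3, atol=1e-5):
    """Compare per-layer activations of the original Caffe network and the
    converted MXNet network, computed on the same single input image.

    Both arguments are dicts mapping layer/blob name -> numpy array; how
    they are collected is left to the caller.
    """
    mismatches = []
    for name, caffe_out in caffe_outputs.items():
        mx_out = mxnet_outputs.get(name)
        if mx_out is None:
            print('%-30s missing in the MXNet model' % name)
            continue
        max_diff = float(np.max(np.abs(caffe_out - mx_out)))
        ok = np.allclose(caffe_out, mx_out, rtol=rtol, atol=atol)
        print('%-30s %s  (max abs diff %.3e)'
              % (name, 'OK' if ok else 'MISMATCH', max_diff))
        if not ok:
            mismatches.append(name)
    return mismatches
```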

@piiswrong merged commit 8c81ee4 into apache:master on Jun 28, 2017
@arikpoz deleted the improved_caffe_converter branch on June 29, 2017 05:12
Guneet-Dhillon pushed a commit to Guneet-Dhillon/mxnet that referenced this pull request on Sep 13, 2017 (apache#6822)
