Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DiverseMultiMNist config file. #2

Open
cinjon opened this issue Feb 19, 2020 · 8 comments
Open

DiverseMultiMNist config file. #2

cinjon opened this issue Feb 19, 2020 · 8 comments

Comments

@cinjon
Copy link

cinjon commented Feb 19, 2020

Hi, could you please post a config file for the DiverseMultiMNist?

@yaohungt
Copy link

You can modify the config based on Tables 10-12 in the paper.

@cinjon
Copy link
Author

cinjon commented Feb 19, 2020

Is this correct?

{'params': {'backbone': {'kernel_size': 3,
   'output_dim': 1024,
   'input_dim': 3,
   'stride': 2,
   'padding': 1,
   'out_img_size': 18},
  'primary_capsules': {'kernel_size': 3,
   'stride': 2,
   'input_dim': 1024,
   'caps_dim': 64,
   'num_caps': 16,
   'padding': 0,
   'out_img_size': 8},
  'capsules': [{'type': 'CONV',
    'num_caps': 16,
    'caps_dim': 64,
    'kernel_size': 3,
    'stride': 1,
    'matrix_pose': True,
    'out_img_size': 6},
   {'type': 'FC', 'num_caps': 10, 'caps_dim': 64, 'matrix_pose': True}],
  'class_capsules': {'num_caps': 10, 'caps_dim': 64, 'matrix_pose': True}}}

@yaohungt
Copy link

It seems syntactically correct.

@cinjon
Copy link
Author

cinjon commented Feb 19, 2020

Do you mind verifying that it is actually correct please? That would be quite helpful so that I don't spend a week debugging when it's not actually right.

@yaohungt
Copy link

I have no access to machines right now. But your config file is correct except that "True" should be "true".

We also conduct a variant of your config which is the vector-structured pose. This is done by setting 'matrix_pose' to false.

@cinjon
Copy link
Author

cinjon commented Feb 20, 2020

With the config as is, it reaches only 73% accuracy on the diverse multi mnist test set. That's notably lower than the result in the paper.

It also is at 9.98m parameters, which is a tiny bit larger than what you shared. So I suspect it's off in some way from the config you used.

Any idea where it could be off or perhaps some other part that's different between the setup (for CIFAR) in the repo and the setup for DMM? Fwiw, it reaches ~99.99%/100% on the training set np.

@yaohungt
Copy link

I'm not sure your training/test match the setting in the paper or not. You can first try the vanilla Capsules (Dynamic/ EM Capsules) on multimnist, and see what you get.

Moreover, I don't think I reach ~99.99% accuracy on the training set.

@Sharut
Copy link

Sharut commented Jun 29, 2020

Hi @yaohungt ,
Could you please share your code for generating the Diverse Multi MNIST Dataset. When I tried to implement it in MNIST, Using the above config file I am able to cross 85% with the matrix pose itself whereas your number in the paper was 80%.

I really need to figure out where exactly I am messing up or is there something else which is causing this difference?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants