Converting a model into PyTorch #372

p-christ · 2019-06-14T14:01:52Z

Hi,

I'm trying to load a pre-trained model and convert it into a PyTorch model but can't get it to work and was wondering if someone could help me.

I'm able to load the pre-trained model using stable baselines and copy the weights over to a PyTorch model. But then when I play the game the pytorch agent is not able to get the same score as the baselines agent and i am not sure why. It could potentially be because the baselines agent does some extra pre-processing behind the scenes besides just normalising the state to the 0-1 range? Is anyone able to help me?

I've made a colab to demonstrate the problem here: https://colab.research.google.com/drive/1-IIjA1oKUjg5eoctajpl06OoHzU-5-_9

araffin · 2019-06-14T14:22:05Z

Hello,

It could potentially be because the baselines agent does some extra pre-processing behind the scenes besides just normalising the state to the 0-1 range?

There is no such up to my knowledge. However, I would recommend you first trying with a simpler network (e.g. on CartPole-v1), because it gets tricky with convolution (different conventions for pytorch and tensorflow).
I would also suggest you to check all shapes and names before assigning the weights, using .named_parameters() for pytorch.

EDIT: I made it work with CartPole, will share the notebook soon

araffin · 2019-06-14T14:43:06Z

Update: I made it work with CartPole, you can find the notebook here: https://colab.research.google.com/drive/1R-wHO2gLQScx46EIjqj7Sj6MjK-i5Hey

Will try to make it work with the cnn if I have some time this weekend.

araffin · 2019-06-20T15:03:26Z

Update: I'm working on the CNN now, it seems that the problem comes with the first fully connected layer (the conv layer outputs the right thing).

araffin · 2019-06-20T16:01:36Z

The problem comes from the reshape (from conv to fc)

araffin · 2019-06-20T16:21:11Z

@p-christ , I solved the issue doing that before flattening:

# shape before flattening
# tf: (?, 7, 7, 64)
# pytorch: [1, 64, 7, 7]
x = x.permute(0, 2, 3, 1).contiguous()
x = x.view(x.size(0), -1)

araffin · 2019-06-20T16:33:07Z

@p-christ I made a working colab notebook: https://colab.research.google.com/drive/1XwCWeZPnogjz7SLW2kLFXEJGmynQPI-4

The problem came from tensorflow/pytorch differences, not SB.

Closing the issue.

p-christ · 2019-06-26T12:45:02Z

thanks a lot

Chainesh · 2024-04-01T07:33:22Z

Can learnings from this be documented, also if it's possible to make a function to do same in sb3?

araffin added the question Further information is requested label Jun 14, 2019

araffin closed this as completed Jun 20, 2019

Miffyli mentioned this issue Sep 15, 2019

Add documentation on exporting models #475

Merged

araffin mentioned this issue Apr 5, 2020

[question] Multimodal input #784

Closed

Miffyli mentioned this issue Apr 27, 2020

V3 new backend: PyTorch? and the future of Stable Baselines #733

Closed

crobarcro mentioned this issue Jul 5, 2020

Trained policy export to ONNX via PyTorch #922

Closed

araffin mentioned this issue Jul 20, 2020

Match performance with stable-baselines (discrete case) DLR-RM/stable-baselines3#110

Merged

20 tasks

araffin mentioned this issue Nov 25, 2020

Where are the 120+ Trained Agents? DLR-RM/rl-trained-agents#1

Closed

Hurdle97 mentioned this issue Nov 25, 2023

[Question] How to correctly export a TD3 model DLR-RM/stable-baselines3#1766

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Converting a model into PyTorch #372

Converting a model into PyTorch #372

p-christ commented Jun 14, 2019

araffin commented Jun 14, 2019 •

edited

Loading

araffin commented Jun 14, 2019

araffin commented Jun 20, 2019

araffin commented Jun 20, 2019

araffin commented Jun 20, 2019

araffin commented Jun 20, 2019

p-christ commented Jun 26, 2019

Chainesh commented Apr 1, 2024

Converting a model into PyTorch #372

Converting a model into PyTorch #372

Comments

p-christ commented Jun 14, 2019

araffin commented Jun 14, 2019 • edited Loading

araffin commented Jun 14, 2019

araffin commented Jun 20, 2019

araffin commented Jun 20, 2019

araffin commented Jun 20, 2019

araffin commented Jun 20, 2019

p-christ commented Jun 26, 2019

Chainesh commented Apr 1, 2024

araffin commented Jun 14, 2019 •

edited

Loading