@qgallouedec one easy fix is to tell users to load the parameters with exact_match=False and then save the policy again.
I'm working on a better one (or at least one that doesn't require user action).
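A minimal sketch of the manual workaround mentioned above, assuming the old checkpoint can be read through `set_parameters`, which accepts a path and an `exact_match` flag. The helper name `resave_with_relaxed_matching` and its arguments are hypothetical. Note that this rebuilds the model with default hyperparameters and only restores the weights, so it is an illustration rather than a full migration:

```python
def resave_with_relaxed_matching(algo_cls, env, old_path, new_path, policy="CnnPolicy"):
    """Load weights from an old checkpoint without strict key matching, then re-save.

    algo_cls is an SB3 algorithm class such as PPO or A2C (passed in by the caller).
    Assumption: hyperparameters stored in the old checkpoint are NOT restored here,
    only the parameters handled by set_parameters().
    """
    # Build a fresh model so that the policy has the new parameter layout.
    model = algo_cls(policy, env)
    # exact_match=False tolerates keys missing from the old checkpoint.
    model.set_parameters(old_path, exact_match=False)
    # Saving again writes a checkpoint with the current key layout.
    model.save(new_path)
    return model
```

Usage would look like `resave_with_relaxed_matching(PPO, env, "old_model.zip", "new_model.zip")`, where `env` is an environment matching the one used for training and both paths are placeholders.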
🐛 Bug
Bug introduced in #1148
We might also want to check that we are not saving the weights of the features extractor twice.
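One possible way to look for duplicated entries, sketched under the assumption that aliased parameters appear in the state dict as tensors pointing at the same memory. The helper `find_shared_entries` is hypothetical and not part of SB3; it groups entries by `data_ptr()` when the value provides it and falls back to object identity otherwise:

```python
from collections import defaultdict


def find_shared_entries(state_dict):
    """Return groups of state-dict keys whose values refer to the same underlying data."""
    groups = defaultdict(list)
    for name, value in state_dict.items():
        # PyTorch tensors expose data_ptr(); for other objects use id() as a stand-in.
        key = value.data_ptr() if hasattr(value, "data_ptr") else id(value)
        groups[key].append(name)
    # Only keys that share their data with at least one other key are of interest.
    return [names for names in groups.values() if len(names) > 1]
```

Calling it on `model.policy.state_dict()` would list the key names that alias each other. Whether such aliases actually end up stored twice in the saved file is a separate question that this check does not answer.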
To Reproduce
Load a pretrained agent with a CNN policy that was saved with SB3 v1.6.2 (for example PPO, or A2C as in the RL Zoo command below):
python enjoy.py --algo a2c --env BreakoutNoFrameskip-v4
Found by updating the RL Zoo: https://github.com/DLR-RM/rl-baselines3-zoo/actions/runs/3760560053/jobs/6391404574
Relevant log output / Error message
System Info
No response
Checklist