-
-
Notifications
You must be signed in to change notification settings - Fork 428
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MPE Continuous Action support #419
Conversation
Hey I haven't been tracking your conversation on discord, but you understand that our MPE environments are rather substantially fixed upband can't be directly compared to the ones in the original repo right? |
And does Ben have the needed information to look into the "supersuit related bug"? |
Yes, I understand-however, I think having the option of using continuous action spaces makes it much easier to use PettingZoo MPE with most papers using MPE, just because they've been using it as well. I've still kept the default actions to discrete, so the original behavior isn't affected. About simple_world_comm_v2 - I'm not sure what the bug is. If you run the script above with --env-name=simple_world_comm_v2, it crashes with different errors with SuperSuit 2.6.6 and 3.0.1, but it's about incorrect observation shapes for one of the agents with observation shape Discrete(34). |
So I enabled CI tests for your PR, and it looks like
Let me know if you have any questions. |
Note that supersuit is not in those tests, so it is not a supersuit issue. |
Ok, so I am reviewing the PR, and it generally looks good, really good job figuring out the tests and documentation. One issue is that the continuous action space for environments with both communication and movement (like simple_world_comm_v2) is not what it should be (this might be what is causing the test to fail). In particular, if you have 5 movement options and 4 communication options, then in a continuous space, the action should be of size 9 (the actions are concatenated), and in a discrete space, it should be of size 20 (the actions are a cross product of the two subspaces). The continuous space should break the action down into the movement and communication components differently in the continuous and discrete cases here: https://github.com/PettingZoo-Team/PettingZoo/blob/master/pettingzoo/mpe/_mpe_utils/simple_env.py#L94 The documentation should reflect this. Thanks again for working on this! |
Ok, so the action space and action handling looks good now. I don't like the way that the rendering misrepresents the communication (it renders the argmax of the actions), but that is inherited from the original MPE, and I don't know if its worth fixing for now. So I am happy with this PR. @jkterry1 Do you want to look over the documentation? I think its fine, but I have low standards, as you know. |
"I don't like the way that the rendering misrepresents the communication (it renders the argmax of the actions), but that is inherited from the original MPE, and I don't know if its worth fixing for now." @Rohan138 would you be willing to work on this? |
@benblack769 the docs look fine..? what body am I supposed to be looking for under the rug here |
@jkterry1 I don't see any dead bodies either, I'm just checking that you don't. |
I added something-just printing out the entire comm, rounded to 2dp - that's the best thing I could think of, although I agree that the argmax is unsatisfying. Perhaps a new PR? |
Can you check the "Allow edits from maintainers" box? There are a couple minor things I want to change. |
It's already checked, are you unable to edit? |
Ben that feature really doesn't work reliably on GitHub. It's a recurring
problem with my PRs to external repos too that I've never seen a solution
for.
On Sun, Jul 11, 2021 at 7:53 PM Rohan138 ***@***.***> wrote:
It's already checked, are you unable to edit?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#419 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AEUF33D2QI2NK4G4WR3UPWTTXJKJTANCNFSM5AAAFO7Q>
.
--
Thank you for your time,
Justin Terry
|
Hi, any updates on this? |
I'll try to meet with Ben today and deal with this |
Sorry for ignoring this. I just fiddled around a bit with the rendering, no real complains. |
So I am happy with this being merged |
I've added support for continuous actions in MPE as discussed in #249 through the continuous_actions=False argument in the environment config.
I've tested my changes on all environments using RLLib MADDPG, and run ./release_test.sh. The latter needed some updates as well.
All environments are working except simple_world_comm_v2, which seems to have a supersuit-related bug.