fix: small bug in the pettingzoo wrapper related to legal action masking #432

jcformanek · 2022-02-24T08:56:59Z

What?

The legal action mask for PettingZoo Pong was causing the MADQN example to fail. MADQN expects the legal action mask to be a vector of ones and zeros with one entry per action, indicating whether that action is valid. The PettingZoo wrapper was returning only a single value of one for the mask.

Also fixed a problem in the TicTacToe example: the MADQN executor needs a select_action method for single agents.
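To illustrate the mask shape described above, here is a minimal sketch in plain Python. It is not the Mava wrapper code: `as_action_mask` and `masked_greedy_action` are hypothetical helper names, and treating a missing mask as "all actions legal" is an assumption made for the example.

```python
from typing import List, Optional, Sequence


def as_action_mask(mask: Optional[Sequence[float]], num_actions: int) -> List[float]:
    """Return a mask with exactly one 0/1 entry per action.

    Assumption for this sketch: if the environment supplies no mask,
    every action is treated as legal.
    """
    if mask is None:
        return [1.0] * num_actions
    mask = [float(m) for m in mask]
    # A single value (the failure mode described above) is rejected here.
    if len(mask) != num_actions:
        raise ValueError(f"expected {num_actions} mask entries, got {len(mask)}")
    if any(m not in (0.0, 1.0) for m in mask):
        raise ValueError("mask entries must be 0 or 1")
    return mask


def masked_greedy_action(q_values: Sequence[float], mask: Sequence[float]) -> int:
    """Pick the highest-valued action among those the mask marks as legal."""
    legal = [i for i, m in enumerate(mask) if m == 1.0]
    if not legal:
        raise ValueError("no legal actions available")
    return max(legal, key=lambda i: q_values[i])
```

With six actions, `as_action_mask(None, 6)` yields six ones, while `as_action_mask([1.0], 6)` raises a `ValueError` because the mask has the wrong length.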

Why?

The MADQN example on Pong was failing.

How?

Small fix to the legal action mask in the PettingZoo wrapper.

Added a select_action method for single agents to the MADQN executor, which sequential games like Tic Tac Toe require.
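The distinction between acting for one agent and acting for all agents can be sketched as follows. This is an illustrative toy, not the Mava MADQN executor: the class name `SketchExecutor`, the `select_actions` method, the argument order, and the use of plain callables as Q-functions are all assumptions.

```python
from typing import Callable, Dict, Sequence

# Hypothetical stand-in for a per-agent Q-network: observation -> Q-values.
QFunction = Callable[[Sequence[float]], Sequence[float]]


class SketchExecutor:
    """Toy executor exposing both single-agent and multi-agent selection."""

    def __init__(self, q_functions: Dict[str, QFunction]) -> None:
        self._q_functions = q_functions

    def select_action(
        self,
        agent: str,
        observation: Sequence[float],
        legal_actions: Sequence[float],
    ) -> int:
        """Greedy action for ONE agent, as a sequential game needs each turn."""
        q_values = self._q_functions[agent](observation)
        legal = [i for i, m in enumerate(legal_actions) if m == 1]
        if not legal:
            raise ValueError(f"no legal actions for {agent}")
        return max(legal, key=lambda i: q_values[i])

    def select_actions(
        self,
        observations: Dict[str, Sequence[float]],
        legal_actions: Dict[str, Sequence[float]],
    ) -> Dict[str, int]:
        """Actions for ALL agents at once, as a simultaneous-move game needs."""
        return {
            agent: self.select_action(agent, obs, legal_actions[agent])
            for agent, obs in observations.items()
        }
```

In a turn-based game only the agent whose turn it is acts, so the environment loop would call `select_action` once per step. In a simultaneous-move game it would call `select_actions` with every agent's observation.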

Extra

N/A

jcformanek · 2022-02-24T09:12:24Z

@AsadJeewa see the fixes here.

arnupretorius

Looks good to me. Thanks @jcformanek 👍

We should just benchmark to make sure it is working as expected.

AsadJeewa · 2022-02-24T09:20:06Z

Thanks for the quick fix @jcformanek

AsadJeewa

Looks good and runs on my side without any errors

fix: small bug in the pettingzoo wrapper related to legal action masking

6d5c478

jcformanek requested review from arnupretorius, KaleabTessera, DriesSmit, mmorris44, AsadJeewa and RuanJohn as code owners February 24, 2022 08:56

jcformanek self-assigned this Feb 24, 2022

jcformanek added the bug Something isn't working label Feb 24, 2022

jcformanek linked an issue Feb 24, 2022 that may be closed by this pull request

[BUG] Recurrent MADQN failing with shape errors on Pong environment #430

Closed

fix: missing select_action method in madqn executor breaks tick tack toe example

aef5e2b

pull-request-size bot added the size/M label Feb 24, 2022

jcformanek linked an issue Feb 24, 2022 that may be closed by this pull request

[BUG] Openspiel TicTacToe example (MADQN) error #431

Closed

arnupretorius approved these changes Feb 24, 2022

AsadJeewa approved these changes Feb 24, 2022

mmorris44 reviewed Feb 24, 2022

mava/wrappers/pettingzoo.py (resolved)

mava/systems/tf/madqn/execution.py (resolved)

mava/wrappers/pettingzoo.py (resolved)

mmorris44 approved these changes Feb 24, 2022

arnupretorius merged commit 1e21726 into develop Feb 24, 2022

arnupretorius deleted the fix/example-madqn-on-pong branch February 24, 2022 12:16
