Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/recurrent and multiple trainer MAPPO #326

Merged
merged 72 commits into from
Apr 21, 2022

Conversation

DriesSmit
Copy link
Contributor

What?

Add recurrent training capabilities to MAPPO. Migrate MAPPO to now use the centralised variable source. This allows for training using multiple trainers.

Why?

This update allows for the use of MAPPO in the recurrent training setting and with multiple trainers.

How?

Adapted the MAPPO trainer to work in the recurrent setting. Added a recurrent executor to MAPPO. Migrated the old MAPPO code to use the new scaling version of Mava.

Extra

.

@DriesSmit DriesSmit self-assigned this Oct 31, 2021
@DriesSmit DriesSmit added the enhancement New feature or request label Oct 31, 2021
Copy link
Contributor

@KaleabTessera KaleabTessera left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks so much @DriesSmit 👐 This is really great! Just see my minor comments.

mava/systems/tf/maddpg/execution.py Show resolved Hide resolved
mava/systems/tf/maddpg/execution.py Show resolved Hide resolved
mava/systems/tf/mappo/builder.py Outdated Show resolved Hide resolved
mava/systems/tf/mappo/execution.py Outdated Show resolved Hide resolved
mava/systems/tf/mappo/execution.py Outdated Show resolved Hide resolved
mava/systems/tf/mappo/system.py Outdated Show resolved Hide resolved
mava/systems/tf/mappo/training.py Outdated Show resolved Hide resolved
mava/systems/tf/mappo/training.py Outdated Show resolved Hide resolved
mava/systems/tf/mappo/training.py Outdated Show resolved Hide resolved
mava/systems/tf/mappo/training.py Outdated Show resolved Hide resolved
arnupretorius
arnupretorius previously approved these changes Apr 20, 2022
Copy link
Collaborator

@arnupretorius arnupretorius left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @DriesSmit! 👍 Did a quick smoke review. Left minor comments. Happy with the benchmarking on this and will be useful for comparing with Jax systems.

mava/components/tf/architectures/centralised.py Outdated Show resolved Hide resolved
mava/components/tf/architectures/state_based.py Outdated Show resolved Hide resolved
mava/components/tf/architectures/state_based.py Outdated Show resolved Hide resolved
KaleabTessera
KaleabTessera previously approved these changes Apr 20, 2022
Copy link
Contributor

@KaleabTessera KaleabTessera left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @DriesSmit , great work on getting this in 🔥 👐 🔥

Just a minor commont on adding doc strings for fix_sampler.

mava/utils/sort_utils.py Show resolved Hide resolved
@DriesSmit DriesSmit dismissed stale reviews from KaleabTessera and arnupretorius via 60eb054 April 21, 2022 09:20
Copy link
Collaborator

@arnupretorius arnupretorius left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks Dries! 🔥

@DriesSmit DriesSmit merged commit 5877138 into develop Apr 21, 2022
@DriesSmit DriesSmit deleted the feature/recurrent-mappo branch April 21, 2022 09:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants