Feature/mava reproducibility and PZ wrapper fix #296

KaleabTessera · 2021-08-12T16:17:22Z

What?

Added seeds to networks for better reproducibility.
Fixed PZ wrapper issue.
Some cleanup and doc strings.

Why?

How?

Extra

DriesSmit

Thanks, @KaleabTessera 🔥 The changes look great. Just seem my few comments. I really like all the small code cleanups you did 😄

DriesSmit · 2021-08-13T08:43:39Z

mava/systems/tf/mad4pg/networks.py

-        "critics": critic_networks,
-        "observations": observation_networks,
-    }
+    return make_default_networks_maddpg(


Did we merge MADDPG and MAD4PG's networks? It does seem like a great idea as much of the code is shared 👍 Thanks @KaleabTessera 😄 It might be worth it to just check if both still work as expected.

I will run some benchmarks before merging this in 👍

DriesSmit · 2021-08-13T08:46:25Z

mava/systems/tf/maddpg/networks.py

+            # The multiplexer concatenates the observations/actions.
+            networks.CriticMultiplexer(),
+            networks.LayerNormMLP(
+                list(critic_networks_layer_sizes[key]) + [1],


Should the + [1] be here in the case of MAD4PG? Probably not right?

Combining the networks for MADDPG and MAD4PG looks great thanks @KaleabTessera 🙌

Ah yes, that is a bug, thanks @DriesSmit !

DriesSmit · 2021-08-13T08:57:20Z

mava/components/tf/networks/continuous.py

+from acme.tf.networks.continuous import ResidualLayernormWrapper
+
+
+def get_initialization(seed: Optional[int] = None) -> Any:


What makes all these new components different from Acme? Is it to be able to use a seed?

Yep. For most of this it is the seed and the networks are more configurable, i.e. layernorm and activations are configurable.

arnupretorius

Thanks @KaleabTessera! 💪 Looks great!

Happy to merge once we confirm the reproducibility of runs. 😄

arnupretorius · 2021-08-13T07:43:28Z

mava/components/tf/networks/continuous.py

+    def __init__(
+        self, output_size: int, scale: float = 1e-4, seed: Optional[int] = None
+    ):
+        """[summary]


Summary here :)

Lol, fixed.

arnupretorius

Thanks for the tests and confirmation runs @KaleabTessera 🔥 This is great! 🥳

KaleabTessera added 3 commits August 11, 2021 16:40

feat(networks): Added seed to networks for reproducibility.

581b994

fix(wrappers): Fixed final step in PZ wrapper + added docstrings.

5980cad

tests(wrappers): Wrote tests for empty obs fix.

54dc131

KaleabTessera requested review from arnupretorius and DriesSmit as code owners August 12, 2021 16:17

KaleabTessera self-assigned this Aug 12, 2021

KaleabTessera added the bug Something isn't working label Aug 12, 2021

KaleabTessera changed the title ~~Feature/mava reproducibility and adder fix~~ Feature/mava reproducibility and PZ wrapper fix Aug 12, 2021

DriesSmit reviewed Aug 13, 2021

View reviewed changes

KaleabTessera added 3 commits August 13, 2021 13:59

fix(debug): Fixed seeding debug env.

35ad7dd

fix(networks): Removed dangling [1] layer for mad4pg.

f70a140

chore: Removed unnecessary list.

d4df05a

DriesSmit self-requested a review August 13, 2021 15:04

DriesSmit approved these changes Aug 13, 2021

View reviewed changes

KaleabTessera added 2 commits August 16, 2021 12:53

test(environments): Test seeding envs.

3e40f61

chore(tests): Restructured adder tests.

97f2ce5

arnupretorius approved these changes Aug 16, 2021

View reviewed changes

KaleabTessera added 4 commits August 16, 2021 13:58

tests(networks): Test seeding networks.

b01a2a3

upgrade(pz): Upgraded pz version to latest.

c8c97e0

chore: Inc timeout github actions.

a382196

chore(dependencies): Upgrade lp version.

1f13f4c

arnupretorius approved these changes Aug 17, 2021

View reviewed changes

arnupretorius merged commit 0a952cc into develop Aug 17, 2021

arnupretorius deleted the feature/mava-reproducibility-and-adder-fix branch August 17, 2021 07:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/mava reproducibility and PZ wrapper fix #296

Feature/mava reproducibility and PZ wrapper fix #296

KaleabTessera commented Aug 12, 2021 •

edited

Loading

DriesSmit left a comment

DriesSmit Aug 13, 2021

KaleabTessera Aug 13, 2021

DriesSmit Aug 13, 2021

DriesSmit Aug 13, 2021

KaleabTessera Aug 13, 2021

DriesSmit Aug 13, 2021

KaleabTessera Aug 13, 2021

arnupretorius left a comment

arnupretorius Aug 13, 2021

KaleabTessera Aug 16, 2021

arnupretorius left a comment

		from acme.tf.networks.continuous import ResidualLayernormWrapper


		def get_initialization(seed: Optional[int] = None) -> Any:

Feature/mava reproducibility and PZ wrapper fix #296

Feature/mava reproducibility and PZ wrapper fix #296

Conversation

KaleabTessera commented Aug 12, 2021 • edited Loading

What?

Why?

How?

Extra

DriesSmit left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

arnupretorius left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

arnupretorius left a comment

Choose a reason for hiding this comment

KaleabTessera commented Aug 12, 2021 •

edited

Loading