Feat/maddpg obs optim #459

AsadJeewa · 2022-03-22T21:31:32Z

What?

Temp branch for maddpg experiments

Why?

Benchmark to confirm updating observation network with only critic is best

How?

Benchmark with only critic gradients and previous (both actor and critic) and confirm no regression

AsadJeewa · 2022-04-20T09:09:30Z

By design, DDPG only uses the Critic loss to update the shared observation networks hence the results of previous investigations were inconclusive Neptune Results This has opened up 2 further investigations #481 & #482

arnupretorius

Looks good to me. Thanks @AsadJeewa 👍

DriesSmit

Looks good, thanks @AsadJeewa 🔥

KaleabTessera

Thanks @AsadJeewa !!! 🔥

Just requested a minor change.

mava/systems/tf/maddpg/networks.py

…deepai/Mava into feat/maddpg_obs_optim

AsadJeewa added 2 commits March 22, 2022 23:29

temp obs gradient var

28c3217

temp obs gradient var

d6595f7

AsadJeewa requested review from arnupretorius, KaleabTessera, DriesSmit, mmorris44, RuanJohn and jcformanek as code owners March 22, 2022 21:31

pull-request-size bot added the size/L label Mar 22, 2022

Merge branch 'develop' into feat/maddpg_obs_optim

f8810b1

AsadJeewa marked this pull request as draft March 22, 2022 21:33

AsadJeewa added the benchmark in progress label Mar 22, 2022

temp obs gradient var

ee366e7

AsadJeewa added benchmark in progress and removed benchmark in progress labels Mar 23, 2022

AsadJeewa added 2 commits March 23, 2022 16:30

feat: configurable obs network ddpg d4pg

0c49d7a

Merge branch 'develop' into feat/maddpg_obs_optim

78a4d7c

pull-request-size bot added size/M and removed size/L labels Mar 24, 2022

AsadJeewa added 5 commits March 31, 2022 16:39

debug: d4pg update_obs_once

c9a1902

remove debug code

771fc5a

remove debug code

50eaae7

Merge branch 'develop' into feat/maddpg_obs_optim

723e713

chore: remove debug code, add comment to ddpg

a7343c2

pull-request-size bot added size/S and removed size/M labels Apr 20, 2022

AsadJeewa marked this pull request as ready for review April 20, 2022 08:55

AsadJeewa added 2 commits April 20, 2022 10:55

Merge branch 'develop' into feat/maddpg_obs_optim

16c0fe1

chore: remove debug variable

67a2d43

arnupretorius previously approved these changes Apr 20, 2022

View reviewed changes

DriesSmit previously approved these changes Apr 21, 2022

View reviewed changes

Merge branch 'develop' into feat/maddpg_obs_optim

34c85b3

KaleabTessera suggested changes Apr 21, 2022

View reviewed changes

mava/systems/tf/maddpg/networks.py Outdated Show resolved Hide resolved

AsadJeewa added 2 commits April 21, 2022 14:43

chore: remove redundant code

80dd1df

fix: Merge branch 'feat/maddpg_obs_optim' of https://github.com/insta…

7c04224

…deepai/Mava into feat/maddpg_obs_optim

AsadJeewa dismissed stale reviews from DriesSmit and arnupretorius via 7c04224 April 21, 2022 12:49

chore: remove redundant code

897a08b

arnupretorius approved these changes Apr 21, 2022

View reviewed changes

KaleabTessera approved these changes Apr 21, 2022

View reviewed changes

DriesSmit approved these changes Apr 22, 2022

View reviewed changes

AsadJeewa merged commit dc7b93c into develop Apr 22, 2022

AsadJeewa deleted the feat/maddpg_obs_optim branch April 22, 2022 07:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/maddpg obs optim #459

Feat/maddpg obs optim #459

AsadJeewa commented Mar 22, 2022

AsadJeewa commented Apr 20, 2022 •

edited

Loading

arnupretorius left a comment

DriesSmit left a comment

KaleabTessera left a comment

Feat/maddpg obs optim #459

Feat/maddpg obs optim #459

Conversation

AsadJeewa commented Mar 22, 2022

What?

Why?

How?

AsadJeewa commented Apr 20, 2022 • edited Loading

arnupretorius left a comment

Choose a reason for hiding this comment

DriesSmit left a comment

Choose a reason for hiding this comment

KaleabTessera left a comment

Choose a reason for hiding this comment

AsadJeewa commented Apr 20, 2022 •

edited

Loading