Independent Q-learning for BPD and Congestion Game #51

hiazmani · 2024-03-20T16:15:17Z

No description provided.

…f results with original paper

…Discrete

ffelten

Mostly LGTM. Just a few comments. We might have to harmonize the plot style for the paper but that is quite simple.

ffelten · 2024-03-21T07:47:55Z

momaland/learning/iql/iql.py

+            cap_min, cap_max, mix_min, mix_max = self.g_cap_min, self.g_cap_max, self.g_mix_min, self.g_mix_max
+
+        # Normalize the rewards
+        cap_norm = (reward[0] - cap_min) / (cap_max - cap_min)


I believe we have a NormalizeReward wrapper made for this. Is it different?

The NormalizeReward wrapper normalizes the rewards that the agents receive directly. In the BDP environment, agents can be rewarded according to two reward schemes (local/global), but the reported results in the graphs are always using the (normalized) 'global' reward scheme regardless of what the agents receive. This method is primarily used to compute these normalized global rewards.

momaland/learning/iql/iql.py

hiazmani and others added 8 commits January 24, 2024 14:47

IQL implementation for congestion game and beach domain.

7d73f54

Add wrapper for MO Beach Domain environment to allow for comparison o…

d60dfb7

…f results with original paper

Change type of observations of stateless congestion game from Box to …

d2bb12b

…Discrete

PF for PBD

eabc8ba

Fix bug where agents can not move left in BPD

187f3f7

IQL evaluation code for BPD

2152bf3

Move IQL files to separate directory

99f595f

Merge branch 'main' into iql

63c955a

hiazmani requested review from rradules and ffelten March 20, 2024 16:15

ffelten approved these changes Mar 21, 2024

View reviewed changes

rradules approved these changes Mar 21, 2024

View reviewed changes

hiazmani added 3 commits March 25, 2024 14:12

Move BPD wrapper to new file, fix paths of results and imports

6da8966

Align utils.py with changes from main

d8b1dc8

Fix pre-commit

47306a5

rradules merged commit 361e2e8 into main Mar 25, 2024
10 checks passed

rradules deleted the iql branch March 25, 2024 17:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Independent Q-learning for BPD and Congestion Game #51

Independent Q-learning for BPD and Congestion Game #51

hiazmani commented Mar 20, 2024

ffelten left a comment

ffelten Mar 21, 2024

hiazmani Mar 25, 2024

Independent Q-learning for BPD and Congestion Game #51

Independent Q-learning for BPD and Congestion Game #51

Conversation

hiazmani commented Mar 20, 2024

ffelten left a comment

Choose a reason for hiding this comment

ffelten Mar 21, 2024

Choose a reason for hiding this comment

hiazmani Mar 25, 2024

Choose a reason for hiding this comment