Information on maximum reward for roboschool environments? #186

elitalobo · 2019-04-20T21:13:41Z

No description provided.

elitalobo · 2019-04-20T21:14:24Z

Unable to find any documentation containing the baseline score or maximum reward details for roboschool environments.

wpumacay · 2019-04-21T01:26:06Z

I'm not sure if this is helpful, but you could take a look at the gym_forward_walker.py file. Inside the step method (line 119) the reward is computed from various components that make the whole reward function (+1 per alive-step, progress, actuation cost, ...). You could take a look at how they compute the rewards and get an idea of the actual ranges (I think the reward functions are not documented).

Perhaps what you are looking for is a standard range in which the rewards are going to be, like in the environments from dm_control. In their paper they specify that the rewards belong to a range of [0,1], and in their code they make the required scaling to make sure these ranges are in that range. For example, in the walker.py, they scale the components of the reward function to obtain a standard range of [0,1].

elitalobo changed the title ~~Can someone please tell me where can I find information on maximum reward for roboschool environments?~~ Information on maximum reward for roboschool environments? Apr 20, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Information on maximum reward for roboschool environments? #186

Information on maximum reward for roboschool environments? #186

elitalobo commented Apr 20, 2019

elitalobo commented Apr 20, 2019

wpumacay commented Apr 21, 2019

Information on maximum reward for roboschool environments? #186

Information on maximum reward for roboschool environments? #186

Comments

elitalobo commented Apr 20, 2019

elitalobo commented Apr 20, 2019

wpumacay commented Apr 21, 2019