Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Information on maximum reward for roboschool environments? #186

Open
elitalobo opened this issue Apr 20, 2019 · 2 comments
Open

Information on maximum reward for roboschool environments? #186

elitalobo opened this issue Apr 20, 2019 · 2 comments

Comments

@elitalobo
Copy link

No description provided.

@elitalobo
Copy link
Author

Unable to find any documentation containing the baseline score or maximum reward details for roboschool environments.

@elitalobo elitalobo changed the title Can someone please tell me where can I find information on maximum reward for roboschool environments? Information on maximum reward for roboschool environments? Apr 20, 2019
@wpumacay
Copy link

Hi @elitalobo

I'm not sure if this is helpful, but you could take a look at the gym_forward_walker.py file. Inside the step method (line 119) the reward is computed from various components that make the whole reward function (+1 per alive-step, progress, actuation cost, ...). You could take a look at how they compute the rewards and get an idea of the actual ranges (I think the reward functions are not documented).

Perhaps what you are looking for is a standard range in which the rewards are going to be, like in the environments from dm_control. In their paper they specify that the rewards belong to a range of [0,1], and in their code they make the required scaling to make sure these ranges are in that range. For example, in the walker.py, they scale the components of the reward function to obtain a standard range of [0,1].

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants