step function effect in fourrooms doesn't fit the description #11

Mingpan · 2018-04-17T13:13:13Z

In the description it reads:
The agent can perform one of four actions,
up, down, left or right, which have a stochastic effect. With probability 2/3, the actions
cause the agent to move one cell in the corresponding direction, and with probability 1/3,
the agent moves instead in one of the other three directions, each with 1/9 probability. In
either case, if the movement would take the agent into a wall then the agent remains in the
same cell.

However, if I understand the code correctly, the self.currentcell is set to be the next cell before the random process happens. Therefore the agent will land in next cell with prob 2/3, which is correct, but with prob 1/3 it will land in the neighboring cells of the nextcell (equivalent to taking an additional step). That doesn't seem to fit the description above and in the paper.

tsaket · 2018-08-27T20:40:18Z

+1

ankeshanand mentioned this issue Jun 24, 2018

Bug in the FourRooms env implementation openai/mlsh#15

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

step function effect in fourrooms doesn't fit the description #11

step function effect in fourrooms doesn't fit the description #11

Mingpan commented Apr 17, 2018

tsaket commented Aug 27, 2018

step function effect in fourrooms doesn't fit the description #11

step function effect in fourrooms doesn't fit the description #11

Comments

Mingpan commented Apr 17, 2018

tsaket commented Aug 27, 2018