The FrozenLake environment is a 4 × 4 grid made up of four tile types: Start (S, a safe tile), Frozen (F), Hole (H), and Goal (G). The agent controls a character in this grid world and has four possible moves: up, down, left, and right. Some tiles are walkable, while others drop the agent into the water. The agent moves around the grid until it reaches the goal or falls into a hole. If it falls into a hole, it receives a reward of 0 and has to start again from the beginning. The movement direction is also uncertain and depends only partially on the chosen direction, so the agent does not always move the way it intended. The agent is rewarded for finding a walkable path to a goal tile. This process repeats, with the agent learning from its mistakes, until it eventually reaches the goal.
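The rules above can be sketched in a few lines of plain Python. This is only an illustrative toy, not the environment's actual implementation: the `LAYOUT` grid, the `step` and `find_start` helpers, the `slip_prob` parameter, the slip model (sliding to a perpendicular direction), and the reward of 1 for reaching the goal are all assumptions made for the example.

```python
import random

# Hypothetical 4 x 4 layout, chosen for illustration only.
LAYOUT = ["SFFF", "FHFF", "FFFH", "HFFG"]

# Row/column offsets for the four possible moves.
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}

# Assumed slip model: the agent may slide to a perpendicular direction.
SLIPS = {
    "up": ("left", "right"),
    "down": ("left", "right"),
    "left": ("up", "down"),
    "right": ("up", "down"),
}


def find_start(layout):
    """Return the (row, col) position of the S tile."""
    for r, row in enumerate(layout):
        if "S" in row:
            return (r, row.index("S"))
    raise ValueError("layout has no start tile")


def step(layout, pos, action, rng, slip_prob=0.0):
    """Apply one move and return (new_pos, reward, done)."""
    # With probability slip_prob the chosen direction is replaced
    # by one of the two perpendicular directions.
    if rng.random() < slip_prob:
        action = rng.choice(SLIPS[action])
    dr, dc = MOVES[action]
    # Moves that would leave the grid keep the agent at the edge.
    row = min(max(pos[0] + dr, 0), len(layout) - 1)
    col = min(max(pos[1] + dc, 0), len(layout[0]) - 1)
    tile = layout[row][col]
    if tile == "G":
        return (row, col), 1.0, True   # goal reached (assumed reward of 1)
    if tile == "H":
        return (row, col), 0.0, True   # fell into a hole, episode ends
    return (row, col), 0.0, False      # walkable tile, keep going
```

With `slip_prob=0.0` the toy is deterministic, which makes it easy to check by hand. Raising `slip_prob` shows why a path that hugs the holes becomes risky once movement is uncertain.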
The surface is described using a grid like the following:
- SFFF (S: starting point, safe) ...