Reinforcement learning tasks
Reinforcement learning consists of training an agent, that just needs occasional feedback from the environment, to learn to get the best feedback at the end. The agent performs actions, modifying the state of the environment.
The actions to navigate in the environment can be represented as directed edges from one state to another state as a graph, as shown in the following figure:
A robot, working in a real environment (walking robots, control of motors, and so on) or a virtual environment (video game, online games, chat room, and so on) has to decide which movements (or keys to strike) to receive the maximum reward: