Exercises
As we've now completed the task, let's try our hand at two appropriate exercises based on the content that we've covered.
A simple RL task: Go to https://github.com/openai/gym. Once there, install the Gym environment and train an agent to solve the "Cartpole" problem.
A multi-agent RL task: Go to https://github.com/crazymuse/snakegame-numpy. This is a Gym environment that lets you play multiple agents in a "Snake" game. Experiment with different strategies. Can you create an agent that fools the other agent? What is the emergent behavior of the snakes?