Typically, Q-Learning is taught using a grid problem such as the one we looked at it in the previous section. Here, though, we want something a little more complex and abstract that also allows you, the reader, to build on it and explore it further. We have put together an interesting example where we represent our bandits as rooms or objects with a number of exit options. This example could also very easily represent a dungeon or another connected room structure that you need to navigate an agent through. Follow these steps to get started on building the connected agents exercise:
- From the menu, select Assets -> Import Package -> Custom Package..., then navigate to the book's downloaded source code and import the Chapter_2_Connected_Bandits_unitypackage. This is the example, which has been fully constructed already for you. Apologies...