You're reading from Mastering Reinforcement Learning with Python Build next-generation, self-learning models using reinforcement learning techniques and best practices

Product type Paperback

Published in Dec 2020

Publisher Packt

ISBN-13 9781838644147

Length 544 pages

Edition 1st Edition

Languages

Python

Tools

PyTorch

Concepts

Reinforcement Learning

Author (1):

Enes Bilgin

View More author details

Table of Contents (24) Chapters

Preface

1. Section 1: Reinforcement Learning Foundations

2. Chapter 1: Introduction to Reinforcement Learning FREE CHAPTER

3. Chapter 2: Multi-Armed Bandits

4. Chapter 3: Contextual Bandits

5. Chapter 4: Makings of a Markov Decision Process

6. Chapter 5: Solving the Reinforcement Learning Problem

7. Section 2: Deep Reinforcement Learning

8. Chapter 6: Deep Q-Learning at Scale

9. Chapter 7: Policy-Based Methods

10. Chapter 8: Model-Based Methods

11. Chapter 9: Multi-Agent Reinforcement Learning

12. Section 3: Advanced Topics in RL

13. Chapter 10: Introducing Machine Teaching

14. Chapter 11: Achieving Generalization and Overcoming Partial Observability

15. Chapter 12: Meta-Reinforcement Learning

16. Chapter 13: Exploring Advanced Topics

17. Section 4: Applications of RL

18. Chapter 14: Solving Robot Learning

19. Chapter 15: Supply Chain Management

20. Chapter 16: Personalization, Marketing, and Finance

21. Chapter 17: Smart City and Cybersecurity

22. Chapter 18: Challenges and Future Directions in Reinforcement Learning

23. Other Books You May Enjoy

Leave a review - let other readers know what you think

Training tic-tac-toe agents through self-play

In this section, we will provide you with some key explanations of the code in our Github repo to get a better grasp of MARL with RLlib while training tic-tac-toe agents on a 3x3 board. For the full code, you can refer to https://github.com/PacktPublishing/Mastering-Reinforcement-Learning-with-Python.

Figure 9.5 – A 3x3 tic-tac-toe. For the image credit and to learn how it is played, see https://en.wikipedia.org/wiki/Tic-tac-toe

Let's started with designing the multi-agent environment.

Designing the multi-agent tic-tac-toe environment

In the game, we have two agents, X and O, playing the game. We will train four policies for the agents to pull their actions from, and each policy can play either an X or O. We construct the environment class as follows:

Chapter09/tic_tac_toe.py

class TicTacToe(MultiAgentEnv):
    def __init__(self, config=None):
   &...

The rest of the chapter is locked

You're reading from Mastering Reinforcement Learning with Python Build next-generation, self-learning models using reinforcement learning techniques and best practices

Table of Contents (24) Chapters

Training tic-tac-toe agents through self-play

Designing the multi-agent tic-tac-toe environment

Authors (1)

Other recommended products

Personalised recommendations for you

You're reading from Mastering Reinforcement Learning with Python Build next-generation, self-learning models using reinforcement learning techniques and best practices

Table of Contents (24) Chapters Close

Training tic-tac-toe agents through self-play

Designing the multi-agent tic-tac-toe environment

Unlock this book and the full library FREE for 7 days

Authors (1)

Other recommended products

Personalised recommendations for you

Table of Contents (24) Chapters