We could, of course, implement TD(λ) using the tabular online method, which we haven't covered yet, or with Q-learning. However, since this is a chapter on SARSA, it only makes sense that we continue with that theme throughout, layering eligibility traces on top of SARSA. Open Chapter_5_4.py and follow the exercise; the short sketch below shows the kind of update we are building toward:
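Before stepping through the script, here is a minimal sketch of a tabular SARSA(λ) update with accumulating eligibility traces. The names used here (`Q`, `E`, `update_sarsa_lambda`, and the toy state/action sizes) are illustrative only and are not taken from Chapter_5_4.py; the chapter's code applies the same idea to the discretized MountainCar state:

```python
# Minimal sketch of the accumulating-trace SARSA(lambda) update.
# All names and sizes here are illustrative, not from Chapter_5_4.py.
import numpy as np

n_states, n_actions = 4, 2
Q = np.zeros((n_states, n_actions))   # action-value estimates
E = np.zeros((n_states, n_actions))   # eligibility traces, one per state-action pair

alpha, gamma, lam = 0.3, 0.99, 0.8    # learning rate, discount factor, trace decay

def update_sarsa_lambda(s, a, r, s_next, a_next, done):
    """One SARSA(lambda) backup: spread the TD error over every recently
    visited state-action pair in proportion to its eligibility trace."""
    target = r if done else r + gamma * Q[s_next, a_next]
    delta = target - Q[s, a]       # one-step TD error
    E[s, a] += 1.0                 # accumulate the trace for the current pair
    Q[:] = Q + alpha * delta * E   # credit all traced pairs at once
    E[:] = gamma * lam * E         # decay every trace toward zero
    if done:
        E[:] = 0.0                 # clear traces at the end of an episode
```

Note that with λ = 0 the traces vanish after a single step and this reduces to ordinary one-step SARSA; larger values of λ spread credit further back along the trajectory.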
- The code is quite similar to our previous examples, but let's review the full source code, as follows:
import gym
import math
from copy import deepcopy
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

env = gym.make('MountainCar-v0')
Q_table = np.zeros((65, 65, 3))  # 65 x 65 discretized states, 3 discrete actions
alpha = 0.3         # learning rate
buckets = [65, 65]  # number of discrete intervals per observation dimension (position, velocity)
gamma = 0.99        # discount factor
rewards = []        # per-episode rewards, collected for plotting later
episodes = 2000     # number of training episodes
lambdaa = 0.8       # trace-decay parameter (lambda) for the eligibility traces
def to_discrete_states(observation):
    # Map a continuous observation (position, velocity) onto discrete bucket indices
    interval = [0 for i in range(len(observation))]
    max_range = [1.2, 0.07]  # offsets used to shift each observation dimension to be non-negative
    for i in range(len(observation)):
        data = observation[i]
        inter = int(math.floor((data + max_range[i])...