In this chapter, TD learning algorithms were introduced. TD learning methods are based on reducing the difference between the estimates made by the agent at successive time steps. The SARSA algorithm implements an on-policy TD method, in which the action-value function Q is updated based on the result of the transition from state s = s(t) to state s' = s(t+1) via the action a(t), chosen according to the current policy π(s, a). Q-learning, unlike SARSA, is off-policy: while the behavior policy used to select actions is improved according to the estimates Q(s, a), the value-function update follows a strictly greedy target policy: given the next state, the action considered is always the one that maximizes the value, max_a Q(s', a).
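As an illustration of the distinction just described, the following minimal sketch contrasts the two update targets for a single observed transition. It is not code from the chapter: the table sizes, learning rate alpha, discount gamma, the epsilon_greedy helper, and the sample transition (s, a, r, s') are all illustrative assumptions.

```python
import numpy as np

# Illustrative sketch: tabular SARSA vs. Q-learning updates on one transition.
# n_states, n_actions, alpha (learning rate) and gamma (discount) are assumed values.
n_states, n_actions = 16, 4
alpha, gamma = 0.1, 0.99
rng = np.random.default_rng(0)

def epsilon_greedy(Q, s, epsilon=0.1):
    """Pick a random action with probability epsilon, otherwise the greedy one."""
    if rng.random() < epsilon:
        return int(rng.integers(n_actions))
    return int(np.argmax(Q[s]))

Q_sarsa = np.zeros((n_states, n_actions))
Q_qlearn = np.zeros((n_states, n_actions))

# Suppose one transition (s, a, r, s') has been observed.
s, a, r, s_next = 0, 1, 1.0, 2

# SARSA (on-policy): the target uses the action a' actually chosen by the policy in s'.
a_next = epsilon_greedy(Q_sarsa, s_next)
td_target_sarsa = r + gamma * Q_sarsa[s_next, a_next]
Q_sarsa[s, a] += alpha * (td_target_sarsa - Q_sarsa[s, a])

# Q-learning (off-policy): the target uses the greedy value max_a Q(s', a),
# regardless of which action the behavior policy will actually take next.
td_target_qlearn = r + gamma * np.max(Q_qlearn[s_next])
Q_qlearn[s, a] += alpha * (td_target_qlearn - Q_qlearn[s, a])
```

The only difference between the two updates is the bootstrap term: SARSA evaluates the action the policy actually takes in s', whereas Q-learning evaluates the greedy action, which is what makes it off-policy.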
Then, the basics of graph theory were addressed: the adjacency matrix...