Hands-On Neural Networks with Keras

Design and create neural networks using deep learning and artificial intelligence principles

Product type Paperback
Published in Mar 2019
Publisher Packt
ISBN-13 9781789536089
Length 462 pages
Edition 1st Edition
Author: Niloy Purkait
Table of Contents (16 chapters)

Preface
1. Section 1: Fundamentals of Neural Networks
2. Overview of Neural Networks
3. A Deeper Dive into Neural Networks
4. Signal Processing - Data Analysis with Neural Networks
5. Section 2: Advanced Neural Network Architectures
6. Convolutional Neural Networks
7. Recurrent Neural Networks
8. Long Short-Term Memory Networks
9. Reinforcement Learning with Deep Q-Networks
10. Section 3: Hybrid Model Architecture
11. Autoencoders
12. Generative Networks
13. Section 4: Road Ahead
14. Contemplating Present and Future Developments
15. Other Books You May Enjoy

Performing a backward pass in Q-Learning

We now have a defined loss metric, which computes the error between the optimal Q-function (derived from the Bellman equation) and the current Q-function at a given time. We can then propagate the prediction errors in our Q-values backward through the model layers as the network interacts with the environment. As before, this is achieved by taking the gradient of the loss function with respect to the model weights and then updating these weights in the opposite direction of the gradient for each learning batch. In this way, we iteratively update the model weights in the direction of the optimal Q-value function. The backpropagation process can be formulated as a change in the model weights (theta).
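
The update described above can be sketched in standard gradient-descent notation. The symbols here are assumptions rather than notation taken from the text: alpha is the learning rate, gamma is the discount factor, and (s, a, r, s') is a single observed transition with a' ranging over the actions available in s'.

```latex
\begin{aligned}
L(\theta) &= \Big( r + \gamma \max_{a'} Q(s', a'; \theta) - Q(s, a; \theta) \Big)^2 \\
\theta &\leftarrow \theta - \alpha \, \nabla_\theta L(\theta)
\end{aligned}
```

In practice, the Bellman target (the first two terms inside the parentheses) is usually treated as a constant when the gradient is taken, so only the current estimate Q(s, a; theta) contributes to the gradient.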

Eventually, once the model has seen enough state-action pairs, it will have backpropagated its errors sufficiently and learn...
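
To make the repeated update concrete, here is a minimal sketch in plain Python, not the book's Keras implementation. It assumes a hypothetical linear Q-function with one weight vector per action, a half-squared-error loss, and illustrative values for the learning rate (`alpha`) and discount factor (`gamma`). The names `q_values` and `q_learning_step` are invented for this example.

```python
def q_values(theta, state):
    """Linear Q-function (an assumption): Q(s, a) = theta[a] . s for each action a."""
    return [sum(w * x for w, x in zip(weights, state)) for weights in theta]


def q_learning_step(theta, transition, alpha=0.1, gamma=0.9):
    """Apply one gradient-descent update to theta in place; return the TD error."""
    state, action, reward, next_state, done = transition

    # Bellman target, treated as a constant when taking the gradient.
    if done:
        target = reward
    else:
        target = reward + gamma * max(q_values(theta, next_state))

    # Prediction error between the target and the current Q-value estimate.
    td_error = target - q_values(theta, state)[action]

    # Loss = 0.5 * td_error**2, so d(loss)/d(theta[action]) = -td_error * state.
    grad = [-td_error * x for x in state]

    # Move the weights in the opposite direction of the gradient.
    theta[action] = [w - alpha * g for w, g in zip(theta[action], grad)]
    return td_error


# Two actions, two state features, all weights initialised to zero.
theta = [[0.0, 0.0], [0.0, 0.0]]

# One hypothetical transition: (state, action, reward, next_state, done).
transition = ([1.0, 0.5], 1, 1.0, [0.0, 1.0], False)

# Replaying the same transition shows the prediction error shrinking.
errors = [q_learning_step(theta, transition) for _ in range(50)]
print(errors[0], errors[-1])
```

Replaying a single transition is only for illustration. A real agent would draw many different state-action pairs from the environment, and a Keras model would replace the hand-written gradient with its own optimizer step.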
