Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Hands-On Neural Networks with Keras

You're reading from   Hands-On Neural Networks with Keras Design and create neural networks using deep learning and artificial intelligence principles

Arrow left icon
Product type Paperback
Published in Mar 2019
Publisher Packt
ISBN-13 9781789536089
Length 462 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Niloy Purkait Niloy Purkait
Author Profile Icon Niloy Purkait
Niloy Purkait
Arrow right icon
View More author details
Toc

Table of Contents (16) Chapters Close

Preface 1. Section 1: Fundamentals of Neural Networks FREE CHAPTER
2. Overview of Neural Networks 3. A Deeper Dive into Neural Networks 4. Signal Processing - Data Analysis with Neural Networks 5. Section 2: Advanced Neural Network Architectures
6. Convolutional Neural Networks 7. Recurrent Neural Networks 8. Long Short-Term Memory Networks 9. Reinforcement Learning with Deep Q-Networks 10. Section 3: Hybrid Model Architecture
11. Autoencoders 12. Generative Networks 13. Section 4: Road Ahead
14. Contemplating Present and Future Developments 15. Other Books You May Enjoy

Markov decision process

In reinforcement learning, we are trying to solve the problem of correlating immediate actions with the delayed rewards they return. These rewards are simply sparse, time-delayed labels that are used to control the agent's behavior. So far, we have discussed how an agent may act upon different states of an environment. We also saw how interactions generate various rewards for the agent and unlock new states of the environment. From here, the agent can resume interacting with the environment until the end of an episode. It's about time we mathematically formalize these relations between an agent and environment for the purpose of goal optimization. To do this, we will call upon a framework proposed by Russian mathematician Andrey Markov, now known as the Markov decision process (MDP).

This mathematical framework allows us to model our agent&apos...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at €18.99/month. Cancel anytime