For an RL agent to make decisions, it must learn the Q value function, which can be learned iteratively via Bellman's equation. When the agent starts to interact with the environment, it begins at a random initial state s(0) with randomly initialized Q values for every state-action pair. The agent's early actions are also largely random, since it has no meaningful Q values yet to make informed decisions. For each action taken, the environment returns a reward, based on which the agent starts to build the Q value table and improves it over time.
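To make this loop concrete, here is a minimal tabular Q-learning sketch of the process described above. The environment interface (env.reset(), env.step()) and the hyperparameters alpha (learning rate), gamma (discount factor), and epsilon (exploration rate) are illustrative assumptions, not taken from the text.

import numpy as np

def q_learning(env, n_states, n_actions, episodes=500,
               alpha=0.1, gamma=0.99, epsilon=0.1):
    # Start with arbitrary (here zero) Q values for every state-action pair.
    Q = np.zeros((n_states, n_actions))
    for _ in range(episodes):
        s = env.reset()                      # initial state s(0)
        done = False
        while not done:
            # Early on, actions are largely random; as the table improves,
            # the greedy choice becomes more informed.
            if np.random.rand() < epsilon:
                a = np.random.randint(n_actions)
            else:
                a = int(np.argmax(Q[s]))
            s_next, r, done = env.step(a)    # environment returns a reward
            # Bellman update: move Q(s, a) toward r + gamma * max_a' Q(s', a')
            target = r + (0.0 if done else gamma * np.max(Q[s_next]))
            Q[s, a] += alpha * (target - Q[s, a])
            s = s_next
    return Q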
At any given state s(t) at iteration t, the agent takes the action a(t) that maximizes its long-term reward. The Q table holds these long-term reward estimates, and hence the chosen a(t) is based on the following heuristic:
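In standard Q-learning this heuristic is greedy selection over the current table (an epsilon-greedy variant adds occasional random actions for exploration):

a(t) = argmax over a of Q_t(s(t), a)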
The Q value table is also indexed by iteration...