You're reading from TensorFlow Machine Learning Projects Build 13 real-world projects with advanced numerical computations using the Python ecosystem

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781789132212

Length 322 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Machine Learning

Authors (2):

Ankit Jain

Dr. Amita Kapoor

Table of Contents (17) Chapters

Preface

1. Overview of TensorFlow and Machine Learning FREE CHAPTER

2. Using Machine Learning to Detect Exoplanets in Outer Space

3. Sentiment Analysis in Your Browser Using TensorFlow.js

4. Digit Classification Using TensorFlow Lite

5. Speech to Text and Topic Extraction Using NLP

6. Predicting Stock Prices using Gaussian Process Regression

7. Credit Card Fraud Detection using Autoencoders

8. Generating Uncertainty in Traffic Signs Classifier Using Bayesian Neural Networks

9. Generating Matching Shoe Bags from Shoe Images Using DiscoGANs

10. Classifying Clothing Images using Capsule Networks

11. Making Quality Product Recommendations Using TensorFlow

12. Object Detection at a Large Scale with TensorFlow

13. Generating Book Scripts Using LSTMs

14. Playing Pacman Using Deep Reinforcement Learning

15. What is Next?

16. Other Books You May Enjoy

Applying DQN to a game

So far, we have picked actions at random and applied them to the game. Now, let's use a DQN to select the actions for playing the Pacman game.

We define the policy function, policy_q_nn, which uses the q_nn network, as follows:

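import numpy as np

# q_nn (the Q-network) and explore_rate are globals defined elsewhere in the chapter
def policy_q_nn(obs, env):
    # Exploration strategy - Select a random action
    if np.random.random() < explore_rate:
        action = env.action_space.sample()
    # Exploitation strategy - Select the action with the highest q
    else:
        # predict expects a batch, so wrap the single observation in an array
        action = np.argmax(q_nn.predict(np.array([obs])))
    return action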
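
To see the explore/exploit split in isolation, here is a minimal, self-contained sketch that runs without a game environment. It is not the book's code: StubQNetwork, epsilon_greedy, n_actions, and rng are hypothetical names introduced for illustration, and the stub simply returns fixed Q-values where a trained q_nn would return predictions.

```python
import numpy as np


class StubQNetwork:
    """Hypothetical stand-in for q_nn: returns the same Q-values for every observation."""

    def __init__(self, q_values):
        self.q_values = np.asarray(q_values, dtype=float)

    def predict(self, batch):
        # One row of Q-values per observation in the batch
        return np.tile(self.q_values, (len(batch), 1))


def epsilon_greedy(obs, model, n_actions, explore_rate, rng):
    # Exploration: with probability explore_rate, pick a uniformly random action
    if rng.random() < explore_rate:
        return int(rng.integers(n_actions))
    # Exploitation: pick the action with the highest predicted Q-value
    q = model.predict(np.array([obs]))
    return int(np.argmax(q[0]))


rng = np.random.default_rng(0)
model = StubQNetwork([0.1, 0.7, 0.2, 0.0])
obs = np.zeros(4)

# explore_rate=0.0 always exploits; explore_rate=1.0 always explores
greedy = epsilon_greedy(obs, model, 4, 0.0, rng)
random_actions = [epsilon_greedy(obs, model, 4, 1.0, rng) for _ in range(20)]
```

With explore_rate set to 0.0 the function always returns the index of the largest Q-value; with 1.0 it ignores the network entirely. Intermediate values trade off between the two.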

Next, we modify the episode function to calculate the q_values and train the neural network on experience sampled from the buffer. This is shown in the following code:

def episode(env, policy, r_max=0, t_max=0):

    # create the empty list to contain game memory
    #memory = deque(maxlen=1000)
    
    # observe initial state
  ...
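
The listing above is truncated in this preview, so the following is only a general sketch of how Q-value targets are commonly computed from a sampled experience buffer in DQN. It is not the book's episode function. The names q_targets, toy_predict, and discount, and the (obs, action, reward, next_obs, done) tuple layout, are assumptions made for illustration.

```python
import random
from collections import deque

import numpy as np


def q_targets(batch, predict, discount=0.99):
    """Build training targets from (obs, action, reward, next_obs, done) tuples."""
    obs = np.array([b[0] for b in batch])
    next_obs = np.array([b[3] for b in batch])
    # Start from the current predictions so only the taken action's value changes
    targets = predict(obs).copy()
    next_max = predict(next_obs).max(axis=1)
    for i, (_, action, reward, _, done) in enumerate(batch):
        # Terminal transitions have no future reward to bootstrap from
        targets[i, action] = reward if done else reward + discount * next_max[i]
    return obs, targets


def toy_predict(batch):
    # Hypothetical Q-function with two actions: [sum(obs), 2 * sum(obs)]
    s = batch.sum(axis=1)
    return np.stack([s, 2 * s], axis=1).astype(float)


# Bounded buffer of past transitions; the oldest entries are dropped when full
memory = deque(maxlen=1000)
memory.append((np.array([1.0, 0.0]), 0, 1.0, np.array([0.0, 2.0]), False))
memory.append((np.array([0.0, 1.0]), 1, 5.0, np.array([3.0, 3.0]), True))

# Sample a mini-batch and compute the targets the network would be trained on
batch = random.sample(list(memory), k=2)
x, y = q_targets(batch, toy_predict, discount=0.5)
```

With a network that exposes a Keras-style interface, x and y would typically be passed to its training call next. Whether the chapter does exactly this is not visible in the truncated listing.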

Authors (2)

Ankit Jain

Ankit Jain currently works as a senior research scientist at Uber AI Labs, the machine learning research arm of Uber. His work primarily involves the application of deep learning methods to a variety of Uber's problems, ranging from forecasting and food delivery to self-driving cars. Previously, he worked in a variety of data science roles at Bank of America, Facebook, and other start-ups. He has been a featured speaker at many of the top AI conferences and universities, including UC Berkeley, the O'Reilly AI conference, and others. He has a keen interest in teaching and has mentored over 500 students in AI through various start-ups and bootcamps. He completed his MS at UC Berkeley and his BS at IIT Bombay (India).

Amita Kapoor

Amita Kapoor, a seasoned expert in Artificial Intelligence, serves as the Chief Artificial Intelligence Officer at TIPZ AI, bringing over 25 years of experience in AI, data science, and neuroscience. Her consultancy, NePeur, stands testament to her leadership in applying AI across diverse industries, enhancing operational efficiency and business intelligence. Amita is also a devoted board member of Neuromatch Academy, where she plays a crucial role in making neuroscience and deep learning education accessible to all. Holding a PhD from the University of Delhi, she has dedicated her career to education, authoring numerous articles and papers, and creating impactful online classes. Her significant contributions extend to pioneering projects in intelligent vehicle fleet management, home surveillance through AI-powered face detection, and robust data anonymization solutions. Connect with Amita on LinkedIn.
