In this chapter, we have discussed the basic assumptions of reinforcement learning and demonstrated how Q-learning works on a simple MDP problem. Reinforcement learning is a powerful technique for situations where we lack complete knowledge of the environment: it can reach the desired result from a small set of definitions modeled naturally from observations of the environment. Although we still have to design the transition function between states carefully, a deterministic transition is a reasonable assumption for an MDP, as our experiment showed.
Q-learning is a widely used algorithm for solving reinforcement learning problems. It iteratively updates the action-value function according to the Bellman equation, and this process is guaranteed to converge, giving us results consistent with our expectations. While the algorithm itself looks quite simple...
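To make the iterative update concrete, the following is a minimal sketch of tabular Q-learning, assuming a deterministic environment exposed through a hypothetical `step(s, a)` callback that returns `(next_state, reward, done)`; the function names, the three-state chain MDP, and all hyperparameter values are illustrative, not taken from the chapter's experiment.

```python
import random

def q_learning(num_states, num_actions, step, episodes=500,
               alpha=0.5, gamma=0.9, epsilon=0.1, max_steps=100):
    """Tabular Q-learning with an epsilon-greedy policy.

    `step(s, a)` is an assumed environment callback returning
    (next_state, reward, done). Episodes are assumed to start in state 0.
    """
    Q = [[0.0] * num_actions for _ in range(num_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(max_steps):
            # Epsilon-greedy: explore with probability epsilon, else act greedily.
            if random.random() < epsilon:
                a = random.randrange(num_actions)
            else:
                a = max(range(num_actions), key=lambda x: Q[s][x])
            s2, r, done = step(s, a)
            # Bellman update: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            target = r + (0.0 if done else gamma * max(Q[s2]))
            Q[s][a] += alpha * (target - Q[s][a])
            s = s2
            if done:
                break
    return Q

# Toy deterministic MDP: three states in a chain; action 1 moves right,
# action 0 moves left; reaching state 2 yields reward 1 and ends the episode.
def chain_step(s, a):
    s2 = min(s + 1, 2) if a == 1 else max(s - 1, 0)
    reward = 1.0 if s2 == 2 else 0.0
    return s2, reward, s2 == 2

random.seed(0)  # fixed seed so the sketch is reproducible
Q = q_learning(num_states=3, num_actions=2, step=chain_step)
```

With these settings the learned values settle near the Bellman fixed point: moving right from state 1 is worth about 1.0 (the immediate reward), and moving right from state 0 is worth about gamma times that, roughly 0.9.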