Packt+ | Advance your knowledge in tech

You're reading from Python: Advanced Guide to Artificial Intelligence Expert machine learning systems and intelligent agents using Python

Product type Course

Published in Dec 2018

Publisher Packt

ISBN-13 9781789957211

Length 764 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Artificial Intelligence

Authors (2):

Giuseppe Bonaccorso

Rajalingappaa Shanmugamani

View More author details

Table of Contents (31) Chapters

Title Page

About Packt

Contributors

Preface

1. Machine Learning Model Fundamentals FREE CHAPTER

2. Introduction to Semi-Supervised Learning

3. Graph-Based Semi-Supervised Learning

4. Bayesian Networks and Hidden Markov Models

5. EM Algorithm and Applications

6. Hebbian Learning and Self-Organizing Maps

7. Clustering Algorithms

8. Advanced Neural Models

9. Classical Machine Learning with TensorFlow

10. Neural Networks and MLP with TensorFlow and Keras

11. RNN with TensorFlow and Keras

12. CNN with TensorFlow and Keras

13. Autoencoder with TensorFlow and Keras

14. TensorFlow Models in Production with TF Serving

15. Deep Reinforcement Learning

16. Generative Adversarial Networks

17. Distributed Models with TensorFlow Clusters

18. Debugging TensorFlow Models

19. Tensor Processing Units

20. Getting Started

21. Image Classification

22. Image Retrieval

23. Object Detection

24. Semantic Segmentation

25. Similarity Learning

1. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Reinforcement learning 101

Reinforcement learning is described by an agent getting inputs of the observation and reward from the previous time-step and producing output as an action with the goal of maximizing cumulative rewards.

The agent has a policy, value function, and model:

The algorithm used by the agent to pick the next action is known as the policy. In the previous section, we wrote a policy that would take a set of parameters theta and would return the next action based on the multiplication between the observation and the parameters. The policy is represented by the following equation:
,S is set of states and A is set of actions.
A policy is deterministic or stochastic.
- A deterministic policy returns the same action for the same state in each run:
- A stochastic policy returns the different probabilities for the same action for the same state in each run:
The value function predicts the amount of long-term reward based on the selected action in the current state. Thus, the value function...

The rest of the chapter is locked

You're reading from Python: Advanced Guide to Artificial Intelligence Expert machine learning systems and intelligent agents using Python

Table of Contents (31) Chapters

Reinforcement learning 101

Unlock this book and the full library FREE for 7 days

Authors (2)

Personalised recommendations for you