You're reading from Python: Advanced Guide to Artificial Intelligence Expert machine learning systems and intelligent agents using Python

Product type Course

Published in Dec 2018

Publisher Packt

ISBN-13 9781789957211

Length 764 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Artificial Intelligence

Authors (2):

Giuseppe Bonaccorso

Rajalingappaa Shanmugamani

Applying simple policies to a cartpole game

So far, we have picked actions at random and applied them. Now let us apply some logic to picking the action instead of relying on chance. The third element of the observation, obs[2], is the pole angle. If the angle is greater than zero, the pole is tilting to the right, so we move the cart to the right (action 1); otherwise, we move the cart to the left (action 0). Let us look at an example:

We define two policy functions as follows:

def policy_logic(env, obs):
    return 1 if obs[2] > 0 else 0

def policy_random(env, obs):
    return env.action_space.sample()
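As a quick check of the angle rule, the following minimal sketch calls policy_logic on two hand-written observations. The observation values are made up for illustration, and the layout [cart position, cart velocity, pole angle, pole angular velocity] is assumed from the CartPole environment; no environment is created here, so None is passed for env.

```python
def policy_logic(env, obs):
    # obs[2] is the pole angle; a positive value means the pole tilts right
    return 1 if obs[2] > 0 else 0


# Hypothetical observations, not produced by a real environment:
# [cart position, cart velocity, pole angle, pole angular velocity]
tilting_right = [0.0, 0.0, 0.05, 0.0]
tilting_left = [0.0, 0.0, -0.05, 0.0]

# policy_logic never uses env, so None is enough for this check
action_right = policy_logic(None, tilting_right)
action_left = policy_logic(None, tilting_left)

print(action_right, action_left)  # prints: 1 0
```

Note that an angle of exactly zero falls into the else branch, so the policy moves the cart left in that case.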

Next, we define an experiment function that runs for a specified number of episodes; each episode runs until the game is lost, that is, until done is True. We use rewards_max to decide when to break out of the loop, since we do not want the experiment to run forever:

import gym
import numpy as np

def experiment(policy, n_episodes, rewards_max):
    rewards = np.empty(shape=(n_episodes))
    env = gym.make('CartPole-v0')

    for i in range(n_episodes):
        obs = env.reset(...
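The listing above is cut off in the source, so the remainder of the function is not shown. The following is a minimal sketch of how an episode loop of this kind might look; it is not the book's code. It assumes the older gym interface, in which reset() returns only the observation and step() returns (obs, reward, done, info). StubCartPole and run_experiment are hypothetical names introduced here: the stub stands in for the real environment so that the sketch runs without gym, and it simply ends every episode after a fixed number of steps.

```python
import random


class StubCartPole:
    """Hypothetical stand-in for a gym environment (older reset/step API)."""

    def __init__(self, episode_length=10, seed=0):
        self.episode_length = episode_length
        self.rng = random.Random(seed)
        self.steps = 0

    def _observe(self):
        # Four small random values, mimicking a CartPole observation
        return [self.rng.uniform(-0.05, 0.05) for _ in range(4)]

    def reset(self):
        self.steps = 0
        return self._observe()

    def step(self, action):
        self.steps += 1
        # Stub rule: the episode is "lost" after episode_length steps
        done = self.steps >= self.episode_length
        return self._observe(), 1.0, done, {}


def policy_logic(env, obs):
    return 1 if obs[2] > 0 else 0


def run_experiment(env, policy, n_episodes, rewards_max):
    """Run n_episodes and return the total reward of each episode."""
    rewards = []
    for _ in range(n_episodes):
        obs = env.reset()
        done = False
        episode_reward = 0.0
        while not done:
            action = policy(env, obs)
            obs, reward, done, info = env.step(action)
            episode_reward += reward
            # Assumed use of rewards_max: stop once the reward exceeds it
            if episode_reward > rewards_max:
                break
        rewards.append(episode_reward)
    return rewards


capped = run_experiment(StubCartPole(), policy_logic, n_episodes=3, rewards_max=5)
uncapped = run_experiment(StubCartPole(), policy_logic, n_episodes=3, rewards_max=100)
print(capped)
print(uncapped)
```

With the stub, each capped episode stops as soon as its reward exceeds rewards_max, while each uncapped episode runs until done is True. Passing the environment in as an argument is a design choice of this sketch; the listing in the text creates it inside the function with gym.make('CartPole-v0').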
