You're reading from AI Crash Course A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python

Product type Paperback

Published in Nov 2019

Publisher Packt

ISBN-13 9781838645359

Length 360 pages

Edition 1st Edition

Languages

Python

Concepts

Artificial Intelligence

Author (1):

Hadelin de Ponteves

View More author details

Table of Contents (17) Chapters

Preface

1. Welcome to the Robot World FREE CHAPTER

2. Discover Your AI Toolkit

3. Python Fundamentals – Learn How to Code in Python

4. AI Foundation Techniques

5. Your First AI Model – Beware the Bandits!

6. AI for Sales and Advertising – Sell like the Wolf of AI Street

7. Welcome to Q-Learning

8. AI for Logistics – Robots in a Warehouse

9. Going Pro with Artificial Brains – Deep Q-Learning

10. AI for Autonomous Vehicles – Build a Self-Driving Car

11. AI for Business – Minimize Costs with Deep Q-Learning

12. Deep Convolutional Q-Learning

13. AI for Games – Become the Master at Snake

14. Recap and Conclusion

15. Other Books You May Enjoy

Leave a review - let other readers know what you think

16. Index

Deep Q-learning

You've toured the foundations of deep learning, and you already know Q-learning; since deep Q-learning consists of combining Q-learning and deep learning, you're ready to get an intuitive grasp of deep Q-learning and crush it.

Before we start, try to guess some of how this is going to work. I would like you to take a moment and think about how you could integrate Q-learning into an ANN.

First things first, you might have guessed what the inputs and outputs of the neural network are going to be. The input of the artificial neural network is of course going to be the input state, which could be a 1-dimensional vector encoding what is happening in the environment, or an image (like the ones seen by a self-driving car). And the output is going to be the set of Q-values for each action, meaning it is going to be a 1-dimensional vector of several Q-values, one for each action that can be performed. Then, just like before, the AI takes the action...