You're reading from AI Crash Course A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python

Product type Paperback

Published in Nov 2019

Publisher Packt

ISBN-13 9781838645359

Length 360 pages

Edition 1st Edition

Languages

Python

Concepts

Artificial Intelligence

Author (1):

Hadelin de Ponteves

View More author details

Table of Contents (17) Chapters

Preface

1. Welcome to the Robot World FREE CHAPTER

2. Discover Your AI Toolkit

3. Python Fundamentals – Learn How to Code in Python

4. AI Foundation Techniques

5. Your First AI Model – Beware the Bandits!

6. AI for Sales and Advertising – Sell like the Wolf of AI Street

7. Welcome to Q-Learning

8. AI for Logistics – Robots in a Warehouse

9. Going Pro with Artificial Brains – Deep Q-Learning

10. AI for Autonomous Vehicles – Build a Self-Driving Car

11. AI for Business – Minimize Costs with Deep Q-Learning

12. Deep Convolutional Q-Learning

13. AI for Games – Become the Master at Snake

14. Recap and Conclusion

15. Other Books You May Enjoy

Leave a review - let other readers know what you think

16. Index

Summary

In this chapter, you learned the five fundamental principles of Artificial Intelligence from a Reinforcement Learning perspective. Firstly, an AI is a system that takes an observation (values, images, or any data) as input, and returns an action to perform as output (principle #1). Then, there is a reward system that helps it measure its performance. The AI will learn through trial and error based on the reward it gets over time (principle #2). The input (state), the output (action), and the reward system define the AI environment (principle #3). The AI interacts with this environment through the Markov decision process (principle #4). Finally, in training mode, the AI learns how to maximize its total reward by updating its parameters through the iterations, and in inference mode, the AI simply performs its actions over full episodes without updating any of its parameters – that is to say, without learning (principle #5).

In the next chapter, you will...