You're reading from Mastering Reinforcement Learning with Python Build next-generation, self-learning models using reinforcement learning techniques and best practices

Product type Paperback

Published in Dec 2020

Publisher Packt

ISBN-13 9781838644147

Length 544 pages

Edition 1st Edition

Languages

Python

Tools

PyTorch

Concepts

Reinforcement Learning

Author (1):

Enes Bilgin

View More author details

Table of Contents (24) Chapters

Preface

1. Section 1: Reinforcement Learning Foundations

2. Chapter 1: Introduction to Reinforcement Learning FREE CHAPTER

3. Chapter 2: Multi-Armed Bandits

4. Chapter 3: Contextual Bandits

5. Chapter 4: Makings of a Markov Decision Process

6. Chapter 5: Solving the Reinforcement Learning Problem

7. Section 2: Deep Reinforcement Learning

8. Chapter 6: Deep Q-Learning at Scale

9. Chapter 7: Policy-Based Methods

10. Chapter 8: Model-Based Methods

11. Chapter 9: Multi-Agent Reinforcement Learning

12. Section 3: Advanced Topics in RL

13. Chapter 10: Introducing Machine Teaching

14. Chapter 11: Achieving Generalization and Overcoming Partial Observability

15. Chapter 12: Meta-Reinforcement Learning

16. Chapter 13: Exploring Advanced Topics

17. Section 4: Applications of RL

18. Chapter 14: Solving Robot Learning

19. Chapter 15: Supply Chain Management

20. Chapter 16: Personalization, Marketing, and Finance

21. Chapter 17: Smart City and Cybersecurity

22. Chapter 18: Challenges and Future Directions in Reinforcement Learning

23. Other Books You May Enjoy

Leave a review - let other readers know what you think

Why reinforcement learning?

Creating intelligent machines that make decisions at or superior to human level is a dream of many scientist and engineers, and one which is gradually becoming closer to reality. In the seven decades since the Turing test, AI research and development has been on a roller coaster. The expectations were very high initially: In the 1960s, for example, Herbert Simon (who later received the Nobel Prize in Economics) predicted that machines would be capable of doing any work humans can do within twenty years. It was this excitement that attracted big government and corporate funding flowing into AI research, only to be followed by big disappointments and a period called the "AI winter." Decades later, thanks to the incredible developments in computing, data, and algorithms, humankind is again very excited, more than ever before, in its pursuit of the AI dream.

Note

If you're not familiar with Alan Turing's instrumental work on the foundations of AI in 1950, it's worth learning more about the Turing Test here: https://youtu.be/3wLqsRLvV-c

The AI dream is certainly one of grandiosity. After all, the potential in intelligent autonomous systems is enormous. Think about how we are limited in terms of specialist medical doctors in the world. It takes years and significant intellectual and financial resources to educate them, which many countries don't have at sufficient levels. In addition, even after years of education, it is nearly impossible for a specialist to stay up-to-date with all of the scientific developments in her field, learn from the outcomes of the tens of thousands of treatments around the world, and effectively incorporate all this knowledge into practice.

Conversely, an AI model could process and learn from all this data and combine it with a rich set of information about a patient (medical history, lab results, presenting symptoms, health profile) to make diagnosis and suggest treatments. Such a model could serve even in the most rural parts of the world (as far as an internet connection and computer are available) and direct the local health personnel about the treatment. No doubt that it would revolutionize international healthcare and improve the lives of millions of people.

Note

AI is already transforming the healthcare industry. In a recent article, Google published results from an AI system surpassing human experts in breast cancer prediction using mammography readings (McKinney et al. 2020). Microsoft is collaborating with one of India's largest healthcare providers to detect cardiac illnesses using AI (Agrawal, 2018). IBM Watson for Clinical Trial Matching uses natural language processing to recommend potential treatments for patients from medical databases (https://youtu.be/grDWR7hMQQQ).

On our quest to develop AI systems that are at or superior to human level, which is -sometimes controversially- called Artificial General Intelligence (AGI), it makes sense to develop a model that can learn from its own experience - without necessarily needing a supervisor. RL is the computational framework that enables us to create such intelligent agents. To better understand the value of RL, it is important to compare it with the other ML paradigms, which we'll look into next.

You're reading from Mastering Reinforcement Learning with Python Build next-generation, self-learning models using reinforcement learning techniques and best practices

Table of Contents (24) Chapters

Why reinforcement learning?

Authors (1)

Other recommended products

Personalised recommendations for you

You're reading from Mastering Reinforcement Learning with Python Build next-generation, self-learning models using reinforcement learning techniques and best practices

Table of Contents (24) Chapters Close

Why reinforcement learning?

Authors (1)

Other recommended products

Personalised recommendations for you

Table of Contents (24) Chapters