Packt+ | Advance your knowledge in tech

You're reading from Python Reinforcement Learning Solve complex real-world problems by mastering reinforcement learning algorithms using OpenAI Gym and TensorFlow

Product type Course

Published in Apr 2019

Publisher Packt

ISBN-13 9781838649777

Length 496 pages

Edition 1st Edition

Languages

Python

Tools

OpenAI Gym

Concepts

Reinforcement Learning

Authors (4):

Yang Wenzhuo

Sean Saito

Sudharsan Ravichandiran

Rajalingappaa Shanmugamani

View More author details

Table of Contents (27) Chapters

Title Page

About Packt

Contributors

Preface

1. Introduction to Reinforcement Learning FREE CHAPTER

2. Getting Started with OpenAI and TensorFlow

3. The Markov Decision Process and Dynamic Programming

4. Gaming with Monte Carlo Methods

5. Temporal Difference Learning

6. Multi-Armed Bandit Problem

7. Playing Atari Games

8. Atari Games with Deep Q Network

9. Playing Doom with a Deep Recurrent Q Network

10. The Asynchronous Advantage Actor Critic Network

11. Policy Gradients and Optimization

12. Balancing CartPole

13. Simulating Control Tasks

14. Building Virtual Worlds in Minecraft

15. Learning to Play Go

16. Creating a Chatbot

17. Generating a Deep Learning Image Classifier

18. Predicting Future Stock Prices

19. Capstone Project - Car Racing Using DQN

20. Looking Ahead

1. Assessments

2. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Neural Architecture Search

The next few sections will describe the NAS framework. You will learn about how the framework learns to generate other neural networks to complete tasks using a popular reinforcement learning scheme called REINFORCE, which is a type of policy gradient algorithm.

Generating and training child networks

Research on algorithms that generate neural architectures has been around since the 1970's. What sets NAS apart from previous works is its ability to cater to large-scale deep learning algorithms and its formulation of the task as a reinforcement learning problem. More specifically, the agent, which we will refer to as the Controller, is a recurrent neural network that generates a sequence of values. You can think of these values as a sort of genetic code of the child network that defines its architecture; it sets the sizes of each convolutional kernel, the length of each kernel, the number of filters in each layer, and so on. In more advanced frameworks, the values...

The rest of the chapter is locked

You're reading from Python Reinforcement Learning Solve complex real-world problems by mastering reinforcement learning algorithms using OpenAI Gym and TensorFlow

Table of Contents (27) Chapters

Neural Architecture Search

Generating and training child networks

Unlock this book and the full library FREE for 7 days

Authors (4)

Personalised recommendations for you