We've already seen the concept of memory (albeit in a somewhat unusual form) in NNs—for example, the LSTM cell can add or remove information in its cell state with the help of the input and forget gates. Another example is the attention mechanism, where the set of vectors representing the encoded source sequence can be viewed as external memory that is written to by the encoder and read from by the decoder. But this ability comes with some limitations. For one, the encoder can only write to a single memory location at each step: the one corresponding to the current element of the sequence. It also cannot update previously written vectors. The decoder, on the other hand, can only read from this memory, but cannot write to it.
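To make the memory analogy concrete, here is a minimal NumPy sketch of attention as a read operation over external memory. The memory matrix stands in for the encoder's output vectors (random placeholders here), and dot-product scoring is assumed as the similarity measure; note that the read is a soft, weighted sum and nothing is ever written back:

import numpy as np

def softmax(x):
    """Numerically stable softmax."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

# "External memory": one vector per encoded source token.
# The encoder writes exactly one row per step and never revisits it.
seq_len, d = 5, 8
memory = np.random.randn(seq_len, d)   # placeholder encoder states

# Decoder "read": a query vector attends over the whole memory.
query = np.random.randn(d)
scores = memory @ query                # one similarity score per memory slot
weights = softmax(scores)              # normalized read weights
read_vector = weights @ memory         # soft read: weighted sum of all slots

print(weights.round(3), read_vector.shape)

The asymmetry described above is visible in the code: memory is filled once, row by row, and the decoder side only ever computes weighted sums over it.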
In this section, we'll take the concept of memory one step further and look at Memory-Augmented NNs (MANNs), which resolve these limitations.