You're reading from Deep Learning for Beginners A beginner's guide to getting up and running with deep learning from scratch using Python

Product type Paperback

Published in Sep 2020

Publisher Packt

ISBN-13 9781838640859

Length 432 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Deep Learning

Authors (2):

Pablo Rivas

Table of Contents (20) Chapters

Preface

1. Section 1: Getting Up to Speed

2. Introduction to Machine Learning

3. Setup and Introduction to Deep Learning Frameworks

4. Preparing Data

5. Learning from Data

6. Training a Single Neuron

7. Training Multiple Layers of Neurons

8. Section 2: Unsupervised Deep Learning

9. Autoencoders

10. Deep Autoencoders

11. Variational Autoencoders

12. Restricted Boltzmann Machines

13. Section 3: Supervised Deep Learning

14. Deep and Wide Neural Networks

15. Convolutional Neural Networks

16. Recurrent Neural Networks

17. Generative Adversarial Networks

18. Final Remarks on the Future of Deep Learning

19. Other Books You May Enjoy

Long short-term memory models

Initially proposed by Hochreiter and Schmidhuber, long short-term memory models (LSTMs) gained traction as an improved version of recurrent models [Hochreiter, S., et al. (1997)]. LSTMs promised to alleviate the following problems associated with traditional RNNs:

Vanishing gradients
Exploding gradients
The inability to remember or forget certain aspects of the input sequences
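
The first two problems can be made concrete with a minimal sketch, assuming a scalar linear recurrence h_t = w * h_{t-1} (a deliberate simplification, not the model used in this chapter). Under that assumption, the gradient of the last value with respect to the first is w multiplied by itself once per time step, so it shrinks toward zero when |w| < 1 and grows without bound when |w| > 1. The function name `gradient_through_time` is hypothetical:

```python
def gradient_through_time(w: float, steps: int) -> float:
    """Return d h_T / d h_0 for the scalar linear recurrence h_t = w * h_{t-1}.

    Assumption: no nonlinearity and a single shared weight w, chosen only
    to illustrate how repeated multiplication affects the gradient.
    """
    grad = 1.0
    for _ in range(steps):
        grad *= w  # chain rule: each time step contributes one factor of w
    return grad


vanishing = gradient_through_time(0.5, 50)  # |w| < 1: gradient shrinks
exploding = gradient_through_time(1.5, 50)  # |w| > 1: gradient grows
print(f"w=0.5 over 50 steps: {vanishing:.3e}")
print(f"w=1.5 over 50 steps: {exploding:.3e}")
```

Real RNNs multiply by Jacobian matrices rather than a single scalar, but the same compounding effect is what makes long sequences hard to train on.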

The following diagram shows a highly simplified version of an LSTM. In (b), we can see the additional self-loop that is attached to a memory element, and in (c), we can observe what the network looks like when unfolded (expanded) over time:

Figure 13.6. Simplified representation of an LSTM

There is much more to the model, but the most essential elements are shown in Figure 13.6. Observe how an LSTM layer receives from the previous time step not only the previous output, but also something called state, which acts as a type of memory. In the diagram, you can see that while the current output and state are available...
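
The idea that each time step receives both the previous output and a state can be sketched in plain Python. This is a minimal, hedged illustration that assumes the commonly used LSTM formulation with forget, input, and output gates, reduced to scalars; it is not necessarily the exact variant drawn in Figure 13.6. The names `lstm_step` and `params`, and all weight values, are hypothetical:

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, params):
    """One time step of a scalar LSTM cell.

    x      -- current input
    h_prev -- output of the previous time step
    c_prev -- state (memory) of the previous time step
    params -- dict mapping gate name to (input weight, recurrent weight, bias)
    Returns the new output and the new state.
    """
    def gate(name, activation):
        w, u, b = params[name]
        return activation(w * x + u * h_prev + b)

    f = gate("forget", sigmoid)        # how much of the old state to keep
    i = gate("input", sigmoid)         # how much new information to write
    o = gate("output", sigmoid)        # how much of the state to expose
    g = gate("candidate", math.tanh)   # proposed new content for the state

    c = f * c_prev + i * g             # updated state (the memory)
    h = o * math.tanh(c)               # updated output
    return h, c


# Illustrative weights only; a trained model would learn these values.
params = {name: (0.5, 0.5, 0.0)
          for name in ("forget", "input", "output", "candidate")}

h, c = 0.0, 0.0  # initial output and state
for x in [1.0, -1.0, 0.5]:
    h, c = lstm_step(x, h, c, params)
    print(f"x={x:+.1f}  output={h:+.4f}  state={c:+.4f}")
```

Note how the loop carries both `h` and `c` from one step to the next, mirroring the unfolded view of the network. In Keras, an `LSTM` layer created with `return_state=True` similarly returns its output together with its hidden and cell states.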

Dr. Pablo Rivas is an assistant professor of computer science at Baylor University in Texas. He worked in industry for a decade as a software engineer before becoming an academic. He is a senior member of the IEEE, ACM, and SIAM. He was formerly at NASA Goddard Space Flight Center performing research. He is an ally of women in technology, a deep learning evangelist, machine learning ethicist, and a proponent of the democratization of machine learning and artificial intelligence in general. He teaches machine learning and deep learning. Dr. Rivas is a published author and all his papers are related to machine learning, computer vision, and machine learning ethics. Dr. Rivas prefers Vim to Emacs and spaces to tabs.
