You're reading from Hands-On Mathematics for Deep Learning: Build a solid mathematical foundation for training efficient deep neural networks

Product type Paperback

Published in Jun 2020

Publisher Packt

ISBN-13 9781838647292

Length 364 pages

Edition 1st Edition

Languages

Python

Tools

Pandas

Concepts

Deep Learning

Author (1):

Jay Dawani

Table of Contents (19) Chapters

Preface

1. Section 1: Essential Mathematics for Deep Learning

2. Linear Algebra

3. Vector Calculus

4. Probability and Statistics

5. Optimization

6. Graph Theory

7. Section 2: Essential Neural Networks

8. Linear Neural Networks

9. Feedforward Neural Networks

10. Regularization

11. Convolutional Neural Networks

12. Recurrent Neural Networks

13. Section 3: Advanced Deep Learning Concepts Simplified

14. Attention Mechanisms

15. Generative Models

16. Transfer and Meta Learning

17. Geometric Deep Learning

18. Other Books You May Enjoy

Long short-term memory

As we saw earlier, standard RNNs have some limitations; in particular, they suffer from the vanishing gradient problem. The LSTM architecture was proposed by Sepp Hochreiter and Jürgen Schmidhuber (ftp://ftp.idsia.ch/pub/juergen/lstm.pdf) as a solution to the long-term dependency problem that RNNs face.
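To make the vanishing gradient problem concrete, here is a minimal sketch, assuming a deliberately simplified scalar RNN with no input, h_t = tanh(w * h_{t-1}). The function name and the values of `w`, `h0`, and `steps` are illustrative choices, not taken from the book. It tracks the magnitude of the derivative of h_t with respect to h_0, which is a product of one chain-rule factor per time step.

```python
import math


def gradient_magnitudes(w, h0, steps):
    """Return |d h_t / d h_0| for t = 1..steps in the toy RNN h_t = tanh(w * h_{t-1})."""
    h = h0
    grad = 1.0
    magnitudes = []
    for _ in range(steps):
        h = math.tanh(w * h)
        # Chain rule: d h_t / d h_{t-1} = w * (1 - tanh(w * h_{t-1})^2) = w * (1 - h_t^2)
        grad *= w * (1.0 - h * h)
        magnitudes.append(abs(grad))
    return magnitudes


# Hypothetical settings: a recurrent weight below 1 and 30 time steps.
magnitudes = gradient_magnitudes(w=0.5, h0=1.0, steps=30)
print(magnitudes[0], magnitudes[9], magnitudes[29])
```

Because every factor here is at most 0.5, the product shrinks geometrically with the number of steps, so the earliest time steps receive almost no learning signal. This is the behavior the LSTM was designed to counteract.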

LSTM cells differ from vanilla RNN cells in a few ways. First, the network is built from what we call memory blocks, which are recurrently connected subnets. Second, each memory block contains not only self-connected memory cells but also three multiplicative units: the input, output, and forget gates.
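The role of the three gates can be sketched in code. The following is a minimal illustration of one forward step, assuming the commonly used LSTM formulation with a forget gate and no peephole connections; it is not the book's implementation. All names (`lstm_step`, `init_params`, the `W_*` and `b_*` keys) are hypothetical, and it uses only the Python standard library so that it runs anywhere.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def affine(W, v, b):
    """Compute W v + b, where W is a list of rows."""
    return [sum(w * x for w, x in zip(row, v)) + b_k for row, b_k in zip(W, b)]


def lstm_step(x, h_prev, c_prev, params):
    """One LSTM time step; returns the new hidden state and cell state."""
    z = h_prev + x  # concatenate previous hidden state and current input
    i = [sigmoid(a) for a in affine(params["W_i"], z, params["b_i"])]    # input gate
    f = [sigmoid(a) for a in affine(params["W_f"], z, params["b_f"])]    # forget gate
    o = [sigmoid(a) for a in affine(params["W_o"], z, params["b_o"])]    # output gate
    g = [math.tanh(a) for a in affine(params["W_g"], z, params["b_g"])]  # candidate values
    # The self-connected memory cell: keep part of the old state, write part of the candidate.
    c = [f_k * c_k + i_k * g_k for f_k, c_k, i_k, g_k in zip(f, c_prev, i, g)]
    # The output gate controls how much of the cell state is exposed.
    h = [o_k * math.tanh(c_k) for o_k, c_k in zip(o, c)]
    return h, c


def init_params(input_size, hidden_size, seed=0):
    """Small random weights and zero biases (an arbitrary choice for this sketch)."""
    rng = random.Random(seed)
    params = {}
    for gate in "ifog":
        params["W_" + gate] = [
            [rng.uniform(-0.1, 0.1) for _ in range(hidden_size + input_size)]
            for _ in range(hidden_size)
        ]
        params["b_" + gate] = [0.0] * hidden_size
    return params


params = init_params(input_size=3, hidden_size=2)
h, c = [0.0, 0.0], [0.0, 0.0]
for x in ([1.0, 0.0, -1.0], [0.5, 0.5, 0.5]):
    h, c = lstm_step(x, h, c, params)
print(h, c)
```

The additive update of `c` is the key design choice: when the forget gate is close to 1 and the input gate is close to 0, the cell state is carried forward almost unchanged, which gives gradients a path that is not repeatedly squashed.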

Let's take a look at a single LSTM cell, and then dive into the details to gain a better understanding. In the following diagram, you can see what an LSTM block looks like and the operations that...

Authors (1)

Jay Dawani is a former professional swimmer turned mathematician and computer scientist. He is also a Forbes 30 Under 30 Fellow. At present, he is the Director of Artificial Intelligence at Geometric Energy Corporation (NATO CAGE) and the CEO of Lemurian Labs, a startup he founded that is developing the next generation of autonomy, intelligent process automation, and driver intelligence. He has also served as the technology and R&D advisor to Spacebit Capital. He has spent the last three years researching at the frontiers of AI with a focus on reinforcement learning, open-ended learning, deep learning, quantum machine learning, human-machine interaction, multi-agent and complex systems, and artificial general intelligence.
