You're reading from Hands-On Deep Learning with Go: A practical guide to building and implementing neural network models using Go

Product type Paperback

Published in Aug 2019

Publisher Packt

ISBN-13 9781789340990

Length 242 pages

Edition 1st Edition

Tools CUDA

Concepts Deep Learning

Authors (2):

Darrell Chua

Gareth Seneque

Table of Contents (15) Chapters

Preface

1. Section 1: Deep Learning in Go, Neural Networks, and How to Train Them

2. Introduction to Deep Learning in Go

3. What Is a Neural Network and How Do I Train One?

4. Beyond Basic Neural Networks - Autoencoders and RBMs

5. CUDA - GPU-Accelerated Training

6. Section 2: Implementing Deep Neural Network Architectures

7. Next Word Prediction with Recurrent Neural Networks

8. Object Recognition with Convolutional Neural Networks

9. Maze Solving with Deep Q-Networks

10. Generative Models with Variational Autoencoders

11. Section 3: Pipeline, Deployment, and Beyond!

12. Building a Deep Learning Pipeline

13. Scaling Deployment

14. Other Books You May Enjoy

RNNs and vanishing gradients

RNNs are an important architectural innovation, but they run into the problem of vanishing gradients. When gradient values become so small that the resulting weight updates are equally tiny, learning slows or even halts. Your digital neurons die, and your network doesn't do what you want it to do. But is a neural network with a bad memory better than one with no memory at all?

Let's zoom in a bit and discuss what's actually going on when you run into this problem. Recall the formula for updating a given weight during backpropagation:

W = W - LR*G

Here, the new weight value equals the current weight minus the learning rate multiplied by the gradient.
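The update rule above can be sketched in a few lines of plain Go. This is a minimal illustration rather than code from the book: the function name `updateWeight` and the sample values are assumptions chosen only to show how a very small gradient leaves the weight almost unchanged.

```go
package main

import "fmt"

// updateWeight applies one gradient descent step: W = W - LR*G.
// The name and signature are hypothetical, used here for illustration only.
func updateWeight(w, lr, g float64) float64 {
	return w - lr*g
}

func main() {
	// Illustrative values, not taken from the book.
	w, lr := 0.5, 0.01

	// A reasonably sized gradient moves the weight noticeably.
	fmt.Println(updateWeight(w, lr, 0.8))

	// A vanished gradient leaves the weight almost untouched,
	// so the network effectively stops learning.
	fmt.Println(updateWeight(w, lr, 1e-9))
}
```

With the second gradient, the change to the weight is on the order of 1e-11, which is the "equally tiny" update described earlier.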

Your network is propagating error derivatives across layers and across timesteps. The larger your dataset, the greater the number of timesteps and parameters, and so the greater...
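To get a feel for why propagating across many timesteps matters, here is a rough sketch in Go. It assumes, as a simplification, that each backward step scales the gradient by a constant factor below 1; in a real RNN that factor depends on the weights and the activation derivatives. The function name `gradientAfter` and the factor of 0.5 are hypothetical and chosen only for illustration.

```go
package main

import "fmt"

// gradientAfter returns the value of a gradient that starts at g and is
// multiplied by factor once for every timestep it is propagated back through.
// This constant-factor model is an assumption made for illustration.
func gradientAfter(g, factor float64, timesteps int) float64 {
	for i := 0; i < timesteps; i++ {
		g *= factor
	}
	return g
}

func main() {
	const factor = 0.5 // assumed per-timestep scaling, not from the book

	// The further back the error travels, the smaller the gradient becomes.
	for _, steps := range []int{1, 10, 50} {
		fmt.Printf("timesteps=%d gradient=%g\n", steps, gradientAfter(1.0, factor, steps))
	}
}
```

Because the shrinkage compounds multiplicatively, the gradient decays exponentially with the number of timesteps, and the weight updates for early timesteps shrink along with it.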

Authors (2)

Gareth Seneque is a machine learning engineer with 11 years' experience of building and deploying systems at scale in the finance and media industries. He became interested in deep learning in 2014 and is currently building a search platform within his organization, using neuro-linguistic programming and other machine learning techniques to generate content metadata and drive recommendations. He has contributed to a number of open source projects, including CoREBench and Gorgonia. He also has extensive experience with modern DevOps practices, using AWS, Docker, and Kubernetes to effectively distribute the processing of machine learning workloads.

Darrell Chua is a senior data scientist with more than 10 years' experience. He has developed models of varying complexity, from building credit scorecards with logistic regression to creating image classification models for trading cards. He has spent the majority of his time working in fintech companies, trying to bring machine learning technologies into the world of finance. He has been programming in Go for several years and has been working on deep learning models for even longer. Among his achievements is the creation of numerous business intelligence and data science pipelines that enable the delivery of a top-of-the-line automated underwriting system, producing near-instant approval decisions.