We spent the better part of this chapter touting the advantages of the attention mechanism. But so far we have only used attention in the context of RNNs; in that sense, it works as an addition on top of the core recurrent nature of these models. Since attention works so well, is there a way to use it on its own, without the RNN part? It turns out that there is. The paper Attention Is All You Need (https://arxiv.org/abs/1706.03762) introduces a new encoder-decoder architecture called the transformer, which relies solely on the attention mechanism. First, we'll focus our attention on the transformer attention (pun intended) mechanism.
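To make that idea concrete before we dive in, here is a minimal NumPy sketch of the scaled dot-product attention defined in the paper, softmax(QK^T / sqrt(d_k))V. The function name and the toy input shapes are our own choices for illustration, not part of the paper; the point is simply that every position attends to every other position directly, with no recurrence involved.

import numpy as np

def scaled_dot_product_attention(q, k, v):
    """Compute softmax(q @ k.T / sqrt(d_k)) @ v for 2D inputs (sequence, features)."""
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)                    # similarity of each query to each key
    scores -= scores.max(axis=-1, keepdims=True)       # numerical stability for the softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)     # softmax over the keys
    return weights @ v                                  # weighted sum of the values

# Toy self-attention: a sequence of 4 tokens with 8-dimensional embeddings attends to itself,
# so the queries, keys, and values all come from the same input.
x = np.random.randn(4, 8)
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # (4, 8)

Note that nothing here depends on processing the sequence step by step, which is exactly what lets the transformer drop the RNN.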
Understanding transformers
The transformer attention
Before focusing on the entire model, let...