7
The Attention Mechanism and Transformers
In Chapter 6, we outlined a typical natural language processing (NLP) pipeline, and we introduced recurrent neural networks (RNNs) as a candidate architecture for NLP tasks. But we also outlined their drawbacks—they are inherently sequential (that is, not parallelizable), and they struggle with longer sequences because of the limitations of their fixed-size internal sequence representation. In this chapter, we’ll introduce the attention mechanism, which gives a neural network (NN) direct access to the whole input sequence. We’ll briefly discuss attention in the context of RNNs, since it was first introduced as an RNN extension. However, the star of this chapter will be the transformer—a recent NN architecture that relies entirely on attention. Transformers have been one of the most important NN innovations of the past 10 years. They are at the core of all recent large language models (LLMs), such as ChatGPT...
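To preview the core idea before we develop it formally later in the chapter, here is a minimal sketch of (scaled dot-product) attention in plain NumPy. The function name and toy dimensions are illustrative rather than taken from any particular library; the point is simply that each output position is a weighted sum over all input positions, so the network looks at the entire sequence directly instead of compressing it into a single recurrent state.

```python
import numpy as np

def scaled_dot_product_attention(q, k, v):
    """Minimal attention sketch: every query attends to every input position.

    q: (seq_len_q, d_k) query vectors
    k: (seq_len_kv, d_k) key vectors
    v: (seq_len_kv, d_v) value vectors
    """
    d_k = q.shape[-1]
    # Similarity of each query with every key -> (seq_len_q, seq_len_kv)
    scores = q @ k.T / np.sqrt(d_k)
    # Softmax over the key dimension turns scores into attention weights
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output is a weighted sum over *all* value vectors
    return weights @ v

# Toy self-attention example: a sequence of 5 tokens with 4-dimensional embeddings
x = np.random.randn(5, 4)
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # (5, 4)
```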