Introducing LLMs
In this section, we’ll take a more systematic approach and dive deeper into transformer-based architectures. As we mentioned in the introduction, the transformer block has changed remarkably little since its introduction in 2017. Instead, the main advances have come from larger models and larger training sets. For example, the original GPT model (GPT-1) has 117M parameters, while GPT-3 (Language Models are Few-Shot Learners, https://arxiv.org/abs/2005.14165) has 175B, an increase of roughly 1,500 times. Based on size, we can informally distinguish two categories of transformer models, listed after the following sketch:
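To make these parameter counts concrete, here is a minimal sketch, assuming the Hugging Face transformers and PyTorch packages are installed, that loads the openly available GPT-2 model and counts its weights; GPT-3 itself is not openly released, so its published 175B figure is used only for comparison:

```python
from transformers import AutoModelForCausalLM

# GPT-2 (small) is the closest openly available successor of GPT-1
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Sum the number of elements across all weight tensors of the model
num_params = sum(p.numel() for p in model.parameters())

print(f"GPT-2 parameters: {num_params / 1e6:.0f}M")  # roughly 124M
print(f"GPT-3 parameters: 175B (~{175e9 / num_params:.0f}x larger)")
```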
- Pre-trained language models (PLMs): Transformers with fewer parameters, such as Bidirectional Encoder Representations from Transformers (BERT) and generative pre-trained transformers (GPT), fall into this category. Starting with BERT, these transformers introduced the two-step pre-training/fine-tuning (FT) paradigm. The combination of the attention mechanism and unsupervised pre-training (masked...