You're reading from Python Deep Learning Understand how deep neural networks work and apply them to real-world tasks

Product type Paperback

Published in Nov 2023

Publisher Packt

ISBN-13 9781837638505

Length 362 pages

Edition 3rd Edition

Languages

Python

Tools

Keras

Concepts

Deep Learning

Author (1):

Ivan Vasilev

View More author details

Table of Contents (17) Chapters

Preface

1. Part 1:Introduction to Neural Networks

2. Chapter 1: Machine Learning – an Introduction FREE CHAPTER

3. Chapter 2: Neural Networks

4. Chapter 3: Deep Learning Fundamentals

5. Part 2: Deep Neural Networks for Computer Vision

6. Chapter 4: Computer Vision with Convolutional Networks

7. Chapter 5: Advanced Computer Vision Applications

8. Part 3: Natural Language Processing and Transformers

9. Chapter 6: Natural Language Processing and Recurrent Neural Networks

10. Chapter 7: The Attention Mechanism and Transformers

11. Chapter 8: Exploring Large Language Models in Depth

12. Chapter 9: Advanced Applications of Large Language Models

13. Part 4: Developing and Deploying Deep Neural Networks

14. Chapter 10: Machine Learning Operations (MLOps)

15. Index

Why subscribe?

16. Other Books You May Enjoy

9

Advanced Applications of Large Language Models

In the previous two chapters, we introduced the transformer architecture and learned about its latest large-scale incarnations, known as large language models (LLMs). We discussed them in the context of natural language processing (NLP) tasks. NLP was the original transformer application and is still the field at the forefront of LLM development today. However, the success of the architecture has led the research community to explore the application of transformers in other areas, such as computer vision.

In this chapter, we’ll focus on these areas. We’ll discuss transformers as replacements for convolutional networks (CNNs, Chapter 4) for tasks such as image classification and object detection. We’ll also learn how to use them as generative models for images instead of text, as we have done until now. We’ll also implement a model fine-tuning example – something we failed to do in Chapter 8. And...