PEFT
Traditional fine-tuning methods become increasingly impractical as model size grows, due to the immense computational resources and time required to train and update all model parameters. For most businesses, including larger organizations, the classical approach to fine-tuning is cost-prohibitive and, effectively, a non-starter.
Parameter-efficient fine-tuning (PEFT) methods, by contrast, modify only a small subset of a model’s parameters, reducing the computational burden while still achieving state-of-the-art performance. These methods are advantageous for adapting large models to specific tasks without extensive retraining.
One such PEFT method is the Low-Rank Adaptation (LoRA) methodology, developed by Hu et al. (2021).
LoRA
The LoRA method selectively fine-tunes specific components within the Transformer architecture to improve efficiency and effectiveness in LLMs. LoRA targets the weight matrices found in the self-attention module of the Transformer, which, as discussed in...
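The core idea can be sketched in a few lines: the pretrained weight matrix W is frozen, and a trainable low-rank update BA (with rank r much smaller than the matrix dimensions) is added to it, scaled by alpha/r as in Hu et al. (2021). The minimal NumPy implementation below is an illustrative sketch, not the reference implementation; the class name and initialization constants are assumptions.

```python
import numpy as np

class LoRALinear:
    """Hypothetical sketch of a LoRA-adapted linear layer (NumPy only).

    The pretrained weight W stays frozen; only the low-rank factors
    A (r x d_in) and B (d_out x r) would be trained.
    """

    def __init__(self, d_in, d_out, r=4, alpha=8, seed=0):
        rng = np.random.default_rng(seed)
        # Frozen pretrained weight (stand-in values for illustration).
        self.W = rng.standard_normal((d_out, d_in)) * 0.02
        # Trainable down-projection A, small random init (as in the paper).
        self.A = rng.standard_normal((r, d_in)) * 0.01
        # Trainable up-projection B, zero-initialized so BA starts at zero.
        self.B = np.zeros((d_out, r))
        # LoRA scaling factor alpha / r.
        self.scale = alpha / r

    def forward(self, x):
        # y = x W^T + (alpha/r) * x A^T B^T  — base output plus low-rank update.
        return x @ self.W.T + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(d_in=16, d_out=8, r=4)
x = np.ones((2, 16))
y = layer.forward(x)
```

Because B is zero-initialized, the adapted layer initially reproduces the frozen base projection exactly, and fine-tuning only has to learn the r * (d_in + d_out) parameters of A and B instead of all d_in * d_out entries of W.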