How does LoRA work?
LoRA (Low-Rank Adaptation) is a technique for quickly fine-tuning large models, first introduced for large language models by Microsoft researchers in a paper by Edward J. Hu et al. [1] and since widely adopted for diffusion models. It works by training a small, low-rank update that adapts the model to a specific concept. This small model can be merged with the main checkpoint model to generate images similar to those used to train the LoRA.
Let’s use W to denote the original UNet attention weights (Q, K, V), ΔW to denote the weight update learned during LoRA fine-tuning, and W′ to denote the merged weights. Adding a LoRA to a model can then be expressed like this:
W′ = W + ΔW
If we want to control the strength of the LoRA weights, we introduce a scale factor α. Adding a LoRA to the model can now be expressed like this:
W′ = W + αΔW
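As a minimal sketch of what this merge looks like in code, assuming we already have the base weight and the LoRA update as plain PyTorch tensors (the function name, shapes, and values below are illustrative, not the actual checkpoint layout):

```python
import torch

def merge_lora_weight(W: torch.Tensor, delta_W: torch.Tensor, alpha: float = 1.0) -> torch.Tensor:
    """Return the merged weight W' = W + alpha * delta_W."""
    return W + alpha * delta_W

# Illustrative shapes only: one 768x768 attention projection (e.g. a Q matrix).
W = torch.randn(768, 768)
delta_W = torch.randn(768, 768) * 0.01

# alpha = 0 keeps the original model; alpha = 1.0 applies the full LoRA update.
W_merged = merge_lora_weight(W, delta_W, alpha=0.8)
```

In a real checkpoint this merge would be repeated for every attention projection the LoRA was trained on, but the arithmetic per matrix is exactly the formula above.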
The range of α is typically from 0 to 1.0 [2], and setting it slightly larger than 1.0 is usually fine as well. The reason a LoRA file is so small is that ΔW can be represented by two small matrices...
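To make that size argument concrete, here is a hedged sketch of the low-rank factorization: instead of storing the full ΔW, a LoRA stores two much smaller matrices whose product reconstructs it. The names A and B, the dimension, and the rank below are illustrative choices, not values prescribed by the paper:

```python
import torch

d = 768   # width of one attention projection (illustrative)
r = 8     # LoRA rank, chosen to be much smaller than d

# LoRA stores only these two small matrices...
A = torch.randn(d, r) * 0.01   # d x r
B = torch.randn(r, d) * 0.01   # r x d

# ...and the full-size update is reconstructed as their product.
delta_W = A @ B                # d x d, the same shape as W

full_params = d * d            # 589,824 values for a full-rank update
lora_params = d * r + r * d    # 12,288 values actually stored by the LoRA
print(f"full ΔW: {full_params:,} params, LoRA factors: {lora_params:,} params")
```

With these illustrative numbers, the two factors hold roughly 2% of the parameters of the full update, which is why LoRA files are so much smaller than full checkpoints.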