You're reading from Using Stable Diffusion with Python Leverage Python to control and automate high-quality AI image generation using Stable Diffusion

Product type Paperback

Published in Jun 2024

Publisher Packt

ISBN-13 9781835086377

Length 352 pages

Edition 1st Edition

Languages

Python

Concepts

GPT/LLMs

Author (1):

Andrew Zhu (Shudong Zhu)

View More author details

Table of Contents (29) Chapters

Preface

1. Part 1 – A Whirlwind of Stable Diffusion

2. Chapter 1: Introducing Stable Diffusion FREE CHAPTER

3. Chapter 2: Setting Up the Environment for Stable Diffusion

4. Chapter 3: Generating Images Using Stable Diffusion

5. Chapter 4: Understanding the Theory Behind Diffusion Models

6. Chapter 5: Understanding How Stable Diffusion Works

7. Chapter 6: Using Stable Diffusion Models

8. Part 2 – Improving Diffusers with Custom Features

9. Chapter 7: Optimizing Performance and VRAM Usage

10. Chapter 8: Using Community-Shared LoRAs

11. Chapter 9: Using Textual Inversion

12. Chapter 10: Overcoming 77-Token Limitations and Enabling Prompt Weighting

13. Chapter 11: Image Restore and Super-Resolution

14. Chapter 12: Scheduled Prompt Parsing

15. Part 3 – Advanced Topics

16. Chapter 13: Generating Images with ControlNet

17. Chapter 14: Generating Video Using Stable Diffusion

18. Chapter 15: Generating Image Descriptions Using BLIP-2 and LLaVA

19. Chapter 16: Exploring Stable Diffusion XL

20. Chapter 17: Building Optimized Prompts for Stable Diffusion

21. Part 4 – Building Stable Diffusion into an Application

22. Chapter 18: Applications – Object Editing and Style Transferring

23. Chapter 19: Generation Data Persistence

24. Chapter 20: Creating Interactive User Interfaces

25. Chapter 21: Diffusion Model Transfer Learning

26. Chapter 22: Exploring Beyond Stable Diffusion

27. Index

Why subscribe?

28. Other Books You May Enjoy

Optimization solution 2 – enabling VAE tiling

Stable Diffusion VAE tiling is a technique that can be used to generate large images. It works by splitting an image into small tiles and then generating each tile separately. This technique allows the generation of large images without using too much VRAM.

Note that the result of tiled encoding and decoding will differ unnoticeably from the non-tiled version. Diffusers’ implementation of VAE tiling uses overlap tiles to blend edges to form a much smoother output.

You can turn on VAE tiling by adding the one-line code, text2img_pipe.enable_vae_tiling(), before inferencing:

import torch
from diffusers import StableDiffusionPipeline
text2img_pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    torch_dtype = torch.float16       # <- load float16 version weight
).to("cuda:0")
text2img_pipe...

The rest of the chapter is locked

Tech Concepts

Programming languages

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (1)

Andrew Zhu (Shudong Zhu)

Andrew Zhu is an experienced Microsoft Applied Data Scientist with over 15 years of experience in the tech field. He is a highly regarded writer known for his ability to explain complex concepts in machine learning and AI in an engaging and informative manner. Andrew frequently contributes articles to Toward Data Science and other prominent tech publishers. He has authored the book "Microsoft Workflow Foundation 4.0 Cookbook," which has received a 4.5-star review. Andrew has a strong command of programming languages such as C/C++, Java, C#, and Javascript, with his current focus primarily on Python. With a passion for AI and Automation, Andrew resides in WA, US, with his family, which includes two boys.

See other products by Andrew Zhu (Shudong Zhu)