Implementing a text-guided image-to-image Stable Diffusion inference pipeline
The only thing left to do now is blend the starting image latent with the initial latent noise. The latents_input
torch tensor is the latent we encoded from a dog image earlier in this chapter:
strength = 0.7
# scale the initial noise by the standard deviation required by the scheduler
latents = latents_input * (1 - strength) + \
    noise_tensor * scheduler.init_noise_sigma
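To see what this blending does, here is a minimal sketch using NumPy arrays as stand-ins for the real latent tensors. The shapes are illustrative only, and init_noise_sigma is assumed to be 1.0 (its value for schedulers such as PNDM); the real value comes from the scheduler you load.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-ins for the real tensors: a 4-channel 64x64 latent
# and Gaussian noise of the same shape (shapes are illustrative).
latents_input = rng.standard_normal((1, 4, 64, 64))
noise_tensor = rng.standard_normal((1, 4, 64, 64))
init_noise_sigma = 1.0  # assumption: 1.0, as in schedulers like PNDM

strength = 0.7
latents = latents_input * (1 - strength) + noise_tensor * init_noise_sigma

# With strength = 0.7, only 30% of the original latent survives, so the
# blended result correlates more with the noise than with the image latent.
corr_image = np.corrcoef(latents.ravel(), latents_input.ravel())[0, 1]
corr_noise = np.corrcoef(latents.ravel(), noise_tensor.ravel())[0, 1]
print(corr_image < corr_noise)  # True
```

This makes the role of strength concrete: it controls how much of the encoded image leaks into the starting latent before denoising begins.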
That is all that is necessary; use the same code from the text-to-image pipeline, and you should generate something like Figure 5.4:
Figure 5.4: A running dog, generated by a custom image-to-image Stable Diffusion pipeline
Note that the preceding code uses strength = 0.7; the strength denotes the weight of the initial latent noise. If you want an image more similar to the initial image (the image you provided to the image-to-image pipeline), use a lower strength value; otherwise, use a higher one.
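The effect of strength can be sketched numerically. The snippet below again uses NumPy stand-ins for the latents (shapes and init_noise_sigma = 1.0 are assumptions, as before) and measures how far the blended starting latent drifts from the encoded image latent as strength grows.

```python
import numpy as np

rng = np.random.default_rng(42)
latents_input = rng.standard_normal((1, 4, 64, 64))
noise_tensor = rng.standard_normal((1, 4, 64, 64))
init_noise_sigma = 1.0  # assumption, scheduler-dependent in practice


def blended_distance(strength):
    # Distance between the blended starting latent and the image latent.
    latents = latents_input * (1 - strength) + noise_tensor * init_noise_sigma
    return float(np.linalg.norm(latents - latents_input))


# A lower strength keeps the starting latent closer to the encoded image,
# so the generated output resembles the initial image more.
d_low, d_high = blended_distance(0.3), blended_distance(0.9)
print(d_low < d_high)  # True
```

The hypothetical blended_distance helper is only for illustration; in the real pipeline the trade-off shows up as output images that track the input photo more or less closely.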