What is ControlNet and how is it different?
When it comes to controlling image generation, you may recall textual embedding, LoRA, and the image-to-image diffusion pipeline. So what makes ControlNet different and useful?
Unlike other solutions, ControlNet is a model that works on the UNet diffusion process directly. We compare these solutions in Table 13.1:
| Control Method | Functioning Stage | Usage Scenario |
| --- | --- | --- |
| Textual Embedding | Text encoder | Add a new style, a new concept, or a new face |
| LoRA | Merge LoRA weights to the UNet model (and the CLIP text encoder, optional) | Add a set of styles, concepts, and generate content |
| Image-to-Image | Provide the initial latent image | Fix... |