StableDiffusionUpscalePipeline #1396
Conversation
pcuenca left a comment:
Awesome!
```python
noise_level = torch.cat([noise_level] * 2) if do_classifier_free_guidance else noise_level

# 6. Prepare latent variables
height, width = image.shape[2:]
```
If I understand this correctly, the latents have the same size as the input image, right? In Katherine's upscaler the low-res image was upscaled using bilinear interpolation and the latents were the size of the output image. Is this not happening here?
Ok, I was wrong. In Katherine's upscaler the latents were upscaled and provided as conditioning. Now we create latents the same size as the low-res image and the vae decodes the final result to upscale it.
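For context, a minimal sketch of the shape relationships being discussed (variable names and sizes are illustrative, not the actual pipeline code): the latents are created at the low-res image's spatial size, and the VAE decoder performs the upsampling when decoding the final result.

```python
import torch

batch, latent_channels = 1, 4
low_res = torch.randn(batch, 3, 128, 128)  # low-res conditioning image

# Latents match the *low-res* spatial size, not the output size.
height, width = low_res.shape[2:]
latents = torch.randn(batch, latent_channels, height, width)

# The VAE decoder then upsamples while decoding, so the final image
# comes out 4x larger, e.g. (1, 3, 512, 512) here.
```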
```python
unet: UNet2DConditionModel,
low_res_scheduler: DDPMScheduler,
scheduler: Union[DDIMScheduler, PNDMScheduler, LMSDiscreteScheduler],
max_noise_level: int,
```
Note that this has to have a default with the new "optional" pipeline config arguments, otherwise this breaks:

diffusers/src/diffusers/pipeline_utils.py, line 684 (at 35099b2):

```python
optional_parameters = set({k for k, v in parameters.items() if v.default is True})
```

Suggested change:

```diff
-max_noise_level: int,
+max_noise_level: int = 9
```
Not sure what a good default is here.
Overall I agree with @pcuenca's feedback that `from_pretrained` has become a bit too much of a black box / magic, but I don't really see a way around it. In general I'd strongly advise against using optional arguments in the pipeline inits, but if it makes sense here, it's fine with me!
We could maybe jump on a call in a bit to discuss this.
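For reference, the "optional pipeline config arguments" mechanism boils down to standard signature inspection: parameters with a default value are treated as optional, parameters without one as required. A minimal, self-contained illustration (not the actual `pipeline_utils` code):

```python
import inspect

def example_init(self, unet, scheduler, max_noise_level: int = 350):
    ...

params = inspect.signature(example_init).parameters

# A parameter is optional iff it carries a default value.
optional = {k for k, v in params.items() if v.default is not inspect.Parameter.empty}
required = {k for k, v in params.items() if v.default is inspect.Parameter.empty}

print(optional)  # {'max_noise_level'}
print(required)  # contains 'self', 'unet', 'scheduler'
```

This is why `max_noise_level: int,` without a default breaks here: it would not be picked up as an optional parameter.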
Aah, thanks!
Here we could default to the value used for SD2, which is 350.
Wouldn't 350 be too high for the upscaling pipeline?
Yes, but that high value is only used during training; here we set it as an indicator value that shouldn't be crossed.
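In other words, `max_noise_level` acts as a config-level ceiling on the user-supplied `noise_level`, not as a value used directly. A hedged sketch of that check (the exact error message and its placement in the real pipeline may differ):

```python
def check_noise_level(noise_level: int, max_noise_level: int = 350) -> None:
    # max_noise_level only bounds what callers may request;
    # typical inference uses a much lower noise_level (e.g. 20).
    if noise_level > max_noise_level:
        raise ValueError(
            f"`noise_level` has to be <= {max_noise_level} but is {noise_level}"
        )
```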
```python
def __call__(
    self,
    prompt: Union[str, List[str]],
    image: Union[torch.FloatTensor, PIL.Image.Image, List[PIL.Image.Image]],
```
Can the image be a latent?
No, the unet is conditioned on the low-res image itself, not on latents.
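A minimal sketch of what that conditioning looks like (shapes illustrative; the real pipeline also handles classifier-free guidance and passes `noise_level` to the unet): the low-res image is concatenated to the latents along the channel dimension at every denoising step.

```python
import torch

batch = 1
latents = torch.randn(batch, 4, 128, 128)  # denoising latents
low_res = torch.randn(batch, 3, 128, 128)  # preprocessed low-res image

# Channel-wise concat: the unet sees a 7-channel input.
latent_model_input = torch.cat([latents, low_res], dim=1)
assert latent_model_input.shape == (batch, 7, 128, 128)
```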
```python
    slice_size = self.unet.config.attention_head_dim // 2
else:
    # if `attention_head_dim` is a list, take the smallest head size
    slice_size = min(self.unet.config.attention_head_dim)
```
Should we divide by two here as well?
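For context, the two branches together choose a slice size for attention slicing; a standalone sketch of the selection logic as written (simplified, with the unet config inlined as a plain argument):

```python
from typing import List, Union

def auto_slice_size(attention_head_dim: Union[int, List[int]]) -> int:
    if isinstance(attention_head_dim, int):
        # halve the head dim to trade a little speed for memory
        return attention_head_dim // 2
    # list-valued config: take the smallest head size, currently
    # *without* halving, which is exactly what the question above asks about
    return min(attention_head_dim)
```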
```python
# Copied from diffusers.pipelines.stable_diffusion.pipeline_stable_diffusion.StableDiffusionPipeline.decode_latents with 0.18215->0.08333
def decode_latents(self, latents):
    latents = 1 / 0.08333 * latents
```
👍
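Since the "Copied from" marker says the method matches `StableDiffusionPipeline.decode_latents` with only the scaling factor changed, the full body presumably follows the standard pattern; a sketch reconstructed on that assumption:

```python
def decode_latents(self, latents):
    # 0.08333 is this model's vae scaling factor
    # (the base StableDiffusionPipeline uses 0.18215)
    latents = 1 / 0.08333 * latents
    image = self.vae.decode(latents).sample
    image = (image / 2 + 0.5).clamp(0, 1)
    # cpu, channels-last, float32 numpy, ready for PIL conversion
    image = image.cpu().permute(0, 2, 3, 1).float().numpy()
    return image
```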
patrickvonplaten left a comment:
Cool, good to merge for me!
Squashed commit summary:

* StableDiffusionUpscalePipeline
* fix a few things
* make it better
* fix image batching
* run vae in fp32
* fix docstr
* resize to mul of 64
* doc
* remove safety_checker
* add max_noise_level
* fix Copied
* begin tests
* slow tests
* default max_noise_level
* remove kwargs
* doc
* fix
* fix fast tests
* fix fast tests
* no sf
* don't offload vae

Co-authored-by: Patrick von Platen <[email protected]>
This PR adds `StableDiffusionUpscalePipeline`.
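A hedged usage sketch for the new pipeline (the checkpoint id, example image URL, and generation settings are assumptions based on the Stable Diffusion 2 release, not taken from this PR):

```python
from io import BytesIO

import requests
import torch
from PIL import Image

from diffusers import StableDiffusionUpscalePipeline

# Checkpoint id assumed here; check the released model card.
model_id = "stabilityai/stable-diffusion-x4-upscaler"
pipe = StableDiffusionUpscalePipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe = pipe.to("cuda")

# Any small RGB image works as the low-res conditioning input.
url = "https://huggingface.co/datasets/hf-internal-testing/diffusers-images/resolve/main/sd2-upscale/low_res_cat.png"
low_res_img = Image.open(BytesIO(requests.get(url).content)).convert("RGB")
low_res_img = low_res_img.resize((128, 128))

prompt = "a white cat"
upscaled = pipe(prompt=prompt, image=low_res_img, noise_level=20).images[0]
upscaled.save("upsampled_cat.png")
```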