Improve SD XL by patrickvonplaten · Pull Request #3968 · huggingface/diffusers

patrickvonplaten · 2023-07-06T14:58:44Z

What does this PR do?

This PR makes sure diffusers Stable Diffusion XL can generate images of any size. We also make sure that both single file format and diffusers format can be loaded.

Diffusers format:

from diffusers import StableDiffusionXLPipeline, StableDiffusionXLImg2ImgPipeline
import torch

use_refiner = True

pipe = StableDiffusionXLPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-0.9", torch_dtype=torch.float16, variant="fp16", use_safetensors=True)
pipe.to("cuda")

if use_refiner:
    refiner = StableDiffusionXLImg2ImgPipeline.from_pretrained("stabilityai/stable-diffusion-xl-refiner-0.9", torch_dtype=torch.float16, use_safetensors=True, variant="fp16")
    refiner.to("cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt=prompt, output_type="latent" if use_refiner else "pil").images[0]

if use_refiner:
    image = refiner(prompt=prompt, image=image[None, :]).images[0]

Single File Format:

from diffusers import StableDiffusionXLPipeline, StableDiffusionXLImg2ImgPipeline
import torch

use_refiner = True

pipe = StableDiffusionXLPipeline.from_single_file("https://huggingface.co/stabilityai/stable-diffusion-xl-base-0.9/blob/main/sd_xl_base_0.9.safetensors", torch_dtype=torch.float16, use_safetensors=True)
pipe.to("cuda")

if use_refiner:
    refiner = StableDiffusionXLImg2ImgPipeline.from_single_file("https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-0.9/blob/main/sd_xl_refiner_0.9.safetensors", torch_dtype=torch.float16, use_safetensors=True)
    refiner.to("cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt=prompt, output_type="latent" if use_refiner else "pil").images[0]

if use_refiner:
    image = refiner(prompt=prompt, image=image[None, :]).images[0]

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

bghira · 2023-07-06T15:18:55Z

src/diffusers/pipelines/stable_diffusion/convert_from_ckpt.py

+            steps_offset=1,
+            timestep_spacing="leading",
+        )
+        scheduler = EulerDiscreteScheduler.from_config(scheduler_dict)


can we pass a scheduler in? i would like to be able to convert it and attach DDIM, as that's a better sampler for SDXL even though it's divergent from their upstream impl.

I like the idea of being able to customize the scheduler. I like Euler, but I usually customize the settings of the scheduler.

bghira · 2023-07-06T15:19:33Z

src/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl_img2img.py

-        original_size: Tuple[int, int] = (1024, 1024),
+        original_size: Tuple[int, int] = None,
        crops_coords_top_left: Tuple[int, int] = (0, 0),
-        target_size: Tuple[int, int] = (1024, 1024),
+        target_size: Tuple[int, int] = None,


why is this now defaulting to None? can you explain a bit?

It will then later be set to the passed height and width this makes sure we generate correct outputs for different input sizes than 1024

bghira

thank you for fixing the CKPT converter.

into improve_sd_xl

Mark-divinci · 2023-07-26T03:38:20Z

i want to increase the batch_size to get larger throughput,but i got linear growth of latency.if i run some incorrect settings?

* improve sd xl * correct more * finish * make style * fix more

improve sd xl

3112b44

bghira reviewed Jul 6, 2023

View reviewed changes

patrickvonplaten added 2 commits July 6, 2023 15:28

correct more

b17e339

finish

77ae492

bghira approved these changes Jul 6, 2023

View reviewed changes

patrickvonplaten added 3 commits July 6, 2023 17:51

Merge branch 'main' into improve_sd_xl

bc6d4a4

make style

49d2bf1

Merge branch 'improve_sd_xl' of https://github.com/huggingface/diffusers

de13bda

into improve_sd_xl

patrickvonplaten changed the title ~~improve sd xl~~ Improve SD XL Jul 6, 2023

fix more

ba78204

patrickvonplaten merged commit 187ea53 into main Jul 6, 2023

patrickvonplaten deleted the improve_sd_xl branch July 6, 2023 16:11

patrickvonplaten mentioned this pull request Jul 6, 2023

from_ckpt is a bad name #3946

Closed

sanbuphy mentioned this pull request Jul 7, 2023

Can diffuser load local safetensors files? #3390

Closed

yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023

Improve SD XL (huggingface#3968)

0045640

* improve sd xl * correct more * finish * make style * fix more

AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024

Improve SD XL (huggingface#3968)

e3bcdf1

* improve sd xl * correct more * finish * make style * fix more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve SD XL#3968

Improve SD XL#3968
patrickvonplaten merged 7 commits intomainfrom
improve_sd_xl

patrickvonplaten commented Jul 6, 2023 •

edited

Loading

Uh oh!

bghira Jul 6, 2023

Uh oh!

JemiloII Jul 8, 2023

Uh oh!

bghira Jul 6, 2023

Uh oh!

patrickvonplaten Jul 6, 2023

Uh oh!

bghira left a comment

Uh oh!

Mark-divinci commented Jul 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

patrickvonplaten commented Jul 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

bghira Jul 6, 2023

Choose a reason for hiding this comment

Uh oh!

JemiloII Jul 8, 2023

Choose a reason for hiding this comment

Uh oh!

bghira Jul 6, 2023

Choose a reason for hiding this comment

Uh oh!

patrickvonplaten Jul 6, 2023

Choose a reason for hiding this comment

Uh oh!

bghira left a comment

Choose a reason for hiding this comment

Uh oh!

Mark-divinci commented Jul 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

patrickvonplaten commented Jul 6, 2023 •

edited

Loading