Add StableDiffusionXLControlNetPAGImg2ImgPipeline by satani99 · Pull Request #8990 · huggingface/diffusers

satani99 · 2024-07-26T13:25:00Z

What does this PR do?

fix #8700

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@yiyixuxu

satani99 · 2024-07-26T13:30:38Z

Generation code

import torch
import numpy as np
from PIL import Image 

from transformers import DPTFeatureExtractor, DPTForDepthEstimation
from diffusers import ControlNetModel, AutoencoderKL, AutoPipelineForImage2Image
from diffusers.utils import load_image

depth_estimator = DPTForDepthEstimation.from_pretrained("Intel/dpt-hybrid-midas").to("cuda")
feature_extractor = DPTFeatureExtractor.from_pretrained("Intel/dpt-hybrid-midas")
controlnet = ControlNetModel.from_pretrained(
        "diffusers/controlnet-depth-sdxl-1.0-small",
        variant="fp16",
        use_safetensors="True",
        torch_dtype=torch.float16,
        )
vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16)
pipe = AutoPipelineForImage2Image.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0",
        controlnet=controlnet,
        vae=vae,
        variant="fp16",
        use_safetensors=True,
        torch_dtype=torch.float16,
        enable_pag=True,
        )
pipe.enable_model_cpu_offload()

def get_depth_map(image):
   image = feature_extractor(images=image, return_tensors="pt").pixel_values.to("cuda")
   with torch.no_grad(), torch.autocast("cuda"):
       depth_map = depth_estimator(image).predicted_depth

   depth_map = torch.nn.functional.interpolate(
        depth_map.unsqueeze(1),
        size=(1024, 1024),
        mode="bicubic",
        align_corners=False,
    )
   depth_min = torch.amin(depth_map, dim=[1, 2, 3], keepdim=True)
   depth_max = torch.amax(depth_map, dim=[1, 2, 3], keepdim=True)
   depth_map = (depth_map - depth_min) / (depth_max - depth_min)
   image = torch.cat([depth_map] * 3, dim=1)
   image = image.permute(0, 2, 3, 1).cpu().numpy()[0]
   image = Image.fromarray((image * 255.0).clip(0, 255).astype(np.uint8))
   return image



prompt = "A robot, 4k photo"
image = load_image(
        "https://huggingface.co/datasets/hf-internal-testing/diffusers-images/resolve/main"
        "/kandinsky/cat.png"
        ).resize((1024, 1024))

controlnet_conditioning_scale = 0.5 
depth_image = get_depth_map(image)

images = pipe(
        prompt,
        image=image,
        control_image=depth_image,
        strength=0.99,
        num_inference_steps=50,
        controlnet_conditioning_scale=controlnet_conditioning_scale,
        ).images
images[0].save(f"robot_cat.png")

It works with enable_pag=False but gives error when enable_pag=True.

Error: AttributeError: 'Image' object has no attribute 'shape'. Did you mean: 'save'?

satani99 · 2024-07-26T13:33:04Z

Any help would be nice. Thanks

yiyixuxu · 2024-07-27T01:36:56Z

can you share the full stack trace?

satani99 · 2024-07-27T05:24:52Z

File "/home/nikhil/Desktop/pag.py", line 72, in <module> images = pipe( File "/home/nikhil/miniconda3/envs/pag/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context return func(*args, **kwargs) File "/home/nikhil/Desktop/diffusers/src/diffusers/pipelines/pag/pipeline_pag_controlnet_sd_xl_img2img.py", line 1422, in __call__ height, width = control_image.shape[-2:] AttributeError: 'Image' object has no attribute 'shape'. Did you mean: 'save'?
pag.py is the above script.

src/diffusers/pipelines/pag/pipeline_pag_controlnet_sd_xl_img2img.py

…img.py Co-authored-by: YiYi Xu <yixu310@gmail.com>

satani99 · 2024-07-29T09:37:07Z

hi @yiyixuxu can you review this?

yiyixuxu · 2024-08-19T00:25:47Z

sorry this PR got lost too
could you resolve the conflicts?

src/diffusers/pipelines/controlnet/pipeline_controlnet_sd_xl_img2img.py

…mg2img.py

HuggingFaceDocBuilderDev · 2024-08-21T00:04:31Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yiyixuxu · 2024-08-21T07:10:27Z

@asomoza
I tested it, and it works now
do you want to check if it works as expected for you? (no worries if you don't have time)

asomoza · 2024-08-21T08:03:16Z

@yiyixuxu Tested it and seems ok, it's harder to see the difference here because the base image helps a lot even without PAG, but it still works similar to the other ones.

w/o pag	with pag

yiyixuxu · 2024-08-21T17:24:44Z

@satani99 thank you!

* Added pad controlnet sdxl img2img pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>

satani99 added 9 commits July 8, 2024 19:56

Added pad controlnet sdxl img2img pipeline

baf06ba

Added pag controlnet sdxl img2img pipeline

4ab58b1

Added pag controlnet sdxl img2img pipeline

b5144af

Added pag controlnet sdxl img2img pipeline

06b1af0

Added pag controlnet sdxl img2img pipeline

9c12878

Added test for controlnet pag sdxl img2img pipeline

60ab2b5

Added test pag controlnet sdxl img2img pipeline

6f21c3e

Added test pag controlnet sdxl img2img pipeline

f7a6ee2

Added test pag controlnet sdxl img2img pipeline

43d0719

Update __init__.py

73fab4c

yiyixuxu reviewed Jul 27, 2024

View reviewed changes

src/diffusers/pipelines/pag/pipeline_pag_controlnet_sd_xl_img2img.py Outdated Show resolved Hide resolved

Update src/diffusers/pipelines/pag/pipeline_pag_controlnet_sd_xl_img2…

dcd19f4

…img.py Co-authored-by: YiYi Xu <yixu310@gmail.com>

satani99 added 2 commits August 8, 2024 14:06

Updated

7698f0d

Updated

2ef0cde

Merge branch 'main' into sdxl_pag

8062870

yiyixuxu reviewed Aug 20, 2024

View reviewed changes

src/diffusers/pipelines/controlnet/pipeline_controlnet_sd_xl_img2img.py Outdated Show resolved Hide resolved

Update src/diffusers/pipelines/controlnet/pipeline_controlnet_sd_xl_i…

111090a

…mg2img.py

yiyixuxu added 2 commits August 21, 2024 03:58

style

574f1bb

copies

025c4e6

yiyixuxu mentioned this pull request Aug 21, 2024

fix a regression in is_safetensors_compatible #9234

Merged

fix

813fbd6

fix tests

a62e72e

yiyixuxu merged commit 9003d75 into huggingface:main Aug 21, 2024

satani99 deleted the sdxl_pag branch August 21, 2024 17:37

yiyixuxu added the PAG label Sep 4, 2024

sayakpaul pushed a commit that referenced this pull request Dec 23, 2024

Add StableDiffusionXLControlNetPAGImg2ImgPipeline (#8990)

8a17331

* Added pad controlnet sdxl img2img pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add StableDiffusionXLControlNetPAGImg2ImgPipeline#8990

Add StableDiffusionXLControlNetPAGImg2ImgPipeline#8990
yiyixuxu merged 19 commits intohuggingface:mainfrom
satani99:sdxl_pag

satani99 commented Jul 26, 2024 •

edited by yiyixuxu

Loading

Uh oh!

satani99 commented Jul 26, 2024

Uh oh!

satani99 commented Jul 26, 2024

Uh oh!

yiyixuxu commented Jul 27, 2024

Uh oh!

satani99 commented Jul 27, 2024

Uh oh!

Uh oh!

satani99 commented Jul 29, 2024

Uh oh!

yiyixuxu commented Aug 19, 2024

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Aug 21, 2024

Uh oh!

yiyixuxu commented Aug 21, 2024

Uh oh!

asomoza commented Aug 21, 2024

Uh oh!

yiyixuxu commented Aug 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

satani99 commented Jul 26, 2024 • edited by yiyixuxu Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

satani99 commented Jul 26, 2024

Uh oh!

satani99 commented Jul 26, 2024

Uh oh!

yiyixuxu commented Jul 27, 2024

Uh oh!

satani99 commented Jul 27, 2024

Uh oh!

Uh oh!

satani99 commented Jul 29, 2024

Uh oh!

yiyixuxu commented Aug 19, 2024

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Aug 21, 2024

Uh oh!

yiyixuxu commented Aug 21, 2024

Uh oh!

asomoza commented Aug 21, 2024

Uh oh!

yiyixuxu commented Aug 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

satani99 commented Jul 26, 2024 •

edited by yiyixuxu

Loading