[SDXL ControlNet Training] Follow-up fixes#4188
Conversation
The diff under discussion:

```python
# fingerprint used by the cache for the other processes to load the result
# details: https://github.com/huggingface/diffusers/pull/4038#discussion_r1266078401
new_fingerprint = Hasher.hash(args)
```
Are the args actually going to be good enough to create a hash? Multiple runs of the script might have the same args. Is that okay? I'm not sure; I haven't thought it through well enough.
Ideally, we can get the PID of the parent process if the parent process is accelerate and hash that. If the parent process is not accelerate, we don't have to pass any additional fingerprint.
> Multiple runs of the script might have the same args. Is that okay? I'm not sure; I haven't thought it through well enough.
If that is the case, we would want to avoid executing the map fn and instead load from the cache, no? Or would there be undesired consequences to that?
In any case, coming to your suggestion:
> Ideally, we can get the PID of the parent process if the parent process is accelerate and hash that. If the parent process is not accelerate, we don't have to pass any additional fingerprint.
Are you thinking of something like:
```python
with accelerator.main_process_first():
    from datasets.fingerprint import Hasher

    # fingerprint used by the cache for the other processes to load the result
    # details: https://github.com/huggingface/diffusers/pull/4038#discussion_r1266078401
    if accelerator.is_main_process:
        pid = os.getpid()
        new_fingerprint = Hasher.hash(pid)
    train_dataset = train_dataset.map(
        compute_embeddings_fn, batched=True, new_fingerprint=new_fingerprint
    )
```
Up to you whether it's okay to reload from the cache when calling map; I'm not as familiar with the script :) If it is okay, a comment in the code would be nice explaining under what circumstances that happens and why.
I'm not familiar with accelerate's forking model, i.e. whether one of the scripts themselves ends up being the parent process, or whether there's a separate accelerate launcher that forks into the children. If you need the parent PID (again, depending on who actually gets forked), you would call `os.getppid`.
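The parent-PID idea can be sketched with just the standard library. This is a hypothetical illustration, not the PR's code: the script itself uses datasets' `Hasher`, and whether `os.getppid()` actually returns the accelerate launcher depends on how the worker processes are spawned.

```python
import hashlib
import os


def run_fingerprint() -> str:
    """Derive a per-run fingerprint from the parent PID.

    Hypothetical sketch: if all ranks are forked by the same launcher
    process, os.getppid() is identical across ranks within one run, so
    every process computes the same fingerprint and resolves to the
    same cache entry, while a fresh run gets a fresh fingerprint.
    """
    parent_pid = os.getppid()
    return hashlib.sha1(str(parent_pid).encode()).hexdigest()[:16]


fp = run_fingerprint()
```

Within a single process the fingerprint is stable; a new launcher invocation would normally yield a different parent PID and hence a different fingerprint.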
@lhoestq WDYT? IMO, it should be okay to reload from the cache here.
> If that is the case, we would want to avoid executing the map fn and instead load from the cache, no? Or would there be undesired consequences to that?
> If that is the case, we would want to avoid executing the map fn and instead load from the cache, no? Or would there be undesired consequences to that?

Since the args are the same, it will reload from the cache in subsequent runs instead of reprocessing the data :)
The resulting dataset doesn't depend on whether accelerate is used, no?
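A minimal, stdlib-only illustration of why identical args lead to a cache hit. This is an analogy, not the library's implementation: `datasets.fingerprint.Hasher` uses its own hashing scheme, but the property that matters is the same — a deterministic function of the args.

```python
import hashlib
import json


def fingerprint_from_args(args: dict) -> str:
    # Deterministic serialization: equal args always produce the same
    # fingerprint, so a rerun with identical args resolves to the same
    # cached map() result instead of reprocessing the data.
    blob = json.dumps(args, sort_keys=True).encode()
    return hashlib.sha1(blob).hexdigest()[:16]


a = fingerprint_from_args({"resolution": 1024, "proportion_empty_prompts": 0.0})
b = fingerprint_from_args({"proportion_empty_prompts": 0.0, "resolution": 1024})
assert a == b  # key order doesn't matter; the fingerprint is stable across runs
```

This is also why hashing the PID was proposed: it injects a per-run value into the fingerprint when you *don't* want two runs with identical args to share a cache entry.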
* hash computation, thanks to @lhoestq
* disable dtype casting
* remove comments
Fixes the issues from #4038.
Related to: #4089