[Core] refactor transformer_2d forward logic into meaningful conditions.#7489

Merged
sayakpaul merged 20 commits into main from transformers-2d-refactor on Apr 10, 2024
Conversation

@sayakpaul
Member

@sayakpaul sayakpaul commented Mar 27, 2024

What does this PR do?

Refactors the forward() of Transformer2DModel for easier readability.

More specifically, the PR refactors the forward() method of Transformer2DModel to have the following unified structure:

def forward(self):
    # Handle attention masking.
    ...

    # 1. Input-level operations.
    if self.is_input_continuous:
        ... = self._operate_on_continuous_inputs(...)
    elif self.is_input_vectorized:
        ... = self.latent_image_embedding(...)  # vectorized inputs only need this.
    elif self.is_input_patches:
        ... = self._operate_on_patched_inputs(...)

    # 2. Blocks (common across all the variants).
    for block in self.transformer_blocks:
        ...

    # 3. Outputs.
    if self.is_input_continuous:
        output = self._get_output_for_continuous_inputs(...)
    elif self.is_input_vectorized:
        output = self._get_output_for_vectorized_inputs(...)
    elif self.is_input_patches:
        output = self._get_output_for_patched_inputs(...)

    return output
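To make the dispatch pattern above concrete, here is a minimal, self-contained sketch. The method names mirror the PR, but this toy class is a hypothetical stand-in with placeholder math, not the actual diffusers implementation:

```python
# Toy sketch of the three-branch forward() structure from the PR.
# All helper bodies are placeholders; only the control flow matters.

class ToyTransformer2D:
    def __init__(self, input_type):
        assert input_type in {"continuous", "vectorized", "patches"}
        self.is_input_continuous = input_type == "continuous"
        self.is_input_vectorized = input_type == "vectorized"
        self.is_input_patches = input_type == "patches"

    def _operate_on_continuous_inputs(self, x):
        return [v * 2 for v in x]  # placeholder for proj_in / norm

    def latent_image_embedding(self, x):
        return [v + 1 for v in x]  # placeholder embedding lookup

    def _operate_on_patched_inputs(self, x):
        return [v - 1 for v in x]  # placeholder patch embedding

    def forward(self, x):
        # 1. Input-level operations.
        if self.is_input_continuous:
            hidden = self._operate_on_continuous_inputs(x)
        elif self.is_input_vectorized:
            hidden = self.latent_image_embedding(x)
        elif self.is_input_patches:
            hidden = self._operate_on_patched_inputs(x)

        # 2. Blocks (common across all variants).
        for _ in range(2):  # stand-in for self.transformer_blocks
            hidden = [v + 10 for v in hidden]

        # 3. Outputs (one branch per input type, elided here).
        return hidden

print(ToyTransformer2D("continuous").forward([1, 2]))  # [22, 24]
```

The point of the refactor is that only steps 1 and 3 vary by input type; the transformer blocks in step 2 are shared by all three paths.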

About the possibility of spinning out separate transformer classes for patched and vectorized inputs, what's the consensus? Currently, DiT and PixArt-Alpha use patched inputs. So, for those checkpoints, are we thinking of just updating the configs to use PatchedTransformer2DModel? (The same could be done for VectorizedTransformer2DModel.)

If that's the case, I think it could be quite a breaking change. Many folks use the DiT and PixArt models, especially in light of the many Open-Sora initiatives. Introducing a change like this could be problematic. Should we consider throwing a deprecation warning instead?

@yiyixuxu @DN6 please let me know your thoughts.

@sayakpaul sayakpaul requested review from DN6 and yiyixuxu March 27, 2024 09:50
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@sayakpaul
Member Author

@DN6 @yiyixuxu a gentle ping.

Collaborator

@DN6 DN6 left a comment


LGTM. Left a comment, but it's mostly a nit. Is the failing test related?

@sayakpaul
Member Author

@DN6 WDYT?

About the possibility of spinning out separate transformer classes for patched and vectorized inputs, what's the consensus? Currently, DiT and PixArt-Alpha use patched inputs. So, for those checkpoints, are we thinking of just updating the configs to use PatchedTransformer2DModel? (The same could be done for VectorizedTransformer2DModel.)

(refer to the OP)

@yiyixuxu
Collaborator

yiyixuxu commented Apr 8, 2024

I'm cool with this PR once you and @DN6 are happy with this!

If that's the case, I think it could be quite a breaking change. Many folks use the DiT and PixArt models, especially in light of the many Open-Sora initiatives. Introducing a change like this could be problematic. Should we consider throwing a deprecation warning instead?

For this, yes, I agree it is a highly used block, and I think we should give more thought to avoiding breaking changes.
We can potentially deprecate Transformer2DModel and use three different classes instead: ContinuousTransformer2DModel, PatchedTransformer2DModel, and VectorizedTransformer2DModel.
Just spinning out the patched and vectorized variants and sending a deprecation message for them also works, I think.

would love to hear your thoughts @DN6
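One lightweight shape the deprecation path could take, sketched with the standard warnings module. The three class names are the hypothetical ones from this discussion (not a committed API), and the config keys used for routing (patch_size, num_vector_embeds) are assumptions based on how the input type is currently inferred:

```python
import warnings

# Hypothetical dedicated classes, one per input type (names from the discussion).
class ContinuousTransformer2DModel:
    def __init__(self, **config): self.config = config

class PatchedTransformer2DModel:
    def __init__(self, **config): self.config = config

class VectorizedTransformer2DModel:
    def __init__(self, **config): self.config = config

def Transformer2DModel(**config):
    """Legacy factory: warn, then route to the new class based on the config."""
    warnings.warn(
        "Transformer2DModel is deprecated; use ContinuousTransformer2DModel, "
        "PatchedTransformer2DModel, or VectorizedTransformer2DModel directly.",
        FutureWarning,
        stacklevel=2,
    )
    if config.get("patch_size") is not None:
        return PatchedTransformer2DModel(**config)
    if config.get("num_vector_embeds") is not None:
        return VectorizedTransformer2DModel(**config)
    return ContinuousTransformer2DModel(**config)
```

This keeps old configs loadable (the legacy name still resolves to a working model) while nudging users toward the split classes before the old entry point is removed.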

@sayakpaul
Member Author

Cool. Will work on the deprecation in a future PR then as we discuss the best design.

@sayakpaul
Member Author

@DN6 I had to revert your suggestion because height and width are calculated differently for continuous and patched inputs. Hope that's okay.

Gonna merge the PR after the CI is green.
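The height/width difference mentioned above can be illustrated with a small sketch. The shapes and the patch-grid division reflect the general DiT-style patching scheme; the helper name and the patch_size value are illustrative, not code from the PR:

```python
# Illustrative only: why the spatial dims differ between the two input paths.

def spatial_dims(shape, is_patched, patch_size=2):
    """Return the (height, width) the transformer works at for a (B, C, H, W) input."""
    h, w = shape[-2], shape[-1]
    if is_patched:
        # Patched inputs are tokenized into (H // p) * (W // p) patches,
        # so the transformer operates at the patch-grid resolution.
        return h // patch_size, w // patch_size
    # Continuous inputs keep the full latent resolution.
    return h, w

print(spatial_dims((1, 4, 32, 32), is_patched=False))  # (32, 32)
print(spatial_dims((1, 4, 32, 32), is_patched=True))   # (16, 16)
```

Because the two paths disagree on these values, a single shared computation (as the reverted suggestion proposed) would break one of them.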

@sayakpaul sayakpaul merged commit 44f6b85 into main Apr 10, 2024
@sayakpaul sayakpaul deleted the transformers-2d-refactor branch April 10, 2024 03:03
sayakpaul added a commit that referenced this pull request Dec 23, 2024
…ions. (#7489)

* refactor transformer_2d forward logic into meaningful conditions.

* Empty-Commit

* fix: _operate_on_patched_inputs

* fix: _operate_on_patched_inputs

* check

* fix: patch output computation block.

* fix: _operate_on_patched_inputs.

* remove print.

* move operations to blocks.

* more readability neats.

* empty commit

* Apply suggestions from code review

Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Revert "Apply suggestions from code review"

This reverts commit 12178b1.

---------

Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>