One of the stated design decisions from the readme was to support arbitrary modalities


New modalities about diffusers HOT 10 CLOSED

huggingface commented on May 21, 2024 1

New modalities

from diffusers.

Comments (10)

richardrl commented on May 21, 2024 3

@anton-l @patrickvonplaten Thanks for your input thus far.

I took the latest commit (as of this moment) and made a minimum reproduction of a 1D MLP model and training.

I had to make an additional modification to pipeline_ddpm.py to support noise samples of the right shape.

bimodal_testt.zip

    python3 train_unconditional_gaussian_test.py

This runs a test on a bimodal Gaussian distribution centered at +33 and -33 with low variance.

It seems not to capture the -33 mode after an epoch or two. I am running the training overnight to see what happens.

You're welcome to try running this to see if there's anything I did wrong.
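For anyone reproducing this without the attachment, here is a minimal sketch of the kind of target distribution described above, plus a check for whether both modes are represented. The function names, the standard deviation of 0.1, and the `(batch, 1)` layout are assumptions for illustration and are not taken from the attached script.

```python
import numpy as np

def sample_bimodal(n, center=33.0, std=0.1, seed=0):
    """Draw n scalars from an equal mixture of N(+center, std^2) and N(-center, std^2).

    `std=0.1` is an assumed stand-in for "low variance".
    """
    rng = np.random.default_rng(seed)
    signs = rng.choice([-1.0, 1.0], size=n)  # pick one mode per sample
    samples = signs * center + std * rng.standard_normal(n)
    return samples.reshape(n, 1)  # (batch, features), as a 1D MLP would expect

def mode_fractions(samples):
    """Return the fraction of samples on the positive and negative side of zero."""
    positive = float(np.mean(samples > 0))
    return positive, 1.0 - positive

data = sample_bimodal(10_000)
pos, neg = mode_fractions(data)
print(f"+mode: {pos:.2f}, -mode: {neg:.2f}")
```

Running `mode_fractions` on samples drawn from the trained pipeline would give a quick number for how badly the -33 mode is under-represented.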

patrickvonplaten commented on May 21, 2024 2

Taking a look at the function as is:

    def training_step(self, original_samples: torch.Tensor, noise: torch.Tensor, timesteps: torch.Tensor):
        if timesteps.dim() != 1:
            raise ValueError("`timesteps` must be a 1D tensor")

        device = original_samples.device
        batch_size = original_samples.shape[0]
        timesteps = timesteps.reshape(batch_size, 1, 1, 1)

        sqrt_alpha_prod = self.alphas_cumprod[timesteps] ** 0.5
        sqrt_one_minus_alpha_prod = (1 - self.alphas_cumprod[timesteps]) ** 0.5
        noisy_samples = sqrt_alpha_prod.to(device) * original_samples + sqrt_one_minus_alpha_prod.to(device) * noise
        return noisy_samples

Note that the input can be both torch and numpy tensors -> this should be changed.

Also, there shouldn't be any .to(device) statements, nor any framework- and modality-specific .reshape(...) operations.

I'd be in favor of implementing framework-specific functions (one for PT, one for TF) called def extract(....) in SchedulerMixin that have if framework == "pt" statements in them. Also note that we shouldn't assume we know the dimension of the input original_samples.
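To make the proposal concrete, here is a rough sketch of what such a dispatching helper could look like. Only the name `extract` and the `framework == "pt"` check come from the comment above; the signature, the `"np"` branch, and the error handling are assumptions, and this is not the library's code.

```python
import numpy as np

def extract(values, timesteps, framework="np"):
    """Gather values[timesteps] and return one coefficient per batch element.

    Hypothetical helper: the signature and the `framework` flag are
    assumptions made for illustration.
    """
    if framework == "np":
        return np.asarray(values)[np.asarray(timesteps)]
    if framework == "pt":
        import torch  # imported lazily so the NumPy path has no torch dependency
        values = torch.as_tensor(values)
        timesteps = torch.as_tensor(timesteps, dtype=torch.long, device=values.device)
        return values[timesteps]
    raise ValueError(f"unsupported framework: {framework!r}")

alphas_cumprod = np.array([0.9, 0.5, 0.1])
coeffs = extract(alphas_cumprod, np.array([2, 0]))  # one value per timestep
```

The point of the dispatch is that the scheduler math itself never touches devices or framework-specific calls; only this one helper does.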

patrickvonplaten commented on May 21, 2024 1

@anton-l - we need to make sure that training_step is both framework agnostic and shape agnostic

patrickvonplaten commented on May 21, 2024 1

BTW, it's super nice to get all your feedback here @richardrl - thanks a lot!

patil-suraj commented on May 21, 2024

Thanks for reporting @richardrl !
Indeed the plan is to support multiple modalities, but we haven't yet tested the schedulers with 1D data.

cc @patrickvonplaten @anton-l

anton-l commented on May 21, 2024

Now we use match_shape(timesteps, original_samples) for everything, which is framework- and shape-agnostic: https://github.com/huggingface/diffusers/blob/main/src/diffusers/schedulers/scheduling_ddpm.py#L146
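For readers who don't want to follow the link, the idea behind a shape-matching helper can be sketched as below. This is an illustration of the technique (appending trailing singleton axes so per-sample coefficients broadcast against samples of any rank), not a copy of the linked code; `match_shape_sketch` and `add_noise` are hypothetical names. The sketch uses NumPy, but `reshape(-1)`, `.ndim`, and `values[..., None]` are also accepted by PyTorch tensors.

```python
import numpy as np

def match_shape_sketch(values, broadcast_array):
    """Append trailing singleton axes to `values` until its rank matches `broadcast_array`."""
    values = values.reshape(-1)  # one entry per batch element
    while values.ndim < broadcast_array.ndim:
        values = values[..., None]
    return values

def add_noise(alphas_cumprod, original_samples, noise, timesteps):
    """Forward-diffuse samples of shape (batch, ...) without assuming 4D inputs."""
    sqrt_alpha_prod = match_shape_sketch(alphas_cumprod[timesteps] ** 0.5, original_samples)
    sqrt_one_minus_alpha_prod = match_shape_sketch(
        (1 - alphas_cumprod[timesteps]) ** 0.5, original_samples
    )
    return sqrt_alpha_prod * original_samples + sqrt_one_minus_alpha_prod * noise

alphas_cumprod = np.array([1.0, 0.25])
timesteps = np.array([0, 1])

# 1D data, shape (batch, 1)
x_1d = np.ones((2, 1))
noisy_1d = add_noise(alphas_cumprod, x_1d, np.zeros_like(x_1d), timesteps)

# Image-like data, shape (batch, channels, height, width)
x_img = np.ones((2, 3, 8, 8))
noisy_img = add_noise(alphas_cumprod, x_img, np.zeros_like(x_img), timesteps)
```

The same function handles both inputs, which is what removes the need for the hard-coded `reshape(batch_size, 1, 1, 1)` discussed earlier in the thread.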

github-actions commented on May 21, 2024

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

patrickvonplaten commented on May 21, 2024

@anton-l as you've re-opened the issue -> are you planning on doing something with it?

github-actions commented on May 21, 2024

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

anton-l commented on May 21, 2024

I think we have support for all shapes now, agreed with the stalebot :)