taited / sgdiff Goto Github PK

View Code? Open in Web Editor NEW

30.0 5.0 3.0 32.13 MB

Official implementation of SGDiff (ACM MM '23)

Home Page: https://taited.github.io/sgdiff-project

License: Apache License 2.0

Python 41.18% Jupyter Notebook 58.80% Dockerfile 0.01% Shell 0.02%

diffusion fashion glide sgdiff multimedia style style-transfer

sgdiff's People

Contributors

Stargazers

Watchers

Forkers

ishow520 ming-zch forever2101

sgdiff's Issues

Looking forward to being able to release the training code soon,thanks

Dataset

Thanks for the great work! When will the dataset be available?

Implementation of perceptual loss

Thank you for an outstanding job！

When do you plan to release the training code? In particular, does the perceptual loss in the paper use StableDiffusionPipeline to obtain the generated image after each noise estimation?

Looking forward for your reply.

training code?

Hello, I am very interested in your work. When can you release training code?

Results are not good

i use this picture

and use Vincent van Gogh’s Starry Night as text prompt
the result is not as good as expected

How to do multi head attention, but the shape of q, k, and v is different?

The SCA moudle adopt semantic features form clip and text features form clip, the Q is only from text, K and V are added by text and image. And your picture in the paper shows that the Q, K, V have different shape, so how to do Q@K? Is this is your SCA code?

def forward(self, img, text_emb): if self.clip_norm: img = (img + 1) / 2 img = F.batch_norm(img, self.mean.to(self.device), self.std.to(self.device)) img = F.interpolate(img, (224, 224)) image_features = self.model(img) if self.last_layer_proj: image_features = torch.einsum('bld,ds->bls', image_features, self.model.proj) if self.cross_attn is not None: emb_features = self.cross_attn( text_emb.permute(0, 2, 1), image_features.permute(0, 2, 1)).permute(0, 2, 1) if self.skip_module is not None: if self.learned_length: residual = self.skip_module(emb_features.permute(0, 2, 1)) residual = residual.permute(0, 2, 1) else: residual = self.skip_module(emb_features) text_emb += residual return text_emb

taited / sgdiff Goto Github PK

sgdiff's People

Contributors

Stargazers

Watchers

Forkers

sgdiff's Issues

Looking forward to being able to release the training code soon,thanks

Dataset

Implementation of perceptual loss

training code?

Results are not good

How to do multi head attention, but the shape of q, k, and v is different?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs