
hila-chefer / targetclip

233 stars · 27 forks · 44.57 MB

[ECCV 2022] Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.

Python 7.33% Jupyter Notebook 92.43% C++ 0.03% Cuda 0.21%
clip computer-graphics eccv2022 image-editing image-generation image-manipulation stylegan2

targetclip's People

Contributors

hila-chefer


targetclip's Issues

amazing work

Let me congratulate you on this incredible job. Unfortunately, I don't have the computing power to train targets myself. Could you add some celebrity targets, such as Megan Fox and Emilia Clarke, and keep adding new ones for people who don't have a GPU, whenever you have free time?

Problem with finding directions

I ran this command to reproduce your results:
python3 optimization/find_dirs.py --target_path dirs/targets/avatar.jpg --dir_name results_folder_avatar --weight_decay 3e-3 --lambda_consistency 0.6 --step 1000 --lr 0.2 --num_directions 8 --num_images 8
But I got strange results, as shown below. Am I doing something wrong?
[attached image: sample generated results]

encoding method

I wonder how you encode the target or source images into the latent space. I usually use e4e or Image2StyleGAN; which one did you use?

Role of coefficients

Hello again,

In these marked lines you initialize a set of coefficients to optimize over. As far as I can see, these are not mentioned in the paper. The coefficients are multiplied by the direction for each source image, so I take it you want to optimize a different scale of the direction vector per source vector. I have some questions about this:

  1. Did you try it without these coefficients?
  2. To what values do the coefficients converge? Do they stay close to 1?
  3. You re-initialize the Adam optimizer for the coefficients at every step of the optimization, which drastically changes the optimizer's behavior. Is this intended, or is the initialization misplaced? If it is intended, what is it used for? (See the sketch after the snippet below.)

Thanks again for your work! I hope I am not too picky on this - I'm just curious about the topic of semantics in these latent spaces :-)

coefficients = [None] * NUM_IMAGES
for n in range(NUM_IMAGES):
    # one learnable scale per source image, initialized to 1
    coefficient = torch.ones(1).to("cuda")
    coefficient.requires_grad = True
    coefficients[n] = coefficient
opt_loss = torch.Tensor([float("Inf")]).to("cuda")
pbar = tqdm(range(args.step))
for i in pbar:
    # calculate learning rate
    t = i / args.step
    lr = get_lr(t, args.lr)
    optimizer.param_groups[0]["lr"] = lr
    # note: a new Adam instance for the coefficients is created on every step
    optimizer_coeffs = optim.Adam(coefficients, lr=args.lr, weight_decay=0.01)
    loss = torch.zeros(1).cuda()
    target_semantic = torch.zeros(1).cuda()
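
For reference, here is a minimal, self-contained toy of the alternative raised in question 3 (random tensors stand in for the real latents and CLIP targets; nothing here comes from the repository). The Adam instance is created once, outside the loop, so its moment estimates persist across steps:

import torch
from torch import optim

NUM_IMAGES = 8
# one learnable scale per source image, as in the snippet above
coefficients = [torch.ones(1, requires_grad=True) for _ in range(NUM_IMAGES)]
direction = torch.randn(512)            # placeholder for the shared direction
targets = torch.randn(NUM_IMAGES, 512)  # placeholder optimization targets

# created once, before the loop, so momentum / second-moment state is kept
optimizer_coeffs = optim.Adam(coefficients, lr=0.2, weight_decay=0.01)
for step in range(1000):
    scaled = torch.stack([c * direction for c in coefficients])
    loss = ((scaled - targets) ** 2).mean()
    optimizer_coeffs.zero_grad()
    loss.backward()
    optimizer_coeffs.step()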

ModuleNotFoundError bug colab

Steps to reproduce:

!git clone  https://github.com/hila-chefer/TargetCLIP.git

%cd /content/TargetCLIP

!python /content/TargetCLIP/optimization/find_dirs.py

Traceback (most recent call last):
  File "/content/TargetCLIP/optimization/find_dirs.py", line 9, in <module>
    from criteria.clip_loss import CLIPLoss
ModuleNotFoundError: No module named 'criteria'

but the criteria package is in the repository's root directory
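
A possible workaround (my assumption; not confirmed in the thread): when the script is launched by its absolute path, Python puts optimization/ on sys.path rather than the repository root, so the criteria package cannot be found. Adding the repository root to PYTHONPATH should make the import work:

%cd /content/TargetCLIP

!PYTHONPATH=/content/TargetCLIP python optimization/find_dirs.py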

Efficiency: no recalculation of original latents needed

with torch.no_grad():
    # re-generates the images for the original (unedited) latents and re-encodes them with CLIP
    img_gen, _ = g_ema([latents], input_is_latent=True, randomize_noise=False)
    image_gen_clip = clip_loss.module.encode(img_gen)

In these lines, the images (and their CLIP embeddings) for the original latents are recomputed on every iteration of the nested loop. The original latents stay constant, so this can be done once, outside the loops; only the edited latents (latents + direction) need to be re-generated and re-encoded inside.
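
A rough sketch of the suggested restructuring (assuming g_ema, clip_loss, latents, direction, coefficients, pbar, and NUM_IMAGES are the objects used in optimization/find_dirs.py; the names img_orig, orig_clip, edited, and gen_clip are mine):

# encode the unedited source images once, before the optimization loop
with torch.no_grad():
    img_orig, _ = g_ema([latents], input_is_latent=True, randomize_noise=False)
    orig_clip = clip_loss.module.encode(img_orig)  # constant across iterations

for i in pbar:
    for n in range(NUM_IMAGES):
        # only the edited latents change, so only they are re-generated and re-encoded
        edited = latents[n : n + 1] + coefficients[n] * direction
        img_gen, _ = g_ema([edited], input_is_latent=True, randomize_noise=False)
        gen_clip = clip_loss.module.encode(img_gen)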
