Hello, Thank you for the amazing and insightful work! I was trying t

Training checkpoints are provided <a href="https://drive.google.com/drive/folders/1Nne

Thank you <a class="user-mention notranslate" data-hovercard-type="user" data-hovercar

Thank you so much for your quick response <a class="user-mention notranslate" data-hov

Structure of the MI-GAN inference model different from the one used for training? about mi-gan HOT 7 CLOSED

anirao26 commented on June 7, 2024

Structure of the MI-GAN inference model different from the one used for training?

from mi-gan.

Comments (7)

AndranikSargsyan commented on June 7, 2024

Hello. During training the model size is bigger, because our method uses a model reparameterization trick as described in the section 3.3 of the paper. Once the model training is finished you need to fuse the weights in order to decrease the model size for inference. For fusion, you can use the scripts/export_inference_model.py script. You can find the details of how to use that script at the end of README's Training section.

from mi-gan.

anirao26 commented on June 7, 2024

Thank you so much for the clarification. I wanted to fine tune the model instead of training the whole model again. The layers don't match exactly so I was wondering the best way to do it?

from mi-gan.

AndranikSargsyan commented on June 7, 2024

Training checkpoints are provided in this Google Drive folder. You can download the corresponding (unfused) training checkpoint from there and use it for fine-tuning.

from mi-gan.

anirao26 commented on June 7, 2024

Thank you @AndranikSargsyan! One more question, in the code, can you tell me what the G_ema variable signifies? I thought the G variable was the MI-GAN generator model and D is the discriminator.

from mi-gan.

AndranikSargsyan commented on June 7, 2024

G_ema keeps track of exponential moving average (ema) of the generator weights during training. Exponential moving averaging of the generator weights is a popular strategy for improving generator performance.

from mi-gan.

anirao26 commented on June 7, 2024

Thank you so much for your quick response @AndranikSargsyan. One final thing I wanted to ask you was about the training phase. In the code, I see that there are two instances of G (Gmain and GReg) and D (Dmain and DReg) since in the default setting regularization interval is set to 4 and 16 respective;y. I am unable to understand why we have two instances of the generator and discriminator models in this case. Really appreciate your help!

from mi-gan.

AndranikSargsyan commented on June 7, 2024

There're actually no two instances of generator and discriminator (except the G_ema for generator). Gmain/Greg and Dmain/Dreg are just strings which signify the current phase of the training: i.e. Gmain/Dmain signify that there should be no regularization in the loss computation of current iteration, while Greg/Dreg signify that loss computation should include regularization component such as R1 used in the training of the discriminator of MI-GAN.

from mi-gan.

Structure of the MI-GAN inference model different from the one used for training? about mi-gan HOT 7 CLOSED

Comments (7)

Related Issues (11)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs