
invariantriskminimization's People

Contributors

ecreager, lopezpaz

invariantriskminimization's Issues

Non-causal errors for scrambled samples

Hi,

I tried to reproduce Fig. 4 from the paper, but the non-causal panels for the scrambled cases (FOS, FES, POS, PES) come out empty. The reason is that the non-causal variables are identified by finding zero weights here, and after scrambling those zero weights no longer exist. I think the errors should be determined by first applying the scrambling to the model weights. Since the figure in the paper does contain these plots, could you please update the code you used? Thank you!
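
For illustration, here is a minimal sketch of the fix this issue seems to propose (an editorial assumption, not the authors' code): if the inputs were scrambled as x' = S x with an orthogonal matrix S, then a linear solution w_hat learned on x' acts on the original variables as S^T w_hat, so the zero-weight test should run on the unscrambled weights.

import torch

def unscramble(w_hat, scramble):
    # w_hat was learned on scrambled inputs x' = scramble @ x, so
    # w_hat . x' = (scramble.T @ w_hat) . x: the weights expressed in
    # the original variable space are scramble.T @ w_hat.
    return scramble.t() @ w_hat

# Zero-weight test for non-causal variables, run after unscrambling:
# noncausal = unscramble(w_hat, S).abs() < tol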

Different implementation in Colored MNIST

Hi author,

Thanks for your wonderful work. I am confused about the implementation of the squared gradient norm penalty. In the paper, you compute it from two random minibatches, X^{e,i} and X^{e,j}. However, the Colored MNIST code seems to compute it from a single minibatch, i.e., with X^{e,i} and X^{e,i}. Do these two implementations work the same way?
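
For reference, a minimal sketch of the single-minibatch variant the question refers to, assuming a PyTorch setup like the Colored MNIST code (this is a paraphrase, not verbatim repo code):

import torch
from torch import autograd
import torch.nn.functional as F

def penalty(logits, y):
    # The gradient w.r.t. a dummy scale of 1.0 is taken once on the full
    # minibatch and squared, rather than multiplying gradients from two
    # independent minibatches as in the paper's unbiased estimator.
    scale = torch.tensor(1.0, requires_grad=True)
    loss = F.binary_cross_entropy_with_logits(logits * scale, y)
    g = autograd.grad(loss, [scale], create_graph=True)[0]
    return torch.sum(g ** 2)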

Best regards,
Qing LIAN

Some questions about IRM

Hi lopezpaz, IRM is fantastic work, and I am reading the paper and trying to reproduce it, but I have run into some questions.

  1. Regarding the Colored MNIST experiment: the paper says it divides MNIST into three environments and colors the images red or green. Does this mean the test environment contains digits with the same colors as in the training set? For example, if the training set contains green 9s, does the test set contain green 9s as well? (See the sketch after this list.)

  2. Can IRM handle multi-class classification problems? If so, should I replace the binary loss with a cross-entropy loss?
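
Regarding question 1, here is a paraphrase of the paper's Colored MNIST recipe (a sketch using the probabilities stated in the paper, not verbatim repo code):

import torch

def make_environment(images, labels, e):
    # Binarize the digit label, flip it with probability 0.25, then pick
    # a color that disagrees with the noisy label with probability e.
    # Training environments use e = 0.1 and e = 0.2; the test environment
    # uses e = 0.9, so the color/label correlation reverses at test time.
    def flip(p, n):
        return (torch.rand(n) < p).float()
    y = (labels < 5).float()
    y = (y - flip(0.25, len(y))).abs()             # label noise
    color = (y - flip(e, len(y))).abs()            # color tracks noisy label
    images = torch.stack([images, images], dim=1)  # one channel per color
    images[torch.arange(len(images)), (1 - color).long()] *= 0
    return images.float() / 255.0, y

Because the coloring is stochastic, every digit appears in both colors in every environment (so yes, green 9s occur in both training and test); what changes across environments is only the color/label correlation.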

I am looking forward to your reply! Thanks.

Question about the batch implementation of the IRM loss

Hi,

Thanks for the great work! I am trying to reproduce some results and have a question regarding the batch implementation of the IRM loss. In Section 3.2 and Appendix D, you suggest the following:

from torch.autograd import grad

def compute_penalty(losses, dummy_w):
    # Independent half-batches: the product of their gradients is an
    # unbiased estimate of the squared gradient norm.
    g1 = grad(losses[0::2].mean(), dummy_w, create_graph=True)[0]
    g2 = grad(losses[1::2].mean(), dummy_w, create_graph=True)[0]
    return (g1 * g2).sum()

I am wondering whether we can instead do the following:

def compute_penalty(losses, dummy_w):
    # Single gradient over the whole batch, then squared.
    g = grad(losses.mean(), dummy_w, create_graph=True)[0]
    return (g ** 2).sum()

You mention that the former is an "unbiased estimate of the squared gradient norm", but I am not sure why that is the case. If you could provide some explanation, that would be great.
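
As an editorial aside, the standard argument is that the two half-batch gradients are independent, so E[g1 * g2] = E[g1] * E[g2] = ||∇R||², whereas squaring a single minibatch gradient adds its variance: E[||g||²] = ||∇R||² + Var(g). A quick numeric check of this claim (hypothetical toy data, not from the repo):

import torch

torch.manual_seed(0)
g = 2.0 + torch.randn(100000)  # noisy per-example "gradients" around 2.0

# Average independent halves in minibatches of 10 examples each.
g1 = g[0::2].view(-1, 10).mean(dim=1)
g2 = g[1::2].view(-1, 10).mean(dim=1)

print((g1 * g2).mean())  # ~4.00: unbiased estimate of 2.0 ** 2
print((g1 ** 2).mean())  # ~4.10: biased upward by the minibatch variance (1/10)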

Thank you!

grayscale_model=True?

Hi, what does this parameter mean? Turning it off leads to bad test accuracy. It seems to convert the colored images into grayscale, which removes any effect of the confounder, so turning it on works well. How does IRM stack up in that case?
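
Based on the description in this thread (a sketch of the apparent behavior, not verbatim repo code), the flag appears to collapse the two color channels before the MLP, which deletes the color feature entirely:

import torch

def flatten_input(images, grayscale_model):
    # images: (batch, 2, 14, 14), one channel per color (assuming a
    # 2x-subsampled Colored MNIST input layout).
    if grayscale_model:
        # Summing the channels discards which color the digit had, so the
        # spurious color feature can no longer be used.
        return images.sum(dim=1).view(images.shape[0], 14 * 14)
    return images.view(images.shape[0], 2 * 14 * 14)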

Question about the Colored MNIST

Hi author,

Thanks for your wonderful work. But I am confused about why you "obtain the final label y by flipping ~y with probability 0.25". Without this label noise, would IRM be no better than ERM?
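
An editorial aside on why the flip matters, using the environment probabilities stated in the paper (color flipped with probability 0.1 and 0.2 in training, 0.9 at test):

# Per-environment accuracy of each feature as a predictor of the final label.
p_label_flip = 0.25
shape_acc = 1 - p_label_flip        # 0.75 in every environment
color_acc = {"train_e1": 1 - 0.1,   # 0.90
             "train_e2": 1 - 0.2,   # 0.80
             "test":     1 - 0.9}   # 0.10: correlation reversed at test time
# With the flip, color (0.90 / 0.80) beats shape (0.75) during training, so
# ERM latches onto color and collapses at test time. Without the flip, shape
# would be perfectly predictive and plain ERM would already succeed.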

Multi-class question

Do mean_accuracy and the cross-entropy loss work out of the box with a multi-class / categorical-logits MLP?
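
A minimal sketch of the swap in question (an editorial illustration, not confirmed by the authors): the penalty only needs a per-environment risk that is differentiable in a dummy scale, so torch.nn.functional.cross_entropy can stand in for the binary loss directly.

import torch
from torch import autograd
import torch.nn.functional as F

def multiclass_penalty(logits, y):
    # Same dummy-classifier trick as the binary case, with a multi-class
    # cross-entropy risk; y holds integer class indices.
    scale = torch.tensor(1.0, requires_grad=True)
    loss = F.cross_entropy(logits * scale, y)
    g = autograd.grad(loss, [scale], create_graph=True)[0]
    return (g ** 2).sum()

Accuracy in the multi-class case would similarly compare logits.argmax(dim=1) against y instead of thresholding a single logit.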
