emtiyaz / vadam

Code for ICML 2018 paper on "Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam" by Khan, Nielsen, Tangkaratt, Lin, Gal, and Srivastava

Python 20.52% MATLAB 50.59% HTML 4.18% CSS 0.01% Makefile 0.25% C++ 2.99% Fortran 19.90% C 0.96% M 0.01% Jupyter Notebook 0.59%

vadam's Issues

Bayesian Optimisation - Test Set Contamination

According to Yarin Gal's DropoutUncertaintyExps repo, there was test-set contamination caused by the Bayesian optimisation of the hyperparameters:

Update (2018) We replaced the Bayesian optimisation implementation (which was used to find hypers) with a grid-search over the hypers. This is following feedback from @capybaralet who spotted test-set contamination (some train-set points, used to tune hypers which were shared across all splits, were used as test-set points in later splits). The new implementation iterates over the 20 splits, and for each train-test split it creates a new train-val split to tune hypers. These hypers are discarded between different train-test splits.

Yet the UCI regression experiments here appear to use the same splits as the DropoutUncertaintyExps repo. Was the bug fixed here, or is it still present? If it is still present, was that done at the reviewers' request?
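For reference, the per-split tuning described in the quoted update can be sketched roughly as follows. This is a minimal illustration and not code from either repository: `tune_per_split`, `fit_and_score`, the split fractions, and the random shuffling are all assumptions made for the example (the original experiments read their splits from files).

```python
import random


def tune_per_split(n_points, n_splits, grid, fit_and_score,
                   test_frac=0.1, val_frac=0.2, seed=0):
    """Tune hyperparameters separately inside every train-test split.

    fit_and_score(hyper, fit_idx, eval_idx) is a hypothetical callback that
    trains on fit_idx and returns a score (higher is better) on eval_idx.
    """
    rng = random.Random(seed)
    results = []
    for _ in range(n_splits):
        # Assumption: each split is a fresh random permutation of the data.
        idx = list(range(n_points))
        rng.shuffle(idx)
        n_test = max(1, int(test_frac * n_points))
        test_idx, trainval_idx = idx[:n_test], idx[n_test:]

        # New train-val split created inside this train-test split only.
        n_val = max(1, int(val_frac * len(trainval_idx)))
        val_idx, train_idx = trainval_idx[:n_val], trainval_idx[n_val:]

        # Hyperparameters are chosen from validation scores, never test scores.
        best = max(grid, key=lambda h: fit_and_score(h, train_idx, val_idx))

        # Refit on train + val with the chosen value, then evaluate once on test.
        test_score = fit_and_score(best, train_idx + val_idx, test_idx)
        results.append({
            "best": best,
            "test_score": test_score,
            "train_idx": set(train_idx),
            "val_idx": set(val_idx),
            "test_idx": set(test_idx),
        })
        # `best` is not carried over: the next split tunes from scratch.
    return results
```

The point of the structure is that no test point of a given split can influence the hyperparameters used for that split, because tuning only ever sees that split's own train and validation indices.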

Is Vadam specific to Reinforcement-Learning?

This should work for any model, right? Not just for RL? I'm a bit confused about why ActorNetworkEpsilon in the TensorFlow implementation contains code to optimise the policy network.

I'm trying to use it for StyleGAN-based image reconstruction, i.e. to find a distribution over low-dimensional latents z such that G(z) matches a given input image. The network is already trained and fixed; it's just the latent that I want to optimise.
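To make the setup concrete, here is a minimal sketch of a Vadam-style update applied to a latent vector rather than to network weights. It is written from a reading of the update rule in the paper, not taken from this repository, so treat every detail as an assumption: the function name `vadam_style_fit`, the hyperparameter values, the zero-mean Gaussian prior with precision `prior_prec`, and especially the toy linear map `W` standing in for a fixed generator G. The only thing the update needs from the model is a gradient of the reconstruction loss with respect to z, which is why nothing here is RL-specific.

```python
import math
import random


def vadam_style_fit(grad_fn, dim, n_data=1, prior_prec=1.0, lr=0.01,
                    beta1=0.9, beta2=0.999, steps=5000, seed=0):
    """Fit q(z) = N(mu, diag(sigma^2)) with an Adam-like update on perturbed z."""
    rng = random.Random(seed)
    mu = [0.0] * dim
    m = [0.0] * dim  # first-moment estimate (momentum)
    s = [0.0] * dim  # second-moment estimate, also sets the variance of q
    for t in range(1, steps + 1):
        # Standard deviation implied by the current second-moment estimate.
        sigma = [1.0 / math.sqrt(n_data * si + prior_prec) for si in s]
        # Perturb the mean and evaluate the gradient at the perturbed point.
        z = [ui + sg * rng.gauss(0.0, 1.0) for ui, sg in zip(mu, sigma)]
        g = grad_fn(z)
        # Adam-style moments, with the prior term added to the gradient.
        m = [beta1 * mi + (1.0 - beta1) * (gi + prior_prec * ui / n_data)
             for mi, gi, ui in zip(m, g, mu)]
        s = [beta2 * si + (1.0 - beta2) * gi * gi for si, gi in zip(s, g)]
        # Bias correction, as in Adam.
        m_hat = [mi / (1.0 - beta1 ** t) for mi in m]
        s_hat = [si / (1.0 - beta2 ** t) for si in s]
        mu = [ui - lr * mh / (math.sqrt(sh) + prior_prec / n_data)
              for ui, mh, sh in zip(mu, m_hat, s_hat)]
    sigma = [1.0 / math.sqrt(n_data * si + prior_prec) for si in s]
    return mu, sigma


# Toy stand-in for a fixed, pre-trained generator: G(z) = W z (an assumption).
W = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
x = [1.0, -2.0, -1.0]  # the "image" to reconstruct


def recon_grad(z):
    """Gradient of 0.5 * ||W z - x||^2 with respect to z."""
    residual = [sum(w * zj for w, zj in zip(row, z)) - xi
                for row, xi in zip(W, x)]
    return [sum(W[i][j] * residual[i] for i in range(len(W)))
            for j in range(len(z))]


mu, sigma = vadam_style_fit(recon_grad, dim=2)
```

With a real generator, `recon_grad` would be replaced by backpropagation through G to z while G's weights stay frozen; the returned `mu` and `sigma` then describe the approximate distribution over latents.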

Also, why is the design such that the critic network is also exposed to the outside (DDPG.py)? That is also something RL-specific, right?

Thanks,
Raz

for s in range(self.num_samples):
