Light

duvenaud / deep-limits Goto Github PK

View Code? Open in Web Editor NEW

69.0 5.0 37.0 535.99 MB

Repo for a paper about constructing priors on very deep models.

License: MIT License

TeX 77.30% MATLAB 9.92% HTML 10.54% PostScript 2.24%

deep-limits's Introduction

Avoiding Pathologies in Very Deep Networks

Experiment source code and latex source for http://arxiv.org/pdf/1402.5836.pdf

Abstract:

Choosing appropriate architectures and regularization strategies for deep networks is crucial to good predictive performance. To shed light on this problem, we analyze the analogous problem of constructing useful priors on compositions of functions. Specifically, we study the deep Gaussian process, a type of infinitely-wide, deep neural network. We show that in standard architectures, the representational capacity of the network tends to capture fewer degrees of freedom as the number of layers increases, retaining only a single degree of freedom in the limit. We propose an alternate network architecture which does not suffer from this pathology. We also examine deep covariance functions, obtained by composing infinitely many feature transforms. Lastly, we characterize the class of models obtained by performing dropout on Gaussian processes.

This paper appeared in the 2014 Artificial Intelligence and Statistics conference, held in Reykjavik, Iceland.

Authors: David Duvenaud, Oren Rippel, Ryan P. Adams, and Zoubin Ghahramani

Feel free to email me with any questions at ([email protected]).

deep-limits's People

Contributors

Stargazers

Watchers

deep-limits's Issues

A mistake in your paper

A little mistake in your thesis. In Equation(5.25), the sign before (x-x') term should be plus. Also the Equation(5.11), I get the mean value 1/2*(2*log(sigma/omega)-log(2)-gamma), It is the same as yours except for the 1/2 term in the front. If it is true, the variance should be pi^2/8. Would you like to check your answer? I can send my code to you and see if it goes wrong somewhere. I would be happy to discuss with you in any details.

What's the relationship between your definition of deep gaussian process and original one?

Hi, I read the paper <> written by Andreas C. Damianou and Neil D. Lawrence. And I am a little confused about the relationship and difference between you two. Personally, I think your definition is more apt to understand. Can you say something about your understanding ? Does Damianou's deep GP also has pathology you mentioned in your paper?

Question for your paper

Hi, I am reading your Phd thesis recently. I am a little confused about the Equation 5.11 in the chapter 5. Can you give me any clue or references to deduce the equation ? Thank you very much.

Question for the distribution of singular values

Hi, I run your code in the file deep_singular_value_dist.m. I have a question about the variable scale in the file. It turn out this variable is so important here. If it is set to be 1, the result of the figure 1 and figure 2 do not show any obvious difference. Why does it happen? What's the function of variable scale ? Can you clarify it to me? I do not find a clue in your PhD thesis. Thank you very much!

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs