GithubHelp home page GithubHelp logo

fanfannothing / awesome-hyperparams Goto Github PK

View Code? Open in Web Editor NEW

This project forked from pemami4911/awesome-hyperparams

0.0 3.0 0.0 3 KB

A curated list of awesome hyperparameters for deep learning

awesome-hyperparams's Introduction

Awesome Hyperparams

Contributors

Please provide citations (e.g., arxiv link, blog post, github repo, etc). Any info on the hyperparameter search process taken in the original work (if available) is a bonus. Please use scientific "e" notation (10e5 instead of 1000000).

Example contribution:

Original DQN

hyperparam name default value
lr 25e-5
RMSprop momentum 0.95
RMSprop epsilon 0.01
discount factor 0.99
epsilon(-greedy) 1 annealed to 0.1 over 1 million frames
minibatch size 32
replay memory size 10e5
weight init Xavier (Torch default)

Computer Vision

DCGAN

hyperparam name default value
ADAM lr 2e-4
ADAM momentum beta1 0.5
minibatch size 64 or 128
image scaling [-1, 1]
LeakyReLU slope 0.2
Real labels (label smoothing) 1 -> [0.7, 1.2]
Fake labels (label smoothing 0 -> [0.0, 0.3]
Weight init N(0, 0.02)
Z distribution n-dim uniform or gaussian (e.g., uniform (-0.2, 0.2) from this implementation

For Z, sampling from a uniform distribution is simpler, but see the discussion here about interpolation in the latent space; current recommendation is to use a spherical Z and interpolate via a great circle

Natural Language Processing

Deep Reinforcement Learning

Deep Deterministic Policy Gradient

In the paper, the actor and critic learning rates are reversed. However, to help stabilize the actor network during training, you generally want to encourage the critic network to converge faster; hence the larger initial lr for the critic is suggested here.

hyperparam name default value
actor lr 10e-4
critic lr 10e-3
critic L2 weight decay 10e-2
discount factor 0.99
target network update tau 10e-4
Ornstein-Uhlenbeck theta 0.15
Ornstein-Uhlenbeck sigma 0.3
minibatch size 64 on low-dim input, 16 on pixel-input
replay memory size 10e5
weight init final layer of actor & critic are uniform(-3 * 10-3, 3 * 10-3) for low-dim input and uniform(-3 * 10-4, 3 * 10-4) for pixel-input; other layers -> Xavier

A3C

hyperparam name default value
discount factor 0.99
shared RMSprop eta 7e-4
shared RMSprop alpha 0.99
shared RMSprop epsilon 0.1
A3C entropy regularization beta 0.01
V-network gradients multiplied by 0.5
Weight init Xavier (Torch default)
Reward clipping [-1, 1] on Atari
# of threads w/ best performance 16

General

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.