maximevandegar / rl-adventure-2 Goto Github PK

View Code? Open in Web Editor NEW

PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay

Jupyter Notebook 98.92% Python 1.08%

rl-adventure-2's Introduction

RL-Adventure-2: Policy Gradients

PyTorch tutorial of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay

The deep reinforcement learning community has made several improvements to the policy gradient algorithms. This tutorial presents latest extensions in the following order:

Advantage Actor Critic (A2C)

High-Dimensional Continuous Control Using Generalized Advantage Estimation

Proximal Policy Optimization Algorithms

Sample Efficient Actor-Critic with Experience Replay

Continuous control with deep reinforcement learning

Addressing Function Approximation Error in Actor-Critic Methods

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

Generative Adversarial Imitation Learning

Hindsight Experience Replay

If you get stuck…

Remember you are not stuck unless you have spent more than a week on a single algorithm. It is perfectly normal if you do not have all the required knowledge of mathematics and CS.
Carefully go through the paper. Try to see what is the problem the authors are solving. Understand a high-level idea of the approach, then read the code (skipping the proofs), and after go over the mathematical details and proofs.

RL Algorithms

Deep Q Learning tutorial: DQN Adventure: from Zero to State of the Art Awesome RL libs: rlkit @vitchyr, pytorch-a2c-ppo-acktr @ikostrikov, ACER @Kaixhin

Best RL courses

Berkeley deep RL link
Deep RL Bootcamp link
David Silver's course link
Practical RL link

Recommend Projects

maximevandegar / rl-adventure-2 Goto Github PK

rl-adventure-2's Introduction

RL-Adventure-2: Policy Gradients

If you get stuck…

RL Algorithms

Best RL courses

rl-adventure-2's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs