GithubHelp home page GithubHelp logo

romaniakovlev / reinforcement-learning Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dennybritz/reinforcement-learning

0.0 2.0 0.0 5.1 MB

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Home Page: http://www.wildml.com/2016/10/learning-reinforcement-learning/

License: MIT License

Jupyter Notebook 97.05% Python 2.95%

reinforcement-learning's Introduction

Overview

This repository provides code, exercises and solutions for popular Reinforcement Learning algorithms. These are meant to serve as a learning tool to complement the theoretical materials from

Each folder in corresponds to one or more chapters of the above textbook and/or course. In addition to exercises and solution, each folder also contains a list of learning goals, a brief concept summary, and links to the relevant readings.

All code is written in Python 3 and uses RL environments from OpenAI Gym. Advanced techniques use Tensorflow for neural network implementations.

Table of Contents

List of Implemented Algorithms

  • [Dynamic Programming Policy Evaluation](DP/Policy Evaluation Solution.ipynb)
  • [Dynamic Programming Policy Iteration](DP/Policy Iteration Solution.ipynb)
  • [Dynamic Programming Value Iteration](DP/Value Iteration Solution.ipynb)
  • [Monte Carlo Prediction](MC/MC Prediction Solution.ipynb)
  • [Monte Carlo Control with Epsilon-Greedy Policies](MC/MC Control with Epsilon-Greedy Policies Solution.ipynb)
  • [Monte Carlo Off-Policy Control with Importance Sampling](MC/Off-Policy MC Control with Weighted Importance Sampling Solution.ipynb)
  • [SARSA (On Policy TD Learning)](TD/SARSA Solution.ipynb)
  • [Q-Learning (Off Policy TD Learning)](TD/Q-Learning Solution.ipynb)
  • [Q-Learning with Linear Function Approximation](FA/Q-Learning with Value Function Approximation Solution.ipynb)
  • [Deep Q-Learning for Atari Games](DQN/Deep Q Learning Solution.ipynb)
  • [Double Deep-Q Learning for Atari Games](DQN/Double DQN Solution.ipynb)
  • Deep Q-Learning with Prioritized Experience Replay (WIP)
  • [Policy Gradient: REINFORCE with Baseline](PolicyGradient/CliffWalk REINFORCE with Baseline Solution.ipynb)
  • [Policy Gradient: Actor Critic with Baseline](PolicyGradient/CliffWalk Actor Critic Solution.ipynb)
  • [Policy Gradient: Actor Critic with Baseline for Continuous Action Spaces](PolicyGradient/Continuous MountainCar Actor Critic Solution.ipynb)
  • Deterministic Policy Gradients for Continuous Action Spaces (WIP)
  • Deep Deterministic Policy Gradients (DDPG) (WIP)
  • Asynchronous Advantage Actor Critic (A3C)

Resources

Textbooks:

Classes:

Talks/Tutorials:

Other Projects:

Selected Papers:

reinforcement-learning's People

Contributors

absolutelynowarranty avatar ajschumacher avatar alvarosg avatar andytwigg avatar bfontaine avatar bruinbear avatar david1309 avatar dennybritz avatar ei-grad avatar fengyuleidian0615 avatar j-min avatar kismuz avatar rianrajagede avatar rockingdingo avatar ronaldseoh avatar shadowen avatar shivamvats avatar sstarzycki avatar tigerneil avatar yenchenlin avatar zuzoovn avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.