GithubHelp home page GithubHelp logo

rl's Introduction

Algos

See: https://spinningup.openai.com/en/latest/algorithms/sac.html

  • Rainbow start. DQN = Q(s,a) + e-greedy + experience-replay(1 million) + RMSProp + target-network(copy)/online-network

    • Double Q-learning: q value with target-network is evaluated at the action from online-network
    • Prioritized replay: samples with prob relative to last absolute TD error
    • Dueling Network: Q(s,a) = V(s) + Advantage(s,a)
    • Multi-step learning: Q(s_t, a) is updated from Q(s_[t+n], a)
    • Distributional RL: Use a distribution P(s,a) instead of Q(s,a), and update with D_KL.
    • Noisy Network: Add noise terms, anneal over time.
  • VPG: on-policy, PolicyFunction + ValueFunction(s,a)

  • TRPO: on-policy, additional policy update step-size constraint terms

  • PPO: same idea as TRPO, better loss term

  • DDPG: off-policy, only works with continuous actions, learns two fu nctions Q(s,a)+u(s). Policy is learnt by gradient-ascent on Q(s,mu(a)),

  • TD3 (Twin Delayed DDPG): Add 3 tricks to DDPG

  • SAC: off-policy, only works with continuous actions (but can be altered for discrete case). Learns Q(s,a) and pi(s). Q(s,a) has additional entropy(policy) term.

Impls

  • sac-basic.py: replay buffer, w/o target networks
  • sac.py: sac-basic + target-networks

rl's People

Contributors

elbaro avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.