rl-mountain-car's Introduction

rl-mountain-car

Reinforcement learning solution with the SARSA(\lambda) algorithm for the mountain-car problem.

Goal: help an under-powered car find its way up a steep hill. Since the car has not enough power to directly climb the hill, it has to learn to swing back-and-forth to gain enough momentum to reach the summit.

Implementation: 2-layer neural network with reward-modulated plasticity.

Requirements

Make sure you have the follwing installed:

Alternatively, you can simply run this command to install those dependencies:

pip install -r requirements.txt

Running the code

The best way to get started is by running from the terminal the command

python starter.py

This will trigger an interactive view of the learning trials, using default parameters. The vector field plots will show, before and after training, the direction of the most likely action at evenly-spaced points in the s = (x [m], dx/dt [m/s]) state space of the car. The vectors are overlaid on a contour plot of the total energy of the car as a function of its state. The remaining plots (one for each trial) will depict the trajectories the car took in the state space, the force directions it applied, as well as the total energy at each step of the trial.

Alternatively, one could check the jupyter notebook experiments.ipynb or the script experyments.py for example of code usage and for reproduction of the figures in the report.

Notes

Developed as a mini-project for the course CS-434 "Unsupervised and Reinforcement Learning in Neural Networks", Fall 2016, EPFL.

References

Richard S. and Barto, Andrew G. Reinforcement Learning: An Intro- duction. MIT Press, 1998. ISBN 0262193981. URL.

Recommend Projects

rodrigo-pena / rl-mountain-car Goto Github PK

rl-mountain-car's Introduction

rl-mountain-car

Requirements

Running the code

Notes

References

rl-mountain-car's People

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs