GithubHelp home page GithubHelp logo

helmogey / drlnd-p1-banana Goto Github PK

View Code? Open in Web Editor NEW

This project forked from doctorcorral/drlnd-p1-banana

0.0 1.0 0.0 22.67 MB

Project 1 on DQN for Udacity - Deep Reinforcement Learning Nanodegree

ASP 82.08% Python 1.73% Jupyter Notebook 16.19%

drlnd-p1-banana's Introduction

Banana Navigation Project for DRLND Udacity Nanodegree


This is the first project in the Udacity Deep Reinforcement Learning Nanodegree. It requires students to develop and train a Deep Q-Network (DQN) model to collect yellow bananas in a simulator.

This project is the first to be solved in the Udacity Deep Reinforcement Learning Nanodegree. It is based in an implementation of a Deep Q-Network (DQN) model. The environment is modeled in Unity, and the task is to train an agent to collect yellow bananas (getting a reward of +1) and avoid non yellow ones (getting -1 reward) c:

Project Details

The simulation contains a single agent that navigates a large environment. At each time step, it has four actions at its disposal:

  • 0 - walk forward
  • 1 - walk backward
  • 2 - turn left
  • 3 - turn right

The state space has 37 dimensions and contains the agent's velocity, along with ray-based perception of objects around agent's forward direction. A reward of +1 is provided for collecting a yellow banana, and a reward of -1 is provided for collecting a blue banana.

The environment is considered solved when the average reward (over the last 100 episodes) is at least +13.

Getting Started

See the instrucions here to set up your environment instructions here

It also requires Unity ML-Agents, NumPy and PyTorch

Get the environment matching your OS :

Linux: click here Mac OSX: click here Windows (32-bit): click here Windows (64-bit): click here

Use full path file reference for such environment. Note that Banana.app is already included in this repo, so it can be imported with:

env = UnityEnvironment(file_name="Banana.app")

Instructions

Then run the navigation_banana.ipynb notebook using the drlnd kernel to train the DQN agent.

After trainig the model, parameters will be dumpt to checkpoint.pth and will be used by the trained agent.

drlnd-p1-banana's People

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.