GithubHelp home page GithubHelp logo

frozenlake_nus's Introduction

The Froze Lake Problem and Variations

Project 1 of #ME5406 Deep Learning for Robotics# @NUS @MIT License

For more information, check the report.

  • MC 4x4
Trail (max=1000) 1 2 3 4 5 6 7 8 9 10
First Reach 247 7 172 / 2 / / 287 113 12
Optimal Policy 524 11 240 / 4 / / 738 194 45
  • SARSA 4x4
Trail (max=100) 1 2 3 4 5 6 7 8 9 10
First Reach 6 4 11 9 17 9 8 7 8 9
Optimal Policy 33 20 23 17 65 16 54 29 17 13
  • Q-learning 4x4
Trail (max=100) 1 2 3 4 5 6 7 8 9 10
First Reach 12 15 12 7 6 8 11 20 7 2
Optimal Policy 19 26 24 20 14 14 15 30 15 14
  • SARSA 10x10
Trail (max=2000) 1 2 3 4 5 6 7 8 9 10
First Reach 274 266 379 236 227 188 144 207 173 284
First Optimal Policy 360 274 394 667 295 551 208 415 257 690
Steady Optimal Policy 883 436 611 952 / 1020 818 1048 759 1383
  • Q-learning 10x10
Trail (max=2000) 1 2 3 4 5 6 7 8 9 10
First Reach 632 207 691 750 228 620 735 937 660 1004
First Optimal Policy 961 773 928 1001 741 1034 859 1328 963 1407
Steady Optimal Policy 961 773 928 1001 741 1060 868 1337 963 1407
  • SARSA with/without penalty 10x10
Items Without penalty With penalty
Average step length 69.2 17.4
Maximum step length 2080.7 135.6
Time spent 4.32s 1.52s

Learning curve

Success rate

Training curve

Extracted policy and heat map

frozenlake_nus's People

Contributors

wyzh98 avatar

Stargazers

 avatar Wang Jiawen avatar  avatar  avatar Fu Yangqing avatar

Watchers

Kostas Georgiou avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.