frozenlake_nus's Introduction

The Froze Lake Problem and Variations

Project 1 of #ME5406 Deep Learning for Robotics# @NUS @MIT License

For more information, check the report.

MC 4x4

Trail (max=1000)	1	2	3	4	5	6	7	8	9	10
First Reach	247	7	172	/	2	/	/	287	113	12
Optimal Policy	524	11	240	/	4	/	/	738	194	45

SARSA 4x4

Trail (max=100)	1	2	3	4	5	6	7	8	9	10
First Reach	6	4	11	9	17	9	8	7	8	9
Optimal Policy	33	20	23	17	65	16	54	29	17	13

Q-learning 4x4

Trail (max=100)	1	2	3	4	5	6	7	8	9	10
First Reach	12	15	12	7	6	8	11	20	7	2
Optimal Policy	19	26	24	20	14	14	15	30	15	14

SARSA 10x10

Trail (max=2000)	1	2	3	4	5	6	7	8	9	10
First Reach	274	266	379	236	227	188	144	207	173	284
First Optimal Policy	360	274	394	667	295	551	208	415	257	690
Steady Optimal Policy	883	436	611	952	/	1020	818	1048	759	1383

Q-learning 10x10

Trail (max=2000)	1	2	3	4	5	6	7	8	9	10
First Reach	632	207	691	750	228	620	735	937	660	1004
First Optimal Policy	961	773	928	1001	741	1034	859	1328	963	1407
Steady Optimal Policy	961	773	928	1001	741	1060	868	1337	963	1407

SARSA with/without penalty 10x10

Items	Without penalty	With penalty
Average step length	69.2	17.4
Maximum step length	2080.7	135.6
Time spent	4.32s	1.52s

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.

wyzh98 / frozenlake_nus Goto Github PK