julian-wyatt / dlrl-coursework Goto Github PK


Durham University Year 3 Deep Learning and Reinforcement Learning coursework: a generative model that produces images of a pegasus, and a reinforcement learning agent that plays Atari's Gravitar.

Jupyter Notebook 99.26% Python 0.74%

dlrl-coursework's Introduction

DLRL Coursework

Deep Learning

For the Deep Learning sub-module, we were tasked with building a generative model that uses the CIFAR10 (32x32) and STL10 (96x96) image datasets to generate pictures of a pegasus.

Abstract from my report:

This paper proposes using a GLO model to generate images that look like a Pegasus. This model uses a basic generator architecture along with the Laplacian loss function, an informative loss equation which helps at a range of image resolutions. Then, we map each image to a point on the n-dimensional spherical latent space, acting as our probability distribution. Using this spherical distribution, we examine the latent space between horses and birds with the intention of generating a smooth interpolation of the two images.

To view the full paper please visit: https://julianwyatt.co.uk/DLRL
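As a rough illustration of the spherical latent space mentioned in the abstract, the sketch below projects a latent vector onto the unit sphere, which is how GLO-style models commonly keep each image's latent code constrained. It is not taken from the coursework notebooks: the function name `project_to_sphere`, the latent dimension of 8, and the random seed are all assumptions chosen for the example.

```python
import math
import random


def project_to_sphere(z):
    """Rescale a latent vector so that it has unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in z))
    if norm == 0.0:
        raise ValueError("cannot project the zero vector onto the sphere")
    return [x / norm for x in z]


# Hypothetical setup: one 8-dimensional latent code for a single image.
random.seed(0)
latent_dim = 8
z = [random.gauss(0.0, 1.0) for _ in range(latent_dim)]

# In a GLO-style training loop this projection would typically be
# re-applied after every gradient update of the latent code.
z_unit = project_to_sphere(z)
print(sum(x * x for x in z_unit))
```

Keeping every latent on the same sphere means that all codes share one scale, so only their direction distinguishes one image from another.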

Comments:

I quickly learned, however, that this model could not produce high-quality samples because the space between image latents was not meaningful. For example, compare the high-quality reconstructions with the low-quality interpolations below:

Image	Info
	Best Image Reconstruction
	Best Image Interpolation
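For readers unfamiliar with interpolating on a sphere, here is a minimal sketch of spherical linear interpolation (slerp) between two latent codes. It is one common way to walk between two points on a spherical latent space and is an assumption on my part for illustration; the report may have used a different interpolation scheme. The names `normalize` and `slerp` and the toy 2-dimensional vectors are hypothetical.

```python
import math


def normalize(v):
    """Return v rescaled to unit Euclidean norm."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize the zero vector")
    return [x / norm for x in v]


def slerp(a, b, t):
    """Interpolate along the great circle from a (t=0) to b (t=1)."""
    a, b = normalize(a), normalize(b)
    # Clamp the dot product so rounding error cannot push acos out of range.
    dot = max(-1.0, min(1.0, sum(x * y for x, y in zip(a, b))))
    omega = math.acos(dot)
    sin_omega = math.sin(omega)
    if sin_omega < 1e-8:
        if dot > 0.0:
            return list(a)  # the two points coincide
        raise ValueError("antipodal points have no unique great circle")
    wa = math.sin((1.0 - t) * omega) / sin_omega
    wb = math.sin(t * omega) / sin_omega
    return [wa * x + wb * y for x, y in zip(a, b)]


# Toy stand-ins for a "horse" latent and a "bird" latent.
horse = [1.0, 0.0]
bird = [0.0, 1.0]
midpoint = slerp(horse, bird, 0.5)
print(midpoint)
```

Every interpolated point stays on the sphere, but that alone does not guarantee that the generator produces a meaningful image there, which is consistent with the low-quality interpolations noted above.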

Reinforcement Learning

For the Reinforcement Learning sub-module, we were tasked with developing an RL agent to play Atari Gravitar. Gravitar is difficult even for humans: the player must navigate under gravity, visit planets, and collect fuel, all while shooting other spaceships. The sparse, long-term reward structure makes the game notoriously hard for agents to learn. I therefore looked to sequential models such as LSTMs, as used in recent papers such as R2D2. Please see https://www.youtube.com/embed/oTNwsHRPOeM for my most intuitive gameplay sequence.
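To make the sparse, long-term reward problem concrete, the sketch below computes the discounted return of an episode in which the only reward arrives at the final step. The episode length of 100, the single reward of 1.0, and the discount factor of 0.99 are illustrative assumptions, not values from the coursework agent.

```python
def discounted_return(rewards, gamma):
    """Sum of rewards weighted by gamma**t, computed from the last step back."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


# Hypothetical sparse episode: 99 steps with no reward, then one reward.
sparse_rewards = [0.0] * 99 + [1.0]
gamma = 0.99

value_at_start = discounted_return(sparse_rewards, gamma)
print(value_at_start)
```

Seen from the first step, the reward is scaled by `gamma ** 99`, so the learning signal for early actions is weak and delayed. That is one reason recurrent agents that carry information across many steps are attractive for games like this.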
