GithubHelp home page GithubHelp logo

julian-wyatt / dlrl-coursework Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 12.62 MB

Durham University Year 3 Deep Learning and Reinforcement Learning Coursework. Generative model to produce images of a pegasus. Reinforcement Learning agent to play Atari's Gravitar

Home Page: https://julianwyatt.co.uk/DLRL

Jupyter Notebook 99.26% Python 0.74%
pytorch reinforcement-learning generative-model

dlrl-coursework's Introduction

DLRL Coursework

Deep Learning

For the Deep Learning sub-module, we were tasked to build a generative model, using of the CIFAR10 (32x32) & STL10 (96x96) image datasets to develop pictures of a pegasus.

Abstract from my report:

This paper proposes using a GLO model to generate images that look like a Pegasus. This model uses a basic generator architecture along with the Laplacian loss function, an informative loss equation which helps at a range of image resolutions. Then, we map each image to a point on the n-dimensional spherical latent space, acting as our probability distribution. Using this spherical distribution, we examine the latent space between horses and birds with the intention of generating a smooth interpolation of the two images.

To view the full paper please visit: https://julianwyatt.co.uk/DLRL

Comments:

I quickly learned however that this model was not able to produce high quality samples as the space between image latents was not meaningful. For example see below the high quality reconstructions vs low quality interpolations:

Image Info
Best Image Reconstruction
Best Image Interpolation

Reinforcement Learning

For the module, we were tasked to develop an RL agent to play Atari Gravitar. The aim of Gravitar is complex, even for humans. The player must meander around gravity, visiting planets, collecting fuel, all while shooting other space ships. This makes the game notoriously difficult for agents to learn due to the sparse, long term reward structure. Therefore, I looked towards sequential models such as LSTMs from recent papers such as R2D2. Please see https://www.youtube.com/embed/oTNwsHRPOeM for my most intuitive gameplay sequence.

dlrl-coursework's People

Contributors

julian-wyatt avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.