GithubHelp home page GithubHelp logo

vpro-gids-gan-covers's Introduction

VPRO Gids GAN Covers

Generating VPRO Gids covers using Generative Adversarial Networks

For inquiries, shoot me an email or contact me on LinkedIn. Also check out this page on the VPRO Gids website with additional information.

Picture: A selection of generated VPRO Gids covers.

This repository contains information about my StyleGAN application to VPRO Gids covers. I used StyleGAN, SRFBN and Super-SloMo to generate (moving) cover art for the 14th VPRO Gids edition of 2020.

VPRO Gids

Picture: Example VPRO Gids covers from the training dataset.

VPRO Gids is the weekly television guide of Dutch broadcast station VPRO. VPRO has played a crucial part in Dutch design history with the guide being a focal point. I was asked to create an AI generated cover for the guide. This repository details the techniques I used to generate high-resolution VPRO Gids covers and latent space visualization videos.

Method

StyleGAN was used for image genration, SRFBN to create super-resolution images from the StyleGAN output and Super-SloMo to create slow motion versions of latent space walk videos.

StyleGAN is a generative model capable of generating realistic images of people, characters, and more. More information can be found here.

In this application, StyleGAN was trained on a dataset of 9.5 years of historic VPRO covers. Because of training time constraints, a StyleGAN model with an output size of 512x512 was used. Since there are hardly any constant features across covers except for the logo, the expected output is artistic rather than realistic. Here is a PyTorch implementation of StyleGAN you can use to train your own model. Create your own dataset and follow the instructions to preprocess your data and create a generative model.

Feedback Network for Image Super-Resolution (SRFBN) and some post-processing was used to create super-resolution images of the low-resolution StyleGAN output. SRFBN is an image super-resolution feedback network (SRFBN) to refine low-level representations with high-level information. SRFBN was applied to the StyleGAN output, with a magnification of x4, followed by a resizing and resampling of the output (using ImageMagick) to create images of size 285x215 mm with 300dpi.

To better visualize the generative capabilities of the StyleGAN model, a latent space walk was created to create videos that look like generated covers morphing into each other. To create a more detailed latent space visalization of the StyleGAN output in the time domain, Super-SloMo was used. Super-SloMo is an end-to-end convolutional neural network for variable-length multi-frame video interpolation, where the motion interpretation and occlusion reasoning are jointly modeled. For this project, Super-SloMo was applied to increase the number of frames by 4x. This PyTorch implementation is useful for running your own experiments.

Animated GIF Examples of latent space visualization:

Pictures: Animated GIF examples of latent space visualization of the StyleGAN model using SRFBN and Super-SloMo. Note: these are reduced in resolution and frames per second.

Video of latent space visalization:

Picture: VPRO Gids Artificial Intelligence Cover Video on Vimeo

Resources

Related material is available via the following links:

StyleGAN:

Feedback Network for Image Super-Resolution

Super-SloMo

vpro-gids-gan-covers's People

Contributors

hvkoops avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.