GithubHelp home page GithubHelp logo

pondjames007 / lostintranslation Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 1.0 21.4 MB

NYU ITP 2019 Thesis. An interactive experience to see how machine interpret one thing differently from human

Home Page: https://itp.nyu.edu/thesis2019/#JamesHuang

Python 37.85% JavaScript 31.13% HTML 31.01%
python3 javascript flask machine-learning im2txt sketchrnn attngan wordvectors spacy

lostintranslation's Introduction

Lost in Translation

NYU ITP 2019 Thesis
An interactive experience to see how machine interpret one thing differently from human.

Presentation Video in ITP Thesis Week 2019

Introduction

The project has a recursive process for human and machine to interpret each other’s results. Human needs to come up with a sentence to describe an image generated by machine and the machine will do multiple machine learning translations from the description from human to a sketch and then to an image in each round of process.

Inspiration

Telephone Game

An example of multiple translations
Drawception - Picture Telephone Drawing Game

Closed Loop

A project uses machine learning to do feedback loop on images and texts.
Jake Elwes - Closed Loop

Implementation

  • Python Server with Flask
  • Javascript Client
  • Generate a sentence from an image by im2txt
  • Find word tags and get nouns by SpaCy
  • Word Vector similarity by SpaCy
  • Draw doodles by SketchRNN
  • Generate new images by AttnGan

app.py

Server code
Coordinate and process most of the data.
Use http connection to communicate with Runway and Client.

static/client.js

Client Code
Present the result and collect user input.

categories.json

A Json file that store all sketch categories

draw_strokes.py

functions to draw sketch

drawSketch.py

a test function to draw sketch

im2txt

A machine learning model that can generate a sentence based on an image.
The model is originated from models/research/im2txt. A pre-trained model is provided in Runway.

SketchRNN

A machine learning model that can generate doodle in specific categories.
The doodle data is from Quick, Draw! The Data and the model detail is from Magenta - SketchRNN.
It is downloaded from Google Cloud Platform.

AttnGan

The model is from GitHub - taoxugit/AttnGAN.
A machine learning model that can generate image from a sentence.
A pre-trained model is provided in Runway.

lostintranslation's People

Contributors

pondjames007 avatar

Watchers

 avatar

Forkers

joiike

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.