GithubHelp home page GithubHelp logo

davidbasswwu / auto-image-caption-for-web-using-machine-learning Goto Github PK

View Code? Open in Web Editor NEW

This project forked from mikahayk/auto-image-caption-for-web-using-machine-learning

1.0 1.0 0.0 34.22 MB

Chrome extension that uses machine learning to fix missing Alt Texts on the images

JavaScript 100.00%

auto-image-caption-for-web-using-machine-learning's Introduction

Auto Image Caption for Web

A Chrome Extension that uses machine learning to auto caption images and fix missing Alt Texts

a11y-extension-cover

DESCRIPTION

Digital accessibility ensures that websites, web apps, and digital content can be used by people with a diverse range of hearing, movement, sight or cognitive abilities. One way to promote digital accessibility is by using alt text (alternative text), which provides a text alternative to non-text content in web pages including images, media, etc. Alt text can be challenging to audit, edit and/or update in existing websites. This Chrome extension will automate this process by using machine learning and image detection. IM2TXT captioning is the model used in this project.

IM2TXT Model

The image encoder is a deep convolutional neural network. This type of network is widely used for image tasks and is currently state-of-the-art for object recognition and detection. Our particular choice of network is the Inception v3 image recognition model pretrained on the ILSVRC-2012-CLS image classification dataset. The decoder is a long short-term memory (LSTM) network. This type of network is commonly used for sequence modeling tasks such as language modeling and machine translation. In the Show and Tell model, the LSTM network is trained as a language model conditioned on the image encoding.

INSPIRATION

REFERENCES

AUDIENCE

  • People who utilize a screenreader to access alt text
  • People who need update alt text retroactively to comply with digital accessibility standards

NEXT STEPS

  • Make it a WP plugin
  • Generate images based on labels
  • Use ML to provide a better screen reader experience
  • Retain model on web semantics

INSTRUCTIONS

  1. Download this repo.
  2. Archive it into a .zip file.
  3. Go to chrome://extensions/ and enable the extension.
  4. Open any webpage.
  5. Run the extension.

by Hayk Mikayelyan, Abi Muñoz.
Thank you Yining Shi, Lauren Race, Ellen Nickels for helping us with this project.

auto-image-caption-for-web-using-machine-learning's People

Contributors

mikahayk avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.