GithubHelp home page GithubHelp logo

viz-aid-ai-for-visually-impaired's Introduction

AI-powered voice assistant to assist the visually impaired in their daily tasks

AI Powered Vioice Assistant for Who are Blind or Have Low-Vision

be my eyes

power to the 253 million people who are blind or have low vision globally. Image to text technology

The tool can offer suggestions for recipes based on the contents of a user's refrigerator, for example, and provide a step-by-step guide on how to make them.

Quick Read

Scan documents

Currency scanner & counter

Barcode scanning

[1st] (image

[2nd] (image

Features: Detecting 80 different categorical object types with their confidence level Priority-based detection Distance prediction & scaling algorithm Mobile phone activated camera for various items Speech-activated mode selection Multi-threading usage for asynchronous voice warnings and feedbacks

Modes: Aware mode: continuously speak out loud 3 items with the topmost priority Warn mode: only shout out warnings if any item is too close based on priority Search mode: focus on searching a particular item until it is found

Features

  • Detecting 80 different categorical object types with their confidence level
  • Priority-based detection
  • Distance prediction & scaling algorithm
  • Mobile phone activated camera for various items
  • Speech-activated mode selection
  • Multi-threading usage for asynchronous voice warnings and feedbacks
  • SSD Mobilenet COCO API Model

cocoapimodel

Modes

  • Aware mode: continuously speak out loud 3 items with the topmost priority
  • Warn mode: only shout out warnings if any item is too close based on priority
  • Search mode: focus on searching a particular item until it is found

Resources Links

Please click the link below to view our other resources regarding to this project submission.

  1. Submission Website
  2. Demonstration Video
  3. Presentation Slides

How To Run

  1. Clone this repository and make sure you have all the files downloaded

  2. Set your environment and install the required python packages

    It is recommended for you to use Anaconda, and specifically python version 3.9.0 (newer version of python has some compatibility issues related to the collections package, other older python version is untested and might have some compatibility issues as well). Then, install all the required python packages by typing:

        pip install -r requirements.txt
  3. Execute main.py or main.ipynb file

    If you want to use an external camera, you can connect via IP Webcam Pro and edit the url variable when running the model. By default, it is connected to your PC / Laptop default camera. You can also toggle the USE_SPEECH global variable to True or False. If it is True, it will use speech recognition to choose and change the mode, or else it will manually prompt you for the mode input.

viz-aid-ai-for-visually-impaired's People

Contributors

dhairyarungta avatar elvinagam avatar michac789 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.