GithubHelp home page GithubHelp logo

mayurikolhe-2003 / pbl-speech-assistance Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 4.33 MB

Speech Assistance for Blind People is a Python application that leverages the YOLO (You Only Look Once) object detection model and OpenCV to assist visually impaired individuals in navigating their surroundings. The application detects and recognizes objects in real-time and provides spoken descriptions to the user through a text-to-speech interfac

Python 100.00%

pbl-speech-assistance's Introduction

Speech Assistance for Blind People

Speech Assistance for Blind People is a Python application that leverages the YOLO (You Only Look Once) object detection model and OpenCV to assist visually impaired individuals in navigating their surroundings. The application detects and recognizes objects in real-time and provides spoken descriptions to the user through a text-to-speech interface.

Features

  • Real-time Object Detection: The application uses YOLO to detect and recognize objects in the user's environment.
  • Text-to-Speech (TTS) Integration: It provides audible descriptions of detected objects using TTS.
  • User-Friendly Interface: Designed with simplicity in mind to ensure ease of use for visually impaired individuals.
  • Customizable Voice Output: Users can configure the TTS voice and speech rate to suit their preferences.
  • Object Recognition: In addition to detection, the application recognizes and identifies objects when possible.
  • Detect Humans: Can spell the name of the person who is standing in front of you.
  • Voice Assistant: Result is assisted to person through audio output.

Steps

  1. Run the 01_face_dataset.py
  2. - Add the unique id in the terminal(for ex: 1,2,3,...).
  3. Run the 02_face_training.py
  4. - The faces generated in the dataset folder will be trained.
  5. Run 03_face_recognition.py
  6. - add your name in the list (names = [none,"Shreyas"]) depending on the number of faces trained in the model.
    - run the file, following output will be displayed on the screen.

Output Screen

Animated GIF-downsized_large

pbl-speech-assistance's People

Contributors

blackbolt2003 avatar mayurikolhe-2003 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.