GithubHelp home page GithubHelp logo

kyomangold / ai-gesturecontrol Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 99.49 MB

Control your computer using hand gestures with AI, using Google's MediaPipe and OAK-D Lite camera.

Python 92.67% Shell 0.01% PowerShell 0.03% Cython 2.37% C 4.44% C++ 0.32% Meson 0.01% Fortran 0.10% Forth 0.01% Objective-C 0.03%
computer-vision gesture-control hand-pose-estimation machine-learning on-device-ml stereo-vision

ai-gesturecontrol's Introduction

AI Hand-Gesture Controlled Cursor with OAK-D Lite

This system integrates advanced AI capabilities, including a machine learning algorithm running on-device, to allow users to control their computer cursor through hand and gesture tracking, leveraging Google's MediaPipe and the OAK-D Lite camera's processing.

Features

  • Control the cursor with hand gestures.
  • Different gestures for moving, clicking, and scrolling.
  • Efficient processing on OAK-D Lite's onboard chip.
  • Intuitive, natural human-computer interaction.

Installation

To install the necessary dependencies, ensure you have Python 3 and pip installed on your system. Then run the following commands:

python3 -m pip install -U pip
python3 -m pip install --extra-index-url https://artifacts.luxonis.com/artifactory/luxonis-python-snapshot-local/ depthai
pip install -r requirements.txt

Make sure your OAK-D Lite camera is connected to your computer before proceeding.

Usage

The system recognizes specific hand poses to control the cursor:

  • Move: FIVE pose (open hand, move around freely)
  • Click: FIST pose (close fist)
  • Scroll: PEACE pose (index and middle finger together, move up and down)

Running the Application

  1. To run the application, use the following command:

    python3 mouse_controller.py

    The system will start in headless mode by default, with no display output.

  2. To enable a real-time rendering of the hand tracking, use the -r flag:

    python3 mouse_controller.py -r

    This will open a new window showing the live hand tracking process.

Ensure your hand is in the view of the OAK-D Lite camera, and the system will track your hand movements, interpreting them to control the cursor accordingly.

Contributions and Acknowledgements

This project was inspired by the depthai_hand_tracker repository, and I acknowledge the groundbreaking work they have shared with the community and do not claim any rights for their work. Contributions to enhance functionality or performance are warmly welcomed. Feel free to submit issues or pull requests.

Disclaimer

This system is a demonstration of AI and computer vision capabilities with the OAK-D Lite camera. It is not intended for critical use cases. Use at your own discretion and risk.

ai-gesturecontrol's People

Contributors

kyomangold avatar

Stargazers

 avatar

Watchers

Kostas Georgiou avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.