GithubHelp home page GithubHelp logo

vimos's Introduction

Vimos Engine


2023 SWM SeoulDynamics



Contents

  1. ◈ Overview
  2. ◈ Get started
  3. ◈ Results
  4. ◈ Features
  5. ◈ Usages



Overview

Vimos Engine is a solution that provides an API to easily use the pose estimation and 3D mocap technologies required for developing AI services.


Motivation

Existing pose estimation frameworks were primarily developed for research purposes, making them complex and difficult to customize for production. The data was also provided in matrix form, requiring developers to manually index articulation points, and the logic for real-time processing had to be implemented by hand, which made it difficult to achieve user-friendliness and real-world production adoption.

Purpose

Focusing on frequently utilized tasks beyond model implementation, near real-time processing without a separate implementation, we planned the Vimos engine with the goal of eliminating complex interfaces and making it intuitive.



Get started

Installation

You can easily install this solution by referring install.md.



Results

Pose Estimation Samples

3D Mocap Samples

Runtime Analysis

The Vimos engine uses multithreaded IO to minimize bottlenecks that can occur when loading a large number of files when using file I/O. In this case, loading 100,000 (192, 192) RGB images takes about 33% of the time compared to loading them sequentially, and the effect of multithreading is proportional to the number of images being loaded.



Features

The Vimos engine is organized into five sub-modules. Each module functions sequentially to process data, describing the steps typically followed when implementing features that utilize pose estimatoin and 3D mocap.

Module Description
Container Abstracts the data input/output interface to make it easier and simpler for users to fetch and store data.
Editor Provides a scalable abstraction of the pipeline for preprocessing images for input to the model.
Model It consists of two main parts: a Pose model for extracting postures and a Mesh model for extracting 3D human body meshes. Abstracts MMPose and FrankMocap to make them easier to use.
Modifier Scalable abstraction of the pipeline that post-processes the output from the models, either a pose skeleton or a 3D mesh.
Task Provide basic tasks that are redundantly used by multiple services in the form of classes so that users can use them without having to develop their own algorithms.

For further details, check the API docs.



Usages

It's an example application that utilizes this engine: Ping Pong Coach. It is a service that helps users practice their posture based on 3D vision technology. As you practice different stances, such as forehand and backhand, the app gives you feedback by measuring the similarity of your pose based on your camera shot.

vimos's People

Contributors

illuminoplanet avatar willi19 avatar ygr4789 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.