GithubHelp home page GithubHelp logo

wongkwan / simple-effective-3dpose-baseline Goto Github PK

View Code? Open in Web Editor NEW

This project forked from lck1201/simple-effective-3dpose-baseline

0.0 0.0 0.0 30.76 MB

My own Gluon reimplement of "A simple yet effective baseline for 3d human pose estimation"

License: MIT License

Python 100.00%

simple-effective-3dpose-baseline's Introduction

A simple yet effective baseline for 3D human pose estimation

My own Gluon reimplement of A simple yet effective baseline for 3D human pose estimation
Here is the original implementation

Todo:

  • Provide trained model
  • Provide results on 2D pose estimates as input

Enviroments

python 3.7
mxnet-cu90 1.4.0
CUDA 9.0

Dependency

pip install pyyaml
pip install scipy
pip install matplotlib
pip install easydict

Dataset

  1. Baidu Disk (code: kfsm) or Google Drive to download the HM3.6M annotation
  2. Unzip data under data folder, and organize like this
${PROJECT_ROOT}
    `--data
        `--annot
            `--s_01_act_02_subact_01_ca_01
            `--s_01_act_02_subact_01_ca_02
            `-- ......
            `-- ......
            `-- ......
            `--s_11_act_16_subact_02_ca_04            

How-to-use

You can download my trained model from Google Drive, which MPJPE is 44.9mm.

usage: train.py/test.py [-h] --gpu GPU --root ROOT --dataset DATASET [--model MODEL]
                        [--debug DEBUG]

optional arguments:
  -h, --help         show this help message and exit
  --gpu GPU          GPUs to use, e.g. 0,1,2,3
  --root ROOT        /path/to/project/root/
  --dataset DATASET  /path/to/your/dataset/root/
  --model MODEL      /path/to/your/model/, to specify only when test
  --debug DEBUG      debug mode

Train: python train.py --root /project-root --gpu /gpu-to-use

Test: python test.py --root /project-root --gpu /gpu-to-use --model /model-path

PS: You can modify default configurations in config.py. Because it's a quite simple system, not many hyperparameters need to be tuned.

Results

Since I don't have 2D pose estimate results on HM3.6M, I just experiment with 2D ground truth as input. My best result is 44.9mm(no augment is used), slightly better than 45.5mm reported by paper.

Method Avg Direct Discuss Eating Greet Phone Photo Pose Purch Sitting SittingD Smoke Wait WaitD Walk WalkT
My Result 44.9 36.8 43.5 40.5 43.0 46.2 54.7 40.0 43.6 52.9 59.7 44.2 44.5 45.0 34.6 37.3
Paper 45.5 37.7 44.4 40.3 42.1 48.2 54.9 44.4 42.1 54.6 58.0 45.1 46.4 47.6 36.4 40.4

Figure1

Figure2

Figure3

simple-effective-3dpose-baseline's People

Contributors

lck1201 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.