simple-effective-3dpose-baseline's Introduction

A simple yet effective baseline for 3D human pose estimation

My own Gluon reimplementation of "A simple yet effective baseline for 3D human pose estimation".
Here is the original implementation

Todo:

Provide trained model
Provide results on 2D pose estimates as input

Environment

python 3.7
mxnet-cu90 1.4.0
CUDA 9.0

Dependencies

pip install pyyaml
pip install scipy
pip install matplotlib
pip install easydict

Dataset

Download the HM3.6M annotations from Baidu Disk (code: kfsm) or Google Drive.
Unzip them under the data folder and organize them like this:

${PROJECT_ROOT}
    `--data
        `--annot
            `--s_01_act_02_subact_01_ca_01
            `--s_01_act_02_subact_01_ca_02
            `-- ......
            `-- ......
            `-- ......
            `--s_11_act_16_subact_02_ca_04
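
A quick way to check that the unzipped data matches the layout above is to parse the folder names. The sketch below is not part of this repository: the helper names `parse_annot_dir` and `list_sequences` are hypothetical, and the naming pattern (subject, action, subaction, camera) is only inferred from the folder names shown in the tree.

```python
import re
from pathlib import Path

# Assumed naming pattern, inferred from the folder names in the tree above:
# s_<subject>_act_<action>_subact_<subaction>_ca_<camera>
ANNOT_DIR_PATTERN = re.compile(r"^s_(\d+)_act_(\d+)_subact_(\d+)_ca_(\d+)$")


def parse_annot_dir(name):
    """Return (subject, action, subaction, camera) as ints, or None if the name does not match."""
    match = ANNOT_DIR_PATTERN.match(name)
    if match is None:
        return None
    return tuple(int(group) for group in match.groups())


def list_sequences(project_root):
    """List the parsed ids of all sequence folders under <project_root>/data/annot."""
    annot_root = Path(project_root) / "data" / "annot"
    if not annot_root.is_dir():
        raise FileNotFoundError(f"expected annotation folder at {annot_root}")
    parsed = (parse_annot_dir(p.name) for p in sorted(annot_root.iterdir()) if p.is_dir())
    # Folders that do not follow the assumed pattern are skipped silently.
    return [seq for seq in parsed if seq is not None]
```

Calling `list_sequences("/path/to/project/root")` raises `FileNotFoundError` if `data/annot` is missing, which usually means the archive was unzipped in the wrong place.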

How-to-use

You can download my trained model from Google Drive; it achieves an MPJPE of 44.9 mm.

usage: train.py/test.py [-h] --gpu GPU --root ROOT --dataset DATASET [--model MODEL]
                        [--debug DEBUG]

optional arguments:
  -h, --help         show this help message and exit
  --gpu GPU          GPUs to use, e.g. 0,1,2,3
  --root ROOT        /path/to/project/root/
  --dataset DATASET  /path/to/your/dataset/root/
  --model MODEL      /path/to/your/model/, to specify only when test
  --debug DEBUG      debug mode

Train: python train.py --root /project-root --gpu /gpu-to-use

Test: python test.py --root /project-root --gpu /gpu-to-use --model /model-path
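
For readers who want to see how such a command line fits together, here is a minimal argparse sketch that mirrors the usage text above. It is an illustration, not the actual code in train.py or test.py: the function names `build_parser` and `parse_gpu_ids` are hypothetical, and the type and default of `--debug` are guesses, since the usage text only shows that it takes a value.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented flags (a sketch, not the repo's code)."""
    parser = argparse.ArgumentParser(description="train/test CLI sketch")
    parser.add_argument("--gpu", required=True, type=str, help="GPUs to use, e.g. 0,1,2,3")
    parser.add_argument("--root", required=True, type=str, help="/path/to/project/root/")
    parser.add_argument("--dataset", required=True, type=str, help="/path/to/your/dataset/root/")
    parser.add_argument("--model", type=str, default=None,
                        help="/path/to/your/model/, to specify only when test")
    # Assumption: debug is an integer switch that defaults to off.
    parser.add_argument("--debug", type=int, default=0, help="debug mode")
    return parser


def parse_gpu_ids(gpu_arg):
    """Turn a string such as '0,1,2,3' into a list of integer device ids."""
    return [int(part) for part in gpu_arg.split(",") if part.strip()]


if __name__ == "__main__":
    args = build_parser().parse_args()
    print("GPU ids:", parse_gpu_ids(args.gpu))
```

With MXNet, the resulting ids would typically be turned into contexts with `mx.gpu(i)`; that step is left out so the sketch runs without MXNet installed.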

PS: You can modify the default configuration in config.py. Because the system is quite simple, few hyperparameters need tuning.

Results

Since I don't have 2D pose estimates for HM3.6M, I only experimented with 2D ground truth as input. My best result is an MPJPE of 44.9 mm (no augmentation is used), slightly better than the 45.5 mm reported in the paper.
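
For reference, MPJPE (mean per-joint position error) is the average Euclidean distance between predicted and ground-truth joints. The sketch below shows the metric for a single pose in plain Python; it is not this repository's evaluation code, the function name `mpjpe` is hypothetical, and any protocol details such as root-joint alignment or averaging over frames are assumed to happen outside it.

```python
import math


def mpjpe(predicted, target):
    """Mean per-joint position error for one pose.

    Both arguments are sequences of (x, y, z) joint coordinates, in the same
    unit (millimetres for the numbers quoted above) and the same joint order.
    """
    if len(predicted) == 0 or len(predicted) != len(target):
        raise ValueError("poses must be non-empty and have the same number of joints")
    total = 0.0
    for (px, py, pz), (tx, ty, tz) in zip(predicted, target):
        # Euclidean distance between the predicted and ground-truth joint.
        total += math.sqrt((px - tx) ** 2 + (py - ty) ** 2 + (pz - tz) ** 2)
    return total / len(predicted)
```

With NumPy arrays of shape (joints, 3), the same quantity can be written as `np.linalg.norm(pred - gt, axis=-1).mean()`.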